Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion mdp.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -288,7 +288,7 @@
"metadata": {},
"source": [
"Now let us implement the simple MDP in the image below. States A, B have actions X, Y available in them. Their probabilities are shown just above the arrows. We start with using MDP as base class for our CustomMDP. Obviously we need to make a few changes to suit our case. We make use of a transition matrix as our transitions are not very simple.\n",
"<img src=\"files/images/mdp-a.png\">"
"<img src=\"images/mdp-a.png\">"
]
},
{
Expand Down
4 changes: 2 additions & 2 deletions probability.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -1288,7 +1288,7 @@
"\n",
"The example below where we implement the network shown in **Figure 14.3** of the book will make this more clear.\n",
"\n",
"<img src=\"files/images/bayesnet.png\">\n",
"<img src=\"images/bayesnet.png\">\n",
"\n",
"The alarm node can be made as follows: "
]
Expand Down Expand Up @@ -3275,7 +3275,7 @@
"source": [
"The function **prior_sample** implements the algorithm described in **Figure 14.13** of the book. Nodes are sampled in the topological order. The old value of the event is passed as evidence for parent values. We will use the Bayesian Network in **Figure 14.12** to try out the **prior_sample**\n",
"\n",
"<img src=\"files/images/sprinklernet.jpg\" height=\"500\" width=\"500\">\n",
"<img src=\"images/sprinklernet.jpg\" height=\"500\" width=\"500\">\n",
"\n",
"Traversing the graph in topological order is important.\n",
"There are two possible topological orderings for this particular directed acyclic graph.\n",
Expand Down
2 changes: 1 addition & 1 deletion reinforcement_learning.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -128,7 +128,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"The `sequential_decision_environment` is a GridMDP object as shown below. The rewards are **+1** and **-1** in the terminal states, and **-0.04** in the rest. <img src=\"files/images/mdp.png\"> Now we define actions and a policy similar to **Fig 21.1** in the book."
"The `sequential_decision_environment` is a GridMDP object as shown below. The rewards are **+1** and **-1** in the terminal states, and **-0.04** in the rest. <img src=\"images/mdp.png\"> Now we define actions and a policy similar to **Fig 21.1** in the book."
]
},
{
Expand Down
Loading