Skip to content

Commit

Permalink
resize image size
Browse files Browse the repository at this point in the history
  • Loading branch information
qiwang067 committed Nov 11, 2020
1 parent c7ebeb5 commit 9bebd20
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/chapter2/chapter2.md
Original file line number Diff line number Diff line change
Expand Up @@ -792,7 +792,7 @@ $$
* 首先来看 policy iteration。之前的例子在每个状态都是采取固定的随机策略,就每个状态都是 0.25 的概率往上往下往左往右,没有策略的改变。
* 但是我们现在想做 policy iteration,就是每个状态的策略都进行改变。Policy iteration 的过程是一个迭代过程。

![](img/2.55.png ':size=450')
![](img/2.55.png)

我们先在这个状态里面 run 一遍 policy evaluation,就得到了一个 value function,每个状态都有一个 value function。

Expand Down

0 comments on commit 9bebd20

Please sign in to comment.