policy_improvement() should be renamed to policy_iteration() #202

link2xt · 2019-06-23T20:38:27Z

The DP directory contains Policy Iteration.ipynb, which defines a function policy_improvement() that returns the optimal policy and its value function. In the book this algorithm is called "Policy Iteration" (see p. 80); policy improvement is only the third step inside it.
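To illustrate the naming, here is a minimal sketch of how the three steps could be split up. It is not the notebook's code. It assumes the MDP is given as `P[s][a]`, a list of `(prob, next_state, reward, done)` tuples; the helper `q_values`, the toy chain `P_toy`, and the defaults `gamma=0.9` and `theta=1e-8` are hypothetical choices made for this example.

```python
def q_values(s, V, P, gamma):
    """One-step lookahead: expected return of each action in state s under V."""
    return [
        sum(p * (r + gamma * V[s2] * (not done)) for p, s2, r, done in P[s][a])
        for a in range(len(P[s]))
    ]


def policy_evaluation(policy, V, P, gamma, theta=1e-8):
    """Step 2: sweep until V (updated in place) is stable for the fixed policy."""
    while True:
        delta = 0.0
        for s in range(len(P)):
            v = q_values(s, V, P, gamma)[policy[s]]
            delta = max(delta, abs(v - V[s]))
            V[s] = v
        if delta < theta:
            return


def policy_improvement(policy, V, P, gamma):
    """Step 3: make the policy greedy w.r.t. V; return True if nothing changed."""
    stable = True
    for s in range(len(P)):
        q = q_values(s, V, P, gamma)
        best = max(range(len(q)), key=q.__getitem__)
        # Switch only on a strict improvement, so ties cannot flip forever.
        if q[best] > q[policy[s]] + 1e-12:
            policy[s] = best
            stable = False
    return stable


def policy_iteration(P, gamma=0.9):
    """Full algorithm: alternate evaluation and improvement until stable."""
    policy = [0] * len(P)  # step 1: arbitrary initial policy
    V = [0.0] * len(P)     # step 1: arbitrary initial values
    while True:
        policy_evaluation(policy, V, P, gamma)
        if policy_improvement(policy, V, P, gamma):
            return policy, V


# Hypothetical 3-state chain: action 0 moves left, action 1 moves right.
# Entering terminal state 2 from state 1 yields reward 1.
P_toy = [
    [[(1.0, 0, 0.0, False)], [(1.0, 1, 0.0, False)]],
    [[(1.0, 0, 0.0, False)], [(1.0, 2, 1.0, True)]],
    [[(1.0, 2, 0.0, True)], [(1.0, 2, 0.0, True)]],
]

policy, V = policy_iteration(P_toy)
print(policy, V)
```

With this split, policy_improvement() names only the greedy update, and the outer loop that returns the optimal policy and its value function carries the name policy_iteration().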


This was referenced on Jun 23, 2019:

Provided policy_improvement() solution is not guaranteed to terminate #203 (Open)

Provided policy_improvement() solution initializes values to zero for each iteration #204 (Open)
