You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the DP directory there is a Policy Iteration.ipynb. It contains function policy_improvement() which returns optimal policy and its value function. In the book this algorithm is called "Policy Iteration" (see p.80), while policy improvement is just a 3rd step inside of it.
The text was updated successfully, but these errors were encountered:
In the
DP
directory there is aPolicy Iteration.ipynb
. It contains functionpolicy_improvement()
which returns optimal policy and its value function. In the book this algorithm is called "Policy Iteration" (see p.80), while policy improvement is just a 3rd step inside of it.The text was updated successfully, but these errors were encountered: