reinforcement-learning value-iteration

How to solve reinforcement learning grid world examples using value iteration?


I can only find either theory or Python examples, and neither is satisfactory for a beginner. I just need a simple example that walks through the iterations step by step. Could anyone please show me the 1st and 2nd iterations of value iteration for the image I have uploaded? [Image: Grid world problem]


Solution

  • I recommend this PDF: http://www.cis.upenn.edu/~cis519/fall2015/lectures/14_ReinforcementLearning.pdf, which explains the grid world problem very clearly. There is also code on GitHub:

    https://github.com/kevlar1818/grid-world-rl

    https://github.com/dennybritz/reinforcement-learning/blob/master/DP/Policy%20Evaluation%20Solution.ipynb

    Hope these help.
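
Since the question asks specifically for the 1st and 2nd iterations, here is a minimal sketch of them in Python. The uploaded image is not reproduced here, so everything about the grid is an assumption: a 3x4 grid with a wall at (1, 1), a +1 terminal at (0, 3), a -1 terminal at (1, 3), deterministic moves, a discount factor of 0.9, and a step reward of 0. All names (`step`, `sweep`, `value_iteration`, `show`) are made up for this illustration. Each sweep applies the update V_new(s) = max over actions of [reward + gamma * V_old(next state)] to every non-terminal state.

```python
# Assumed layout (NOT the asker's image): a 3x4 grid world.
# Row 0 is the top row; (1, 1) is a wall; (0, 3) and (1, 3) are terminal.
ROWS, COLS = 3, 4
WALL = (1, 1)
TERMINALS = {(0, 3): 1.0, (1, 3): -1.0}  # reward received on entering the cell
GAMMA = 0.9         # assumed discount factor
STEP_REWARD = 0.0   # assumed reward for every non-terminal move
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Deterministic move; hitting the wall or the border leaves the agent in place."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < ROWS and 0 <= c < COLS) or (r, c) == WALL:
        return state
    return (r, c)


def sweep(values):
    """One synchronous Bellman optimality backup over all non-terminal states."""
    new_values = dict(values)
    for state in values:
        if state in TERMINALS:
            continue  # terminal values stay 0; their reward is paid on entry
        candidates = []
        for action in ACTIONS:
            nxt = step(state, action)
            reward = TERMINALS.get(nxt, STEP_REWARD)
            candidates.append(reward + GAMMA * values[nxt])
        new_values[state] = max(candidates)
    return new_values


def value_iteration(n_sweeps):
    """Start from all zeros and return the value table after each sweep."""
    values = {(r, c): 0.0
              for r in range(ROWS) for c in range(COLS) if (r, c) != WALL}
    history = []
    for _ in range(n_sweeps):
        values = sweep(values)
        history.append(values)
    return history


def show(values):
    """Print the grid row by row; the wall cell is shown as nan."""
    for r in range(ROWS):
        print(" ".join(f"{values.get((r, c), float('nan')):6.2f}"
                       for c in range(COLS)))


for k, table in enumerate(value_iteration(2), start=1):
    print(f"Iteration {k}")
    show(table)
```

Under these assumptions, after the 1st iteration only the cell directly left of the +1 terminal, (0, 2), has a non-zero value (1.0), because it is the only cell whose best move collects a reward when every other value is still 0. After the 2nd iteration its neighbours (0, 1) and (1, 2) become 0.9 (that is, 0 + 0.9 * 1.0), and the values keep spreading outward one step per sweep. With stochastic moves or a different reward layout, as in the linked PDF, the numbers change but the procedure is the same.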