Pacman AI, Part III


This is the third part of the Pacman AI project. In this part of the project, I implemented value iteration agent, a Q-learning reinforcement learning agent, and an approximate Q agent.

Value Iteration Agent

The value iteration agent an offline planner. In the initial planning phase, we set the number of value iterations it should run. It takes an MDP on construction and runs value iteration for a specified number of iterations before the constructor returns.

Screen Shot 2017-10-23 at 7.33.54 PM.png

Q-learning Reinforcement Learning Agent

Screen Shot 2017-10-23 at 7.37.10 PM.png

Screen Shot 2017-10-23 at 7.38.28 PM.png

Approximate Q Agent

Screen Shot 2017-10-23 at 7.39.48 PM.png


This is the end of Pacman AI, Part III.

Readers of the post should not copy any of my code for their own course assignment, but feel free to be inspired and come up with your own ones.

For this project, we should all follow the same algorithms of value iteration, computing action based on value, computing Q value etc. The reinforcement learning agent is the most AI-like agent I have done so far in this project because we don’t interfere with how they make specific decisions.


Leave a Reply

Please log in using one of these methods to post your comment: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

Create a website or blog at

Up ↑

%d bloggers like this: