Implementation of Deep Mind's Paper titled: "Human Level Control through Deep Reinforcement Learning". The policy gradient algorithm was also implemented and tested on Cart Pole just for experiments.
- Python 3.6.1
- Tensorflow
- Gym
- numpy
- sklearn
- matplotlib
- For DQN python main.py When prompted to select train, play or visualize type 'train'
- For Policy Gradients python policy_gradients.py When prompted to select train or play type 'train'
- For DQN python main.py When prompted to select train, play or visualize type 'play'
- For Policy Gradients python policy_gradients.py When prompted to select train or play type 'play'
Game | Highest Score |
---|---|
Breakout | 45 |
Cart Pole | 200 |