dqn-on-space-invaders

Overview

This is a PyTorch implementation of a Deep Q-Network agent trained to play the Atari 2600 game of Space Invaders. The related paper is the following: Playing Atari with Deep Reinforcement Learning, published in 2014 by Google Deepmind.

This repository also corresponds to the source code for this post I have written on the subject.

Dependencies

Install the requirements using this command:

pip install -r requirements.txt

There is one more thing to install to have access to the Atari environment. In fact, OpenAI gym library does not support by default the Atari environment.

Linux users

Simply run the following command:

pip install atari-py

Windows users

Start by running the same command as Linux users, if you have some errors popping up then detailed instructions to install Atari environments in Windows platforms are given here.

Usage

Once dependencies are installed, you can open main.py and decide whether you want to train or test the agent. This can be done by setting the TRAIN variable to either Trueor False. Other hyper-parameters are to be specified in the same file.

If trained, the agent's weights are saved in ./train. Otherwise, videos of the agent playing are stored in ./test/.

Results

Below are the curves of the scores obtained throughout the training phase by the DQN agent as well as a random agent used as a baseline:

The DQN agent has played 100 episodes, 10000 timesteps each, and it has been able to improve its decision-making process as the training progresses. In fact, it starts by randomly selecting actions, waiting for the replay buffer to be sufficiently full to start the training. After several episodes of playing, the agent starts showing learning improvements and rather satisfactory results by the end of the training. This is due to the fact that its policy becomes progressively less random, as the update rule encourages it to exploit actions with higher rewards.

Here is a game where the agent is playing after being trained:

It has done a pretty good job overall. Nevertheless, it has to be trained more and perhaps get its policy network tuned so that it can get a higher score.

TODO

Add the possibility of hyper-parameters tuning.
TensorBoard support.
Add a run manager.

Resources

Playing Atari with Deep Reinforcement Learning

Frame Skipping and Pre-Processing for Deep Q-Networks on Atari 2600 Games

How RL agents learn to play Atari games

Human-level control through deep reinforcement learning

qarchli / dqn-on-space-invaders Goto Github PK