Giter Site home page Giter Site logo

dqn_vs_human_gaze's Introduction

Using human gaze data for training a DQN to play Atari games

DRL Project SS22

image

This is our final project for the course 'Deep Reinforcement Learning' given in SS22 at Osnabrück University.

We used the Atari-HEAD dataset [1] to compare human attention maps and DQN saliency maps, and explore the possibility of adding human attention data to the training process. We have also experimented with making inputs to the DQN more similar to human perceptual input. Further information can be found in the project report.

Our project is divided into four parts:

  1. Training a baseline DQN to play Asterix (Atari):

    • Training scripts can be found in the folder asterix.
    • The weights of the best-performing model are also included.
  2. Creating saliency maps for the baseline DQN

    • The scripts for creating saliency maps and analysing them are in saliency.
  3. Giving gaze data to the DQN

  4. Making the learning more human like by foveating the input and focusing agent's attention on one region at a time.

In the root folder, there are the files for reading in and processing the gaze data.

  • my_reader_class.py is used to read in the data and save it in an instance of Reader to be able to access it easily. The object then also has methods to create fixation maps and gaze heatmaps to compare them with saliency maps.
  • For reading in gaze coordinates from the Atari-HEAD files, the file data_reader.py is used which was shared by the creators of the Atari-HEAD dataset and can be found in their GitHub repository.
  • The file create_heatmaps.py is used to create heatmaps for training the gaze prediction network, which is then used to augment the DQN for the third part of our project.

Lastly, the report and meeting summaries are in the report folder.

[1] Zhang, R., Walshe, C., Liu, Z., Guan, L., Muller, K. S., Whritner, J. A., Zhang, L., Hayhoe, M., & Ballard, D. (2019). Atari-HEAD: Atari Human Eye-Tracking and Demonstration Dataset (Version 4). Zenodo. https://doi.org/10.5281/zenodo.3451402

dqn_vs_human_gaze's People

Contributors

egrund avatar leonl42 avatar nataliasc avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.