Giter Site home page Giter Site logo

adaptive_stopping_mc_rl's Introduction

This repository can be used to reproduce all the experiments in the paper AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents.

Structure of the repository

  • Folder toy_examples_xp contains code for Section 6.1 "Toy examples"
  • Folder Non_adaptive_comparison_xp contains code for Section 6.2 "Comparison with non-adaptive approach"
  • Folder Deep_RL_xp contains code for Section 6.3 "AdaStop for Deep Reinforcement Learning"

We provide a requirements.txt file which contains the dependencies and their versions at the time of the article.

What is reproducible?

We aim at having a completely reproducible paper. All the scripts for all the experiments are available and using the makefile all the figures and results of the paper can be reproduced. One can use make all (on linux) to do the figures from the article with a reduced size of the experiments so that it runs in a reasonable amount of time, using a virtual environment that is constructed in tmp folder. FOr more courageous people, you can use make ARG=--full-xp all which does the exact same figures as the ones in the article.

Remark: due to its complexity, the deep rl experiments from Section 6.3. are not automatically recomputed, we only recompute the adastop algorithm and not the actual training of the agents.

Using Guix

For (almost) perfect reproducibility, you can use guix with the following commands to fix the versions of the shared libraries (e.g. C libraries)

guix time-machine --channels=channels.scm --  shell -CF -m manifest.scm
export LD_LIBRARY_PATH=/lib
make all 

adaptive_stopping_mc_rl's People

Contributors

timotheemathieu avatar aleshi94 avatar kohlerhector avatar mmcenta avatar

Stargazers

 avatar

Watchers

Odalric-Ambrym Maillard avatar Riccardo Della Vecchia avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.