Giter Site home page Giter Site logo

rasoolfa / explorationconsciousrl Goto Github PK

View Code? Open in Web Editor NEW

This project forked from shanlior/explorationconsciousrl

0.0 2.0 0.0 5.29 MB

An Alpha-DDQN and Sigma-DDPG implementation based on OpenAI Baselines

License: MIT License

Python 99.99% Shell 0.01%

explorationconsciousrl's Introduction

This is a partial baselines package, containing an added implementation of the Expected and Surrogate Sigma-DDPG Algorithms, and the Expected and Surrogate alpha-DQN algorithms, presented in the paper:

"Exploration Conscious Reinforcement Learning Revisited", Shani, L., Efroni, Y., & Mannor, S. (2018). https://arxiv.org/abs/1812.05551

Running the Algorithms

alpha-DQN

Running the alpha-DQN for Atari is made by running the following line:

python baselines/deepq/experiment/run_atari.py

  • The alpha flag turns on the alpha-DQN.
  • The surrogate flag turns on the surrogate version.
  • The expected flag turns on the expected version.

sigma-DDPG

Running the sigma-DDPG for Mujoco is made by running the following line:

python baselines/ddpg/main.py

  • The sigma flag turns on sigma-DDPG.

  • The surrogate flag turns on the surrogate version.

  • The expected flag turns on the expected version.

    For the expected version:

    • The sigma_num_samples and grad_num_samples flags determine the number of samples used to approximate the expected target and the expected gradient, respectively.

Installation

For installation, first follow the normal baselines installation.

Notice that this is an old baselines implementation, tested with tensorflow-gpu-1.6.0 and CUDA 9.0.

Baselines

OpenAI Baselines is a set of high-quality implementations of reinforcement learning algorithms.

These algorithms will make it easier for the research community to replicate, refine, and identify new ideas, and will create good baselines to build research on top of. Our DQN implementation and its variants are roughly on par with the scores in published papers. We expect they will be used as a base around which new ideas can be added, and as a tool for comparing a new approach against existing ones.

You can install it by typing:

git clone https://github.com/shanlior/ExplorationConsciousRL.git
cd baselines
pip install -e .

To cite this repository in publications:

@misc{baselines,
  author = {Dhariwal, Prafulla and Hesse, Christopher and Klimov, Oleg and Nichol, Alex and Plappert, Matthias and Radford, Alec and Schulman, John and Sidor, Szymon and Wu, Yuhuai},
  title = {OpenAI Baselines},
  year = {2017},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/openai/baselines}},
}

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.