Giter Site home page Giter Site logo

pouncealot / reinforcement-learning Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dennybritz/reinforcement-learning

0.0 2.0 0.0 5.11 MB

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Home Page: http://www.wildml.com/2016/10/learning-reinforcement-learning/

License: MIT License

Jupyter Notebook 97.04% Python 2.96%

reinforcement-learning's Introduction

Overview

This repository provides code, exercises and solutions for popular Reinforcement Learning algorithms. These are meant to serve as a learning tool to complement the theoretical materials from

Each folder in corresponds to one or more chapters of the above textbook and/or course. In addition to exercises and solution, each folder also contains a list of learning goals, a brief concept summary, and links to the relevant readings.

All code is written in Python 3 and uses RL environments from OpenAI Gym. Advanced techniques use Tensorflow for neural network implementations.

Table of Contents

List of Implemented Algorithms

  • [Dynamic Programming Policy Evaluation](DP/Policy Evaluation Solution.ipynb)
  • [Dynamic Programming Policy Iteration](DP/Policy Iteration Solution.ipynb)
  • [Dynamic Programming Value Iteration](DP/Value Iteration Solution.ipynb)
  • [Monte Carlo Prediction](MC/MC Prediction Solution.ipynb)
  • [Monte Carlo Control with Epsilon-Greedy Policies](MC/MC Control with Epsilon-Greedy Policies Solution.ipynb)
  • [Monte Carlo Off-Policy Control with Importance Sampling](MC/Off-Policy MC Control with Weighted Importance Sampling Solution.ipynb)
  • [SARSA (On Policy TD Learning)](TD/SARSA Solution.ipynb)
  • [Q-Learning (Off Policy TD Learning)](TD/Q-Learning Solution.ipynb)
  • [Q-Learning with Linear Function Approximation](FA/Q-Learning with Value Function Approximation Solution.ipynb)
  • [Deep Q-Learning for Atari Games](DQN/Deep Q Learning Solution.ipynb)
  • [Double Deep-Q Learning for Atari Games](DQN/Double DQN Solution.ipynb)
  • Deep Q-Learning with Prioritized Experience Replay (WIP)
  • [Policy Gradient: REINFORCE with Baseline](PolicyGradient/CliffWalk REINFORCE with Baseline Solution.ipynb)
  • [Policy Gradient: Actor Critic with Baseline](PolicyGradient/CliffWalk Actor Critic Solution.ipynb)
  • [Policy Gradient: Actor Critic with Baseline for Continuous Action Spaces](PolicyGradient/Continuous MountainCar Actor Critic Solution.ipynb)
  • Deterministic Policy Gradients for Continuous Action Spaces (WIP)
  • Deep Deterministic Policy Gradients (DDPG) (WIP)
  • Asynchronous Advantage Actor Critic (A3C)

Resources

Textbooks:

Classes:

Talks/Tutorials:

Other Projects:

Selected Papers:

reinforcement-learning's People

Contributors

dennybritz avatar rockingdingo avatar praveen-palanisamy avatar j-min avatar shadowen avatar himanshusahni avatar zuzoovn avatar yenchenlin avatar rianrajagede avatar bruinbear avatar andytwigg avatar ei-grad avatar ajschumacher avatar bfontaine avatar fengyuleidian0615 avatar kismuz avatar ronaldseoh avatar shivamvats avatar tigerneil avatar absolutelynowarranty avatar alvarosg avatar david1309 avatar sstarzycki avatar

Watchers

James Cloos avatar JS avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.