Giter Site home page Giter Site logo

60_days_rl_challenge's Introduction


I designed this Challenge for you and me: Learn Deep Reinforcement Learning in depth in 60 days!!

You heard about the amazing results achieved by Deepmind with AlphaGo Zero and by OpenAI in Dota 2! Don't you want to know how they work? This is the right opportunity for you and me to finally learn Deep RL and use it on new exciting projects.

The ultimate aim is to use these general-purpose technologies and apply them to all sorts of important real world problems. Demis Hassabis


This repository wants to guide you through the Deep Reinforcement Learning algorithms, from the most basic ones to the highly advanced AlphaGo Zero. You will find the main topics organized by week and the resources suggested to learn them. Also, every week I will provide practical examples implemented in python to help you better digest the theory. You are highly encouraged to modify and play with them!


This is my first project of this kind, so please, if you have any idea, suggestion or improvement contact me at [email protected].

During the whole challenge, I will update continuously this repository.. so stay tuned! #60DaysRLChallenge

Follow me Twitter Follow GitHub followers

Projects (Yet to decide)

  • Q-learning
  • DQN
  • AC2
  • ES
  • AlphaGo Zero

Week 1 - Introduction

Week 2 - RL Basics: MDP, Dynamic Programming and Model-Free Control

Those who cannot rember the part are condomned to repeat it - George Santayana

This week, we will learn about the basic blocks of reinforcement learning, starting from the definition of the problem all the way through the estimation and optimization of the functions that are used to express the quality of a policy or state.


Theoretical material

  • Markov Decision Process - RL by David Silver

    Formalizing RL problem using MDP

    • Markov Processes
    • Markov Decision Processes
  • Planning by Dynamic Programming - RL by David Silver

    How to solve known MDP

    • Policy iteration
    • Value iteration
  • Model-Free Prediction - RL by David Silver

    Estimate the value function of unknown MDP

    • Monte Carlo Learning
    • Temporal Difference Learning
    • TD(ฮป)
  • Model-Free Control - RL by David Silver

    Optimise the value function of an unknown MDP

    • ฦ-greedy policy iteration
    • GLIE Monte Carlo Search
    • SARSA
    • Importance Sampling

Project of the Week

Q-learning applied to FrozenLake. For exercise, you can solve the game using SARSA or implement Q-learning by yourself. In the former case, only few changes are needed.


To know more

Week 3 - Value Function Approximation and DQN

Week 4 - A2C and A3C

Week 5 - RL in continous space - TRPO/PPO

Week 6 - Evolution Strategies and Genetic Algorithms

Week 7 - I2A

Week 8 - AlphaGoZero + Bonus

Last 4 days - Review + sharing

Best RL papers

Best resources

๐Ÿ“บ Deep Reinforcement Learning - UC Berkeley class by Levine, check here their site.

๐Ÿ“บ Reinforcement Learning course - by David Silver, DeepMind. Great introductory lectures by Silver, a lead researcher on AlphaGo. They follow the book Reinforcement Learning by Sutton & Barto.

๐Ÿ““ Reinforcement Learning: An Introduction - by Sutton & Barto. The "Bible" of reinforcement learning. Here you can find the PDF draft of the second version.

Additional resources

๐Ÿ“š Awesome Reinforcement Learning. A curated list of resources dedicated to reinforcement learning

60_days_rl_challenge's People

Contributors

andri27-ts avatar cjauvin avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.