delmaksym / deep-reinforcement-learning-solutions Goto Github PK

View Code? Open in Web Editor NEW

This project forked from udacity/deep-reinforcement-learning

0.0 0.0 0.0 6.07 MB

My solutions to the Deep Reinforcement Learning Nanodegree program

Home Page: https://www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893

License: MIT License

TeX 0.47% Jupyter Notebook 90.89% Python 8.65%

deep-reinforcement-learning-solutions's Introduction

My Solutions to the Deep Reinforcement Learning Nanodegree

This repository contains my solutions to the Labs / Projects of Udacity's Deep Reinforcement Learning Nanodegree program in addition to the default materials provided here.

I also updated the code to work with newest versions of the environmetns and OpenAI gym. I use python 3.10+ for the solutions!

Labs / Projects

My solutions to the labs and projects can be found below. All of the projects use rich simulation environments from Unity ML-Agents. In the Deep Reinforcement Learning Nanodegree program, I received reviews of my projects.

The Taxi Problem: In this lab, you will train a taxi to pick up and drop off passengers.
Navigation: In the first project, you will train an agent to collect yellow bananas while avoiding blue bananas.
Continuous Control: In the second project, you will train an robotic arm to reach target locations.
Collaboration and Competition: In the third project, you will train a pair of agents to play tennis!

Tutorials

The tutorials led me through implementing various algorithms in reinforcement learning. All of the code is in PyTorch (v0.4) and Python 3.

Resources

RL Cheatsheet: The PDF file contains key definitions, formulas and pseudocodes.

OpenAI Gym Benchmarks

Classic Control

Acrobot-v1 with Tile Coding and Q-Learning
Cartpole-v0 with Hill Climbing | solved in 13 episodes
Cartpole-v0 with REINFORCE | solved in 691 episodes
MountainCarContinuous-v0 with Cross-Entropy Method | solved in 47 iterations
MountainCar-v0 with Uniform-Grid Discretization and Q-Learning | solved in <50000 episodes
Pendulum-v0 with Deep Deterministic Policy Gradients (DDPG)

Box2d

BipedalWalker-v2 with Deep Deterministic Policy Gradients (DDPG)
CarRacing-v0 with Deep Q-Networks (DQN) | Coming soon!
LunarLander-v2 with Deep Q-Networks (DQN) | solved in 1504 episodes

Toy Text

FrozenLake-v0 with Dynamic Programming
Blackjack-v0 with Monte Carlo Methods
CliffWalking-v0 with Temporal-Difference Methods

Dependencies

To set up your python environment to run the code in this repository, follow the instructions below.

Create (and activate) a new environment with Python 3.6.

Linux or Mac:

conda create --name drlnd python=3.6
source activate drlnd

Windows:

conda create --name drlnd python=3.6 
activate drlnd

If running in Windows, ensure you have the "Build Tools for Visual Studio 2019" installed from this site. This article may also be very helpful. This was confirmed to work in Windows 10 Home.
Follow the instructions in this repository to perform a minimal install of OpenAI gym.
- Next, install the classic control environment group by following the instructions here.
- Then, install the box2d environment group by following the instructions here.

Clone the repository (if you haven't already!), and navigate to the python/ folder. Then, install several dependencies.

git clone https://github.com/deldelmax/deep-reinforcement-learning-solutions.git
cd deep-reinforcement-learning/python
pip install .
pip install gym[all]

Create an IPython kernel for the drlnd environment.

python -m ipykernel install --user --name drlnd --display-name "drlnd"

Before running code in a notebook, change the kernel to match the drlnd environment by using the drop-down Kernel menu.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.