pytorch-rl's Introduction

Reinforcement Learning Models in PyTorch

Description

This repo contans PyTorch implementations of reinforcement learning models for personal skill development.

Background

REINFORCE is a policy gradient method that calculates the policy gradient at the end of every episode and updates the agents parameters accordingly.

Deep Deterministic Policy Gradient (DDPG) is an off-policy, actor-critic policy gradient method. Similar to Deep Q-Learning, a target and current model are used for the actor (policy) and critic (value) functions, and the target model is gradually updated. DDPG also utilizes an experience replay buffer. Losses are computed from the temporal difference error signal.

Dependencies

Results

DDPG

Reward per episode on HalfCheetah-v1

Visualization of learned policy on HalfCheetah-v1

Useful References

pytorch-rl's People

Contributors

Stargazers

Watchers

pytorch-rl's Issues

How to record a full screen?

I can only record 500x500 video, only can see a few black and white grids.
Another question is how to automatically track agent when recording.

Thanks.

Recommend Projects

dxyang / pytorch-rl Goto Github PK

pytorch-rl's Introduction

Reinforcement Learning Models in PyTorch

Description

Background

Dependencies

Results

DDPG

Useful References

pytorch-rl's People

Contributors

Stargazers

Watchers

pytorch-rl's Issues

How to record a full screen?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent