Zhiyuan Nan's Projects
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
Reinforcement Learning codebase for self-driving car in Carla
Code With Deep Reinforcement Learning
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
OpenDILab Auto-driving platform
DMControl Generalization Benchmark
High-speed Autonomous Drifting with Deep Reinforcement Learning
Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.
A PyTorch implementation of FQF, IQN and QR-DQN.
A PyTorch implementation of GAIL and AIRL based on PPO.
A goal-driven autonomous exploration through deep reinforcement learning (ICRA 2022) system that combines reactive and planned robot navigation in unknown environments
this repository accompanies my forthcoming book "Grokking Deep Learning"
An OpenAI gym wrapper for CARLA simulator
A minimalist environment for decision-making in autonomous driving
Integrating Deep Reinforcement Learning with Path planning for Automated Driving
Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
Library for Model Based RL
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
[ICCV 2019] Monocular depth estimation from a single image
This is the official implementation of Multi-Agent PPO (MAPPO).
References on Optimal Control, Reinforcement Learning and Motion Planning
Repository for the paper "Planning to Explore via Self-Supervised World Models"
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..