
haiyinpiao's Projects

aca_arena

A hyper-real-time flight simulation project.

airgym

AirSim integrated with OpenAI Gym and keras-rl for reinforcement learning on an autonomous copter.

dqn-atari

Adds a CNN front end to achieve end-to-end DQN.

dqn-flappybird

A Flappy Bird game solver using the classic NIPS 2013 DQN algorithm.

fly-trajectory-mining

A time-series data mining demo for fighter fly-out trajectories. It uses AGNES and k-means to cluster the fly-out data samples into left, straight, and right categories. The app includes a fly-out trajectory generation program developed in MATLAB and Simulink.

keras-ddpg

A pure Keras implementation of DDPG that does not use any backend-specific (TensorFlow, Theano) API, originally branched from yanpanlu's TORCS car racing project.

keras-multiprocessing-ga3c

A multiprocessing GA3C variant, branched and modified from jaara's A3C demo (https://github.com/jaara/AI-blog/).

keras-policy-gradient

A stochastic policy gradient algorithm branched from keon's project, with fixes for softmax, one-hot encoding, and cross-entropy (CE) loss issues.

ma-gym

A collection of multi-agent environments based on OpenAI Gym.

maa2c

MAA2C for solving OpenAI's water world environment.

marl-zoo

A multi-agent reinforcement learning SDK, inherited and significantly modified from https://github.com/Khrylx/PyTorch-RL

pytorch-a2c-ppo

Batched A2C and PPO algorithms inherited from https://github.com/Khrylx/PyTorch-RL

pytorch-a3c

An A3C algorithm originally inherited from https://github.com/MorvanZhou/pytorch-A3C, with the divergence problem fixed and a mechanism added for plotting training feature data.

pytorch-dppo

My DPPO, originally inherited from https://github.com/alexis-jacq/Pytorch-DPPO

pytorch-rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

transformer-encoder

A ready-to-use transformer encoder for MARL, inherited from https://github.com/jadore801120/attention-is-all-you-need-pytorch, with some important improvements for MARL usage.

uav-gym

A test bed for UAV swarm mission planning algorithms.
