A fighter fly-out trajectory time-series data mining demo. I use AGNES and k-means to cluster the fly-out data samples into left, straight, and right categories. This app includes a fly-out trajectory generation program developed in MATLAB and Simulink.
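
The clustering idea can be sketched roughly as follows. This is a minimal illustration and not the project's code: the constant-turn-rate trajectory generator, the net-heading-change feature, and the function names are all assumptions made for the example, and only the k-means half is shown (AGNES is omitted).

```python
import numpy as np

def make_trajectory(turn_rate, steps=50, speed=1.0):
    """Hypothetical constant-turn-rate 2-D path (stand-in for the MATLAB/Simulink generator)."""
    heading = np.cumsum(np.full(steps, turn_rate))
    step_vectors = np.stack([np.cos(heading), np.sin(heading)], axis=1) * speed
    return np.cumsum(step_vectors, axis=0)

def net_heading_change(xy):
    """Assumed feature: final heading minus initial heading, in radians."""
    d = np.diff(xy, axis=0)
    angles = np.unwrap(np.arctan2(d[:, 1], d[:, 0]))
    return angles[-1] - angles[0]

def kmeans_1d(x, k=3, iters=20):
    """Plain Lloyd's algorithm on scalars, initialised at evenly spaced quantiles."""
    centers = np.quantile(x, np.linspace(0, 1, k))
    for _ in range(iters):
        # Assign each sample to its nearest center, then move centers to cluster means.
        labels = np.argmin(np.abs(x[:, None] - centers[None, :]), axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = x[labels == j].mean()
    return labels, centers

rng = np.random.default_rng(0)
true_rates = np.repeat([-0.03, 0.0, 0.03], 10)  # clockwise, none, counter-clockwise
rates = true_rates + rng.normal(0, 0.002, size=true_rates.size)
features = np.array([net_heading_change(make_trajectory(r)) for r in rates])

labels, centers = kmeans_1d(features)
# With x to the right and y up, a negative heading change is a right turn.
rank = np.argsort(np.argsort(centers))
names = np.array(["right", "straight", "left"])[rank]
categories = names[labels]
```

Reducing each trajectory to a single scalar keeps the example short; a real time-series pipeline would likely need richer features or a sequence distance.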

keras-ddpg

A pure Keras implementation of DDPG that does not use any backend-specific (TensorFlow, Theano) API, originally branched from yanpanlu's TORCS car racing project.

keras-multiprocessing-ga3c

A multiprocessing GA3C modification branched from jaara's A3C demo (https://github.com/jaara/AI-blog/).

keras-policy-gradient

A stochastic policy gradient algorithm branched from keon's project, with fixes for softmax, one-hot encoding, and cross-entropy (CE) loss issues.
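
How softmax, one-hot encoding, and cross-entropy fit together in a policy gradient update can be sketched in NumPy. This is an illustrative sketch of the standard REINFORCE-style loss, not the project's Keras code; the function names and sample numbers are assumptions.

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax, shifted by the row max for numerical stability."""
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def pg_loss_and_grad(logits, actions, returns):
    """Return-weighted cross-entropy between one-hot actions and the softmax policy.

    Minimising this loss follows the policy gradient, since
    CE(one_hot(a), pi) = -log pi(a | s).
    """
    probs = softmax(logits)
    onehot = np.eye(logits.shape[1])[actions]        # one-hot encode the taken actions
    ce = -np.sum(onehot * np.log(probs), axis=1)     # per-sample cross-entropy
    loss = np.mean(returns * ce)
    # Analytic gradient of the loss with respect to the logits.
    grad = returns[:, None] * (probs - onehot) / len(actions)
    return loss, grad

# Hypothetical batch: two states, three discrete actions.
logits = np.array([[1.0, 2.0, 0.5], [0.1, -0.3, 0.8]])
actions = np.array([1, 2])
returns = np.array([1.5, -0.5])
loss, grad = pg_loss_and_grad(logits, actions, returns)
```

A positive return pushes probability toward the taken action and a negative return pushes it away, which is why the sign and scale of `returns` matter more than the cross-entropy term itself.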

learn2stop

Adaptive action stop learning.

ma-gym

A collection of multi-agent environments based on OpenAI Gym.

maa2c

MAA2C for solving OpenAI's water world.

maca

marl-zoo

A multi-agent reinforcement learning SDK, inherited and significantly modified from https://github.com/Khrylx/PyTorch-RL

minimaxq-learning

Applying the minimax-Q learning algorithm to two-agent games.

monte-carlo-tree-search

Monte Carlo tree search in Python.
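
The selection and backpropagation steps of Monte Carlo tree search can be sketched as below. This is a generic UCT (UCB1) sketch under a single-agent assumption, not the project's implementation; the `Node` class and the function names are hypothetical.

```python
import math
from dataclasses import dataclass, field

@dataclass
class Node:
    """Hypothetical search-tree node holding visit statistics."""
    visits: int = 0
    total_value: float = 0.0
    children: dict = field(default_factory=dict)  # action -> Node

def uct_score(parent, child, c=math.sqrt(2)):
    """UCB1 score: mean value plus an exploration bonus; unvisited children go first."""
    if child.visits == 0:
        return math.inf
    exploit = child.total_value / child.visits
    explore = c * math.sqrt(math.log(parent.visits) / child.visits)
    return exploit + explore

def select_child(parent, c=math.sqrt(2)):
    """Return the action whose child has the highest UCT score."""
    return max(parent.children.items(),
               key=lambda kv: uct_score(parent, kv[1], c))[0]

def backpropagate(path, reward):
    """Add one rollout's reward to every node on the visited path."""
    for node in path:
        node.visits += 1
        node.total_value += reward

# Action "a" has the better mean, but "b" is under-explored.
root = Node(visits=10, children={
    "a": Node(visits=8, total_value=6.0),
    "b": Node(visits=2, total_value=1.0),
})
explore_choice = select_child(root)        # exploration bonus favours "b"
greedy_choice = select_child(root, c=0.0)  # pure exploitation favours "a"
backpropagate([root, root.children["b"]], reward=1.0)
```

For two-player games the reward is usually negated at alternating depths during backpropagation; that detail is left out here.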

pytorch-a2c-ppo

Batched A2C and PPO algorithms inherited from https://github.com/Khrylx/PyTorch-RL

pytorch-a2clstm-drqn

Using recurrent networks (LSTM) to solve POMDPs.

pytorch-a3c

A3C algorithm originally inherited from https://github.com/MorvanZhou/pytorch-A3C, with the divergence problem fixed and a plotting mechanism for training feature data added.

pytorch-cluster

A simple PyTorch cluster.

pytorch-dppo

My DPPO, originally inherited from https://github.com/alexis-jacq/Pytorch-DPPO

pytorch-rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

transformer-encoder

A ready-to-use transformer encoder for MARL, inherited from https://github.com/jadore801120/attention-is-all-you-need-pytorch. I made some important improvements for MARL usage.

uav-gym

A test bed for UAV swarm mission planning algorithms.

who_is_mr_right_network_for_marl

A bunch of trial results on different MARL algorithms and network structures.


haiyinpiao's Projects
