antymon / ppo_cpp
C++ version of a Deep Reinforcement Learning algorithm PPO
Hi Antymon,
Your PPO code is interesting. I am wondering whether it can be used for parallel training, where each episode is an MPI/OpenMP solver. I would like to run simulations on a cluster, with each episode being a single-node, OpenMP-based simulation. Do you have any suggestions?
Yours Sincerely
Wei
Some cherry-picking needs to be done.
Hi Antymon,
Thank you very much for the repo, which lets people export a trained agent from Python to C++.
I would like to confirm the changes I have to make in order to run my own task:
1. Make my environment inherit from the Env abstract class under env\env.hpp.
2. Modify the main ppo2.cpp, which creates an instance of an environment and passes it to PPO.
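The two steps above can be illustrated with a minimal, self-contained sketch. Everything in it is hypothetical: the `Env` class shown here is a stand-in with assumed method names (`reset`, `step`, `obs_dim`, `act_dim`), not the actual interface declared in env.hpp, and `run_episode` only imitates what a main file might do with an environment instance. The real header should be checked for the exact virtual functions to override.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical result of one environment step (not ppo_cpp's real type).
struct StepResult {
    std::vector<float> obs;
    float reward;
    bool done;
};

// Hypothetical stand-in for the abstract class in env.hpp.
// The method names and signatures are assumptions for illustration.
class Env {
public:
    virtual ~Env() = default;
    virtual std::vector<float> reset() = 0;
    virtual StepResult step(const std::vector<float>& action) = 0;
    virtual std::size_t obs_dim() const = 0;
    virtual std::size_t act_dim() const = 0;
};

// Step 1: a custom task that inherits from the abstract class.
// Toy dynamics: a 1-D point that is rewarded for staying near the origin.
class PointEnv : public Env {
public:
    std::vector<float> reset() override {
        x_ = 1.0f;
        steps_ = 0;
        return {x_};
    }

    StepResult step(const std::vector<float>& action) override {
        assert(action.size() == act_dim());
        x_ += action[0];
        ++steps_;
        const float reward = -(x_ * x_);
        const bool done = steps_ >= max_steps_;
        return {{x_}, reward, done};
    }

    std::size_t obs_dim() const override { return 1; }
    std::size_t act_dim() const override { return 1; }

private:
    float x_ = 1.0f;
    int steps_ = 0;
    static constexpr int max_steps_ = 3;
};

// Step 2: the main file creates the environment and hands it to the
// algorithm through the base-class reference. Here a fixed action
// replaces the learner, purely to show the call pattern.
float run_episode(Env& env) {
    env.reset();
    float total = 0.0f;
    bool done = false;
    while (!done) {
        const StepResult r = env.step({-0.5f});
        total += r.reward;
        done = r.done;
    }
    return total;
}
```

Because the algorithm only sees the base-class reference, swapping in a different task should, under these assumptions, require changing only the line that constructs the environment.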
I have two questions:
a. If I would like to run inference only, should I use algorithm.eval(obs) directly? It seems to use get_deterministic_action(), and I noticed there are also step(const tensorflow::Tensor& obs) and value(const tensorflow::Tensor& obs); I can't tell exactly what the differences between them are.
b. Is there a way to resume training using your implementation, so that online learning can be achieved?
Thank you again!
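On question (a), a common convention in PPO implementations is that a step-like call samples an action from the policy distribution and also returns a value estimate (used while collecting training rollouts), a value-like call returns only the value estimate, and a deterministic call returns the distribution's mean, which is what inference usually wants. Whether ppo_cpp follows this convention exactly is an assumption that should be verified against its source. The sketch below shows the distinction on a hypothetical scalar Gaussian policy; `ToyPolicy` and its members are invented for illustration and are not part of ppo_cpp.

```cpp
#include <cassert>
#include <random>

// Hypothetical scalar Gaussian policy, for illustration only.
struct ToyPolicy {
    float w = 2.0f;       // weight of the policy mean: mean = w * obs
    float v = 0.5f;       // weight of the value head: value = v * obs
    float stddev = 0.1f;  // exploration noise used when sampling
    std::mt19937 rng{42};

    float mean(float obs) const { return w * obs; }

    // Deterministic action: the mean of the distribution, no noise.
    // Repeated calls with the same observation give the same result.
    float deterministic_action(float obs) const { return mean(obs); }

    // Value estimate only: no action is produced.
    float value(float obs) const { return v * obs; }

    struct StepOut {
        float action;
        float value;
    };

    // Training-time step: a sampled (noisy) action plus the value estimate.
    StepOut step(float obs) {
        assert(stddev > 0.0f);
        std::normal_distribution<float> noise(0.0f, stddev);
        return {mean(obs) + noise(rng), value(obs)};
    }
};
```

Under this convention, deployment code would call only the deterministic path, while the sampled path matters during training, where exploration noise is needed.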
Due to a problem with the main dependency of the Singularity file, it has been impossible to build the project's simg file since 29th Nov. I am working to resolve this. As a workaround, if the previous parent image file was cached, it should be possible to build against that cached file.
Dear Antymon,
I am at the stage of trying to compile my_own_ppo2.cpp with my_own_env.hpp, which inherits from env.hpp. Is there an option that does not use the Singularity image, e.g. Makefile-style compilation? I compiled TensorFlow from source using Bazel and downloaded Eigen/Dense, but I still run into some dependency issues when compiling.
I would appreciate any help with this.
Thank you very much,
Wei