mauicv / evo-quad Goto Github PK
View Code? Open in Web Editor NEWREINFORCE-ES applied to quadruped environment
REINFORCE-ES applied to quadruped environment
Want the quadruped to be able to balance on multiple slopes and regain balance after being knocked.
Current the network recieves absolute x, y position data from the environment which we should replace with rate of change of x, y.
currently we have and Env
class and then a StandingEnv
and WalkingEnv
class that derive from it. I think this is correct, but should be reviewed. The behavour we want is that given some behavour we want the quadruped to learn it should implemented as a EnvBehavour class. It might be possible to ensure this is the only code that needs to be touched and the training functionality is common across all training tasks.
If we're training mutliple behavours how do we organise mutlple env settings and config?
Currently models output single stances but nothing like gates. I think part of this might be that the network weight initalizations mean that all the inputs are all mapped to the same output. A similar issue was present in the TF models for ddpg however which means it's something to do with how the environment differs from openai gym enviornments. Note that It's quite likely that openai gym environments scale state.
Once we've trained solutions to multple tasks how do we then combine those solutions for more general problems.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.