huangwl18 / modular-rl Goto Github PK

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"

Home Page: https://huangwl18.github.io/modular-rl/

License: Other

Python 13.64% Jupyter Notebook 86.36%

deep-learning reinforcement-learning modularity graph-neural-networks locomotion generalization modular-control decentralized-control message-passing emergent-communication

modular-rl's People

Contributors

Stargazers

Watchers

modular-rl's Issues

Difference with and without torchfold? (Question)

What is the advantage of using torchfold for trainning?

The program can't use SubprocVecEnv

SubprocVecEnv uses multiprocessing which means utils.makeEnvWrapper will fail in

def helper():
e = gym.make("%s-v0" % env_name)
e.seed(seed)
return wrappers.ModularEnvWrapper(e, obs_max_len)

gym.make can't find the registered env. Because registry in gym.envs.registration will initialize again in subprocess.

Have you tried this method to train the standard 3d humanoid?

How many steps does it take to train a 3D model in this way? Your idea is very innovative, and I want to use this to improve my policy.

List of training/test environments

Hi @huangwl18, @pathak22! Thanks for the code release.

The paper does not specify which environments were used for training and for zero-shot evaluation. For instance, Humanoid++ has 8 environments out of which the two were used for zero-shot evaluation. Can you tell me which?

Can you, please, provide the full list for all of the environments used in the paper?

Incorrect assignment of limb_type_vec in environment modules

Hello Wenlong,
Thanks for sharing your interesting work!

I have run some experiments with your code, but I think there are some typo in _get_obs_per_limb() in environment .py files.
Specifically, for humanoid++, the limb_type_vec is assigned incorrectly, i.e. every limbs are assigned to (0, 0, 0, 0).

In humanoid xmls, the name of limb body belongs to {'torso', '(left/right) shoulder', '(left/right) thigh', '(left/right) shin', '(left/right) upper arm', '(left/right) lower arm'}, but the limb type assignment condition compares the name with {'hip', 'knee', 'shoulder', 'elbow'} which are the names of motor (joint) in your code.

I'm wondering if this is intended and would like to hear from you if this makes some difference in model performance.

Thank you

code crashed after reaching maxstep=20k

Very interesting work! Thanks for sharing the code.

I run into an issue when setting the max_timesteps=20000,

To reproduce it:

python main.py --expID 002 --td --bu --morphologies walker_7_main --max_timesteps 20000

It looks the training is finished, but an error was produced at the end:

ExpID: 2, FPS: 5.03, TotalT: 19902, EpisodeNum: 157, SampleNum: 20059, ReplayBSize: 20059
walker_7_main === EpisodeT: 98, Reward: 232.93
*** training finished and model saved to ./results/EXP_0002/model.pyth ***
Process Process-1:
Traceback (most recent call last):
  File "/opt/anaconda3/envs/modular-rl/lib/python3.6/multiprocessing/process.py", line 258, in _bootstrap
    self.run()
  File "/opt/anaconda3/envs/modular-rl/lib/python3.6/multiprocessing/process.py", line 93, in run
    self._target(*self._args, **self._kwargs)
  File "/opt/anaconda3/envs/modular-rl/lib/python3.6/site-packages/baselines/common/vec_env/subproc_vec_env.py", line 10, in worker
    cmd, data = remote.recv()
  File "/opt/anaconda3/envs/modular-rl/lib/python3.6/multiprocessing/connection.py", line 250, in recv
    buf = self._recv_bytes()
  File "/opt/anaconda3/envs/modular-rl/lib/python3.6/multiprocessing/connection.py", line 407, in _recv_bytes
    buf = self._recv(4)
  File "/opt/anaconda3/envs/modular-rl/lib/python3.6/multiprocessing/connection.py", line 383, in _recv
    raise EOFError
EOFError

huangwl18 / modular-rl Goto Github PK

modular-rl's People

Contributors

Stargazers

Watchers

Forkers

modular-rl's Issues

Difference with and without torchfold? (Question)

The program can't use SubprocVecEnv

Have you tried this method to train the standard 3d humanoid?

List of training/test environments

Incorrect assignment of limb_type_vec in environment modules

code crashed after reaching maxstep=20k

Not able to run

Multi-CPU parallel training

Why xpos[0] -= torso_x_pos

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent