Hello, I am trying to train a robot pick and place using HER + DDPG and BC loss.
I have modified her.config.py to use BC loss and Q-Filter. The number of demonstrations set is 50, as the ones I recorded.
fileName = "pick_rail_norandom"
fileName += ".npz"
np.savez_compressed(fileName, acs=actions, obs=observations, info=infos)
I know the file is ok because it is the same of another work that used baselines with no problems at all. I also checked that the agent performed well while recording, looking at the environment and trajectories recorded.
mpirun -n 4 python3 -m baselines.run --alg=her --env=dVRLPick-v0 --num_timesteps=1e6 --demo_file=/home/neri/Desktop/RL4dVRK/dVRL_simulator/record_demonstration_dVRL/pick_rail_norandom.npz --save_path=~/models/pickrail_norandom --log_path=~/logs/pickrail_norandom
Traceback (most recent call last):
File "/usr/lib/python3.6/runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "/usr/lib/python3.6/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/home/neri/Desktop/Reach_Rail/baselines/baselines/run.py", line 255, in <module>
main(sys.argv)
File "/home/neri/Desktop/Reach_Rail/baselines/baselines/run.py", line 221, in main
model, env = train(args, extra_args)
File "/home/neri/Desktop/Reach_Rail/baselines/baselines/run.py", line 85, in train
**alg_kwargs
File "/home/neri/Desktop/Reach_Rail/baselines/baselines/her/her.py", line 177, in learn
policy_save_interval=policy_save_interval, demo_file=demo_file)
File "/home/neri/Desktop/Reach_Rail/baselines/baselines/her/her.py", line 35, in train
if policy.bc_loss == 1: policy.init_demo_buffer(demo_file) #initialize demo buffer if training with demonstrations
File "/home/neri/Desktop/Reach_Rail/baselines/baselines/her/ddpg.py", line 166, in init_demo_buffer
demo_data_obs = demoData['obs']
File "/home/neri/.local/lib/python3.6/site-packages/numpy/lib/npyio.py", line 255, in __getitem__
pickle_kwargs=self.pickle_kwargs)
File "/home/neri/.local/lib/python3.6/site-packages/numpy/lib/format.py", line 727, in read_array
raise ValueError("Object arrays cannot be loaded when "
ValueError: Object arrays cannot be loaded when allow_pickle=False
-------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
-------------------------------------------------------
--------------------------------------------------------------------------
mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:
Process name: [[59978,1],0]
Exit code: 1
--------------------------------------------------------------------------
Any advice would be highly appreciated.