sdpkjc / abcdrl Goto Github PK
View Code? Open in Web Editor NEWModular Single-file Reinfocement Learning Algorithms Library
Home Page: http://docs.abcdrl.xyz
License: Other
Modular Single-file Reinfocement Learning Algorithms Library
Home Page: http://docs.abcdrl.xyz
License: Other
ppo_atari
sac_atari
redq
drqv2_dmc
wandb/wandb#5008 completes support for gymnasium env monitoring.
Update wandb version to v0.13.11, and modify our custom logger to use the monitor provided by wandb directly. This will simplify the code and provide its robustness.
This adjustment might fix the #70 bug.
No response
No response
dqn_atari_torch
and dqn_atari_tf
, the optimize_memory_usage
option is set to false by default. It should be set to true.No response
No response
if self.size != self.buffer_size:
idxs = np.random.choice(self.size, size=batch_size, replace=True)
➡️
if self.size != self.buffer_size:
idxs = np.random.choice(self.size - 1, size=batch_size, replace=True)
With that change, we might be able to fix the problem and wait for testing.
Deprecated python-fire and use tyro.
Makes arguments management clearer.
...
No response
No response
Good PR ➡️ brentyi/tyro#50
We need to follow up on this and unify the way the bool flag is set.
benchmark.py
No response
No response
Left video on gymnasium0.27.0, Right video on gymnasium0.26.3
python abcdrl/ddpg_torch.py --track True --capture-video True
No response
No response
No response
The training stops while capturing videos and the experiment gets stuck.
This is because the video_recorder of Gymnasium is not close. Maybe this is a bug in the gym, so we can consider closing it manually for now.
Exec this cmd:
python abcdrl/dqn.py --num-envs 2 --learning-starts 8 --buffer-size 10 --batch-size 4 --total-timesteps 30 --capture-video True
output:
{'log_type': 'collect', 'sample_step': 24, 'logs': {'episodic_length': 12, 'episodic_return': 12.0}}
Moviepy - Building video /Users/zhaoyanxiao/Dev/abcdrl/videos/CartPole-v1__dqn/rl-video-episode-0.mp4.
Moviepy - Writing video /Users/zhaoyanxiao/Dev/abcdrl/videos/CartPole-v1__dqn/rl-video-episode-0.mp4
Moviepy - Done !
Moviepy - video ready /Users/zhaoyanxiao/Dev/abcdrl/videos/CartPole-v1__dqn/rl-video-episode-0.mp4
{'log_type': 'collect', 'sample_step': 26, 'logs': {'episodic_length': 13, 'episodic_return': 13.0}}
# Didn't stop
No response
No response
No response
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.