This is a PyTorch implementation of Exploration by Random Network Distillation paper.
CUDA_VISIBLE_DEVICES=0
python train.py
python enjoy.py
tensorboard --logdir runs
Open in browser http://localhost:6006
Random Network Distillation(RND) algo in Pytorch
License: MIT License
This is a PyTorch implementation of Exploration by Random Network Distillation paper.
CUDA_VISIBLE_DEVICES=0
python train.py
python enjoy.py
tensorboard --logdir runs
Open in browser http://localhost:6006
Hello,
This is really nice pytorch edition. I want to do some research based on ur content. But when I run train.py, I met this problem. Do u have any ideas? Thank you so much!
GPU training: False
Sticky actions: True
WARNING:root:This caffe2 python run does not have GPU support. Will run in CPU only mode.
ERROR: Couldn't open /anaconda3/envs/RND/lib/python3.6/site-packages/atari_py/atari_roms/montezuma_revenge.bin ...
in train.py file, you give epoch 3:
def train_model(args, device, output_size, model, rnd, optimizer, s_batch, target_ext_batch, target_int_batch, y_batch, adv_batch, next_obs_batch, old_action_probs):
epoch = 3
but in argument.py:
parser.add_argument('--epoch', type=int, default=4,
help='number of epochs (default: 4)')
so, which parameter do we use?
Thank you!
Hi, I'm getting a frustrating error on windows:
File "train.py", line 226, in main
for parent_conn, action in zip(parent_conns, actions):
TypeError: zip argument #2 must support iteration
Num Workers is set to 1, everythin else is unchanged.
Hello wisdom,
Thank you for your pretty neat code. We learnt a lot from it. One question we are not sure how long will model get out of room 7? We stayed at room 7 for about 1 days, current global step is 6M, rewards still keep at 2500. We found from 400 points to 2500 points is pretty fast. But at 2500, it did not make any progress.
You should change requirements and readme to show tensorboardX==1.6 is required since 1.7 seems to be bugged (read another issue on different repo saying it's broken) and train.py will not run with it
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.