Comments (2)
I noticed this as well and believe it's a significant cause of performance degradation. Additionally, you don't seem to be adding the entropy term to the objective which they mention in the paper as being useful for improving exploration.
from async-rl.
Oh interesting! I will definitely take a look at this. Thank you.
from async-rl.
Related Issues (20)
- clipping
- How to speed up training with GPU?
- ValueError: Filter must not be larger than the input: Filter: (8, 8) Input: (4, 84) HOT 3
- Intel MKL FATAL ERROR: Cannot load libmkl_avx2.so or libmkl_def.so. HOT 3
- ValueError: need more than 4 values to unpack HOT 2
- About the randomness of the performance
- FailedPreconditionError HOT 4
- null
- May I know the version of keras and tensorflow? HOT 1
- Reward doesn't go up ....
- No local network synchronization
- Tensorflow outdated HOT 1
- Attempting to use uninitialized value conv2d_1/kernel HOT 9
- RGB image HOT 4
- duplicate
- about epsilon
- OSError,why?
- t_max = 32 HOT 6
- Do results differ only because of the seed?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from async-rl.