Giter Site home page Giter Site logo

Comments (5)

archcra avatar archcra commented on August 28, 2024

I encountered this error only when set MCTS_SIMS = 1000. Does that mean, when in such case, I should change maxQU's initial value more smaller, such as -999999 ( one more 9)?

from deepreinforcementlearning.

LeeTaeHoon97 avatar LeeTaeHoon97 commented on August 28, 2024

hi archcra, have you checked your vh_loss or ph_loss?
in my case,
when i face this error, my vh_loss and ph_loss values show NaN,
so, I changed Nan values to 0

i wish you solved this problem

from deepreinforcementlearning.

archcra avatar archcra commented on August 28, 2024

from deepreinforcementlearning.

tian-jie avatar tian-jie commented on August 28, 2024

hi archcra, have you checked your vh_loss or ph_loss? in my case, when i face this error, my vh_loss and ph_loss values show NaN, so, I changed Nan values to 0

i wish you solved this problem

Can you explain more, I don't find vh_loss or ph_loss in code. I am also facing this issue now.
I tried to comment out if clause, but now it doesn't predict any more, it shows:

MCTS perceived value for O: nan
NN perceived value for O: nan

from deepreinforcementlearning.

LeeTaeHoon97 avatar LeeTaeHoon97 commented on August 28, 2024

hi archcra, have you checked your vh_loss or ph_loss? in my case, when i face this error, my vh_loss and ph_loss values show NaN, so, I changed Nan values to 0
i wish you solved this problem

Can you explain more, I don't find vh_loss or ph_loss in code. I am also facing this issue now. I tried to comment out if clause, but now it doesn't predict any more, it shows:

MCTS perceived value for O: nan NN perceived value for O: nan

Here is the training content I wrote down as I don't remember the details of a project I attempted a long time ago. Please understand that this translation is done using GPT.

Case1 - When initially training, errors occurred during the training process. Unsure where the problem originated, I printed out the cost values and observed that the pH values diverged in the negative direction, eventually resulting in NaN values. It appeared to be a gradient exploding problem. While I knew that conventional CNNs typically use techniques like dropout to address this, I was unsure how to handle it in a self-learning scenario. Through searching online, I found that inserting code to replace NaN values with 0 could sometimes resolve the issue. Subsequently, the code proceeded without errors. However, the loss function appeared to fluctuate erratically in the output (loss could decrease to negative values, but NaN values caused it to be converted to 0, resulting in erratic behavior). Further investigation revealed that the problem occurred during the backward pass. It turned out that the issue stemmed from a custom loss function implemented in Keras. Switching to a PyTorch loss function resolved the problem, and training commenced smoothly.

I hope it helped you.

from deepreinforcementlearning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.