Comments (1)
Hi @AceMcAwesome77,
The error message you're encountering, "cls_logits is NaN or Inf.", is letting you know that at some point in your training, the cls_logits tensor contains a Not a Number (NaN) or Infinity (Inf).
This can occur due to various reasons. This could come from a learning rate that's too high, instabilities in your numerical operations, uninitialized variables, or it could also be a problem with the specific data you're inputting into the model. It's a sign that the model is diverging, and gradients are getting out of control, which can also stem from exploding or vanishing gradients.
You indeed can attempt to mitigate this issue using gradient clipping, which can help ensure gradients never exceed a certain threshold. However, applying gradient clipping doesn't guarantee to resolve the root cause of the problem.
I would recommend looking at your training process more holistically. Inspect the learning rate, look for possible issues in the data, try normalizing the inputs, or use different weight initialization techniques.
Hope it helps, thanks.
from tutorials.
Related Issues (20)
- Unrecognized arguments local-rank in multi-gpu train in self_supervised_pretraining
- unrecognized arguments `local-rank` in "brats_training_ddp.py"
- FileNotFoundError in "acceleration/TensorRT_inference_acceleration.ipynb"
- KeyError in "reconstruction/MRI_reconstruction/unet_demo/inference.ipynb"
- swin_unetr_btcv_segmentation_3d: pre trained model download link broken HOT 4
- Link for Installation of MONAI Generative Models gives 404 error
- AutoRunner demo needs to set auto_scale_allowed to False HOT 1
- Incorporating ONNX Support into Brain Tumor Segmentation Example HOT 1
- Decollate_batch() should be used with LoadImaged or LoadImage(image_only= "False") with dictionary_based input HOT 1
- module 'cv2.dnn' has no attribute 'DictValue'
- ImportError: libGL.so.1: cannot open shared object file: No such file or directory
- Kernel hangs in "TCIA_PROSTATEx_Prostate_MRI_Anatomy_Model.ipynb" HOT 11
- nbclient.exceptions.DeadKernelError: Kernel died in ./modules/3d_image_transforms.ipynb HOT 7
- Issue with multi-GPU support in Auto3DSeg on Windows HOT 1
- the argument needed to change the default directory in pathology/tumor_detection/README.MD
- please upload more famous diffusion model about image to image,thanks HOT 1
- Certificate verify failed when downloading OASIS data
- monailabel tutorials contain broken and outdated links to Orthanc HOT 2
- IndexError in modules/resample_benchmark.ipynb HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tutorials.