Comments (5)
which example?
from examples.
:) You are too quick for me to even complete writing up my issue:
https://github.com/pytorch/examples/blob/master/word_language_model/main.py#L177
args.lr is of type float. float / int in Python is a float, right?
But my lr appears to go down to zero, and stay there after epoch 14 (for my particular settings):
| epoch 14 | 2000/ 2323 batches | lr 1.00 | ms/batch 132.84 | loss 3.93 | ppl 51.05
| epoch 14 | 2200/ 2323 batches | lr 1.00 | ms/batch 130.91 | loss 3.87 | ppl 47.75
-----------------------------------------------------------------------------------------
| end of epoch 14 | time: 326.76s | valid loss 4.84 | valid ppl 126.41
-----------------------------------------------------------------------------------------
| epoch 15 | 200/ 2323 batches | lr 0.00 | ms/batch 133.49 | loss 4.12 | ppl 61.81
| epoch 15 | 400/ 2323 batches | lr 0.00 | ms/batch 131.63 | loss 4.27 | ppl 71.50
| epoch 15 | 600/ 2323 batches | lr 0.00 | ms/batch 132.10 | loss 4.15 | ppl 63.44
| epoch 15 | 800/ 2323 batches | lr 0.00 | ms/batch 132.22 | loss 4.12 | ppl 61.75
-----------------------------------------------------------------------------------------
| end of epoch 15 | time: 315.69s | valid loss 4.84 | valid ppl 126.41
-----------------------------------------------------------------------------------------
| epoch 16 | 200/ 2323 batches | lr 0.00 | ms/batch 131.21 | loss 4.12 | ppl 61.81
| epoch 16 | 400/ 2323 batches | lr 0.00 | ms/batch 134.99 | loss 4.27 | ppl 71.50
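One possible explanation, not confirmed anywhere in this thread, is that the learning rate was held as an int and annealed with integer division, which would take it from 1 to exactly 0 as in the log above. The sketch below is only an illustration under that assumption: the `anneal` helper, the starting value of 20, and the divisor of 4 are all hypothetical, and `//` is used to mimic integer division in Python 3.

```python
def anneal(lr, factor=4, steps=3, integer_division=False):
    """Return the learning rate after each of `steps` annealing steps.

    Hypothetical helper for illustration only; the starting value and
    the factor are assumptions, not values taken from the example script.
    """
    history = []
    for _ in range(steps):
        # `//` floors the result, so an int lr eventually collapses to 0.
        # `/` on a float keeps shrinking the value without reaching 0 here.
        lr = lr // factor if integer_division else lr / factor
        history.append(lr)
    return history


print(anneal(20, integer_division=True))  # [5, 1, 0]
print(anneal(20.0))                       # [5.0, 1.25, 0.3125]
```

If that is what happens, making sure the value is a float before dividing (for example `float(lr)`) would keep it from reaching zero this way.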
fixed via c4b48c4