Comments (2)
Because gradient accumulation will cause a certain degree of performance degradation, it is recommended to use multiple GPU parallel training to increase the batch size.
As for the hyperparameters mentioned in supplimentray, it is because the optimized parameters forgot to update later.
Finally, the file ’utils_sample.py‘ is uploaded now. Thanks for your issues.
from docunet.
Thank you! I will try to use multiple GPU then
from docunet.
Related Issues (20)
- No train_bio.py HOT 7
- ERROR: No matching distribution found for transformers==3.0.4 HOT 5
- 给的shell脚本用windows改过吗?(Resolved) HOT 4
- 对预测矩阵的疑问 HOT 3
- Did someone try run on multi-gpu? Got an error in the multi-gpu setting. HOT 1
- The BC5CDR dataset result HOT 2
- 关于context-base strategy的疑问
- ModuleNotFoundError: No module named 'overrides' HOT 4
- TypeError: ElementWiseMatrixAttention.forward: `matrix_1` is not present. HOT 1
- More recent trained DocRED Weights? HOT 4
- 您好,请问随机种子固定后,复现后每次结果都不相同是什么原因?怎么才能在随机种子相同时固定复现结果? HOT 3
- 数据集咋下载 HOT 1
- 结果不一致 HOT 3
- I am unable to obtain the results you presented. HOT 2
- the result of roberta-large HOT 3
- Question about Roberta-large results HOT 4
- 关于分割区域的一点疑问 HOT 2
- 关于损失函数的一些疑问 HOT 3
- 复现实验结果一直达不到论文的结果,怎么搞 HOT 2
- 数据集 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from docunet.