Comments (13)
thanks for sharing the results of your experiments.
I'll like to keep the model as simple and minimalistic as possible, but I'll link to this issue from the "Improve accuracy" section such that others can benefit from your findings.
from simplehtr.
Expanding the layers a bit more, I hit a top word accuracy of 78% using layer depth/width values similar to VGG16 but with batch normalization. Based off my other hyperparameter runs, increasing model size further than that won't meaningfully impact accuracy without a resnet-like approach (obviously outside the scope of this project).
from simplehtr.
Nice article btw. My results show that even with more layers the model only overfits by about 5% (even with data augmentation off!), and accuracy takes about a 1% hit when turning data augmentation off. If anything the model insufficiently overfits (by not overfitting on train it effectively underfits on test). He et al., 2015 demonstrated that increasing the number of layers is insufficient to guarantee overfit, and I would expect their results to apply to SimpleHTR as well.
from simplehtr.
@jevinruv Let me check, it should be possible.
from simplehtr.
Oh, well done. Thank you. And I have one question. How do you print this plot above?
from simplehtr.
Tensorboard plus a bunch of hooks which aren't committed anywhere.
from simplehtr.
When increasing the model size, at some point the model is able to perfectly learn the training data without improving validation accuracy, i.e. it overfits. Therefore you could try to make the task a bit harder while training by using data augmentation. At the moment, the model is very sensitive to small translations (see this article) [1]. By adding random translations, validation accuracy should get better.
[1] However, this behaviour improved since you uploaded the new pretrained model.
from simplehtr.
@Chazzz can you kindly let me know of a rough idea of how much time it took you to train the system with your system specific details. I am planning to apply a range of image augmentation like translation, adding Gaussian noise, Random cropping and etc to make the model more robust.
from simplehtr.
Hi @RajPratim21 I trained the above on a GTX 980 Ti, and as shown in the graph in my initial post, training would take between 40 mins to 80 mins. LMK if there are other system details which are of interest.
from simplehtr.
Tensorboard plus a bunch of hooks which aren't committed anywhere.
possible to share the code for the tensorboard integration? thanks!
from simplehtr.
@jevinruv Let me check, it should be possible.
Thank you, looking forward for it!
from simplehtr.
@jevinruv https://github.com/Chazzz/SimpleHTR-experimental
from simplehtr.
@jevinruv https://github.com/Chazzz/SimpleHTR-experimental
Thank you !
from simplehtr.
Related Issues (20)
- TypeError: a bytes-like object is required, not 'NoneType' (dataloader_iam.py line 119) HOT 4
- Blank line filter in dataloader doesn't quite work HOT 1
- Deep Stream HOT 1
- How to use in ML.NET c#?
- unable to build wheel for word_beam_search HOT 1
- Where can I find the tagset.txt file HOT 3
- Add feature to save train loss in summary + minor bug fix HOT 2
- Data visualization HOT 2
- pip install error: ModuleNotFoundError: No module named 'patch_ng' HOT 1
- which version of python used?
- Add cudnn64_8.dll to the Windows/System32 folder, otherwise the program cannot run properly.
- How to convert checkpoint to ONNX HOT 2
- Outdated version of tensorflow used HOT 6
- Training Model
- Training the model from scratch and error "model not found" HOT 4
- Wrong detection of words in model validation HOT 8
- Missing CITATION.cff file for repository HOT 1
- where is json HOT 9
- Hello, I'm sorry to disturb you again. How to make a front-end webpage for this project, which only needs to be able to open locally. Could you please teach me? HOT 1
- performance evaluation of the experimental results HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from simplehtr.