Comments (9)
Hi,
yes, that's correct. The reason is not the number of words, but the size of the input image.
The input image is downsized to 128x32px, so if it contains a long sentence you won't be able to read the text on it anymore.
However, groups of short words should work fine, e.g. "I go home".
For more information see section 2 of this article: https://towardsdatascience.com/27648fb18519
from simplehtr.
from simplehtr.
There is no need to resize your input to 128x32, this is done automatically (also taking care that no distortion happens). You just have to take care that in your image, when downsized, the text is still large enough to be recognized.
I recommend either approach 2.1 or 2.2 of the linked article.
from simplehtr.
from simplehtr.
A small illustration:
- Top: your image containing some words.
- Left: downsize it and feed complete image to model.
- Right: apply word-segmentation (only one word shown). Feed each word individually to the model.
As you can see, the images of the segmented words contain much bigger text, which can easily be recognized by the model.
from simplehtr.
from simplehtr.
from simplehtr.
Is the writing style for you alhpnum images the same as for your "normal" text?
Just to be sure that the bad results are not just because the writing looks entirely different.
If it's not the writing style: maybe the model learned some language properties, such that certain character combinations are more likely than others, e.g. "na" is more likely than "nx" in the IAM dataset which the model was trained on. But this is just an assumption, I didn't do any experiments to check if this is really the case.
from simplehtr.
How can I train your network for some other language like Hindi??
from simplehtr.
Related Issues (20)
- TypeError: a bytes-like object is required, not 'NoneType' (dataloader_iam.py line 119) HOT 4
- Blank line filter in dataloader doesn't quite work HOT 1
- Deep Stream HOT 1
- How to use in ML.NET c#?
- unable to build wheel for word_beam_search HOT 1
- Where can I find the tagset.txt file HOT 3
- Add feature to save train loss in summary + minor bug fix HOT 2
- Data visualization HOT 2
- pip install error: ModuleNotFoundError: No module named 'patch_ng' HOT 1
- which version of python used?
- Add cudnn64_8.dll to the Windows/System32 folder, otherwise the program cannot run properly.
- How to convert checkpoint to ONNX HOT 2
- Outdated version of tensorflow used HOT 6
- Training Model
- Training the model from scratch and error "model not found" HOT 4
- Wrong detection of words in model validation HOT 8
- Missing CITATION.cff file for repository HOT 1
- where is json HOT 9
- Hello, I'm sorry to disturb you again. How to make a front-end webpage for this project, which only needs to be able to open locally. Could you please teach me? HOT 1
- performance evaluation of the experimental results HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from simplehtr.