Giter Site home page Giter Site logo

Comments (9)

githubharald avatar githubharald commented on August 21, 2024

Hi,

yes, that's correct. The reason is not the number of words, but the size of the input image.
The input image is downsized to 128x32px, so if it contains a long sentence you won't be able to read the text on it anymore.
However, groups of short words should work fine, e.g. "I go home".

For more information see section 2 of this article: https://towardsdatascience.com/27648fb18519

from simplehtr.

IamDixit avatar IamDixit commented on August 21, 2024

from simplehtr.

githubharald avatar githubharald commented on August 21, 2024

There is no need to resize your input to 128x32, this is done automatically (also taking care that no distortion happens). You just have to take care that in your image, when downsized, the text is still large enough to be recognized.
I recommend either approach 2.1 or 2.2 of the linked article.

from simplehtr.

IamDixit avatar IamDixit commented on August 21, 2024

from simplehtr.

githubharald avatar githubharald commented on August 21, 2024

A small illustration:

  • Top: your image containing some words.
  • Left: downsize it and feed complete image to model.
  • Right: apply word-segmentation (only one word shown). Feed each word individually to the model.

As you can see, the images of the segmented words contain much bigger text, which can easily be recognized by the model.

downsize_imgs

from simplehtr.

IamDixit avatar IamDixit commented on August 21, 2024

from simplehtr.

IamDixit avatar IamDixit commented on August 21, 2024

from simplehtr.

githubharald avatar githubharald commented on August 21, 2024

Is the writing style for you alhpnum images the same as for your "normal" text?
Just to be sure that the bad results are not just because the writing looks entirely different.

If it's not the writing style: maybe the model learned some language properties, such that certain character combinations are more likely than others, e.g. "na" is more likely than "nx" in the IAM dataset which the model was trained on. But this is just an assumption, I didn't do any experiments to check if this is really the case.

from simplehtr.

PartheshSoni avatar PartheshSoni commented on August 21, 2024

How can I train your network for some other language like Hindi??

from simplehtr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.