Giter Site home page Giter Site logo

Comments (6)

chrys87 avatar chrys87 commented on April 27, 2024

Hi,
for me it helps often to upscale a image.
in ocrdesktop i resize the image by factor 3 (Bicubic). This brings quite good results.
Cheers

from tesseract.

Wikinaut avatar Wikinaut commented on April 27, 2024

In case that you use screenshots, please notice, that screenshots usually have 72dpi, which is not sufficient for Tesseract. I admit, that this is not so well known, however, it is mentioned since a long time here https://github.com/tesseract-ocr/tesseract/wiki/FAQ#is-there-a-minimum-text-size-it-wont-read-screen-text

Is there a Minimum Text Size? (It won't read screen text!)

There is a minimum text size for reasonable accuracy. You have to consider resolution as well as point size. Accuracy drops off below 10pt x 300dpi, rapidly below 8pt x 300dpi. A quick check is to count the pixels of the x-height of your characters. (X-height is the height of the lower case x.) At 10pt x 300dpi x-heights are typically about 20 pixels, although this can vary dramatically from font to font. Below an x-height of 10 pixels, you have very little chance of accurate results, and below about 8 pixels, most of the text will be "noise removed".

Here's another useful page:
https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality

I recommend:

  • remember never to use JPEG (because this is a lossy compresison format) when saving images with text (use: PNG, or use TIF, or GIF as the last choice)
  • resize your screenshots or images to at least 300dpi, or 400dpi, or upscale by 400%:
convert -resize 400% ...
density=400
gs -dNOPAUSE -sDEVICE=png16m -sOutputFile=$image -r$density -q $file -c quit

[Posting updated with information which came in later]

from tesseract.

amitdo avatar amitdo commented on April 27, 2024

@Wikinaut, why do you link to the old wiki at googlecode instead to the new one at github?
https://github.com/tesseract-ocr/tesseract/wiki/FAQ#is-there-a-minimum-text-size-it-wont-read-screen-text
Here is another useful wiki page.
https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality

[Wikinaut updated his comment.]

from tesseract.

Wikinaut avatar Wikinaut commented on April 27, 2024

I updated my posting above accordingly. Thanks

from tesseract.

zdenop avatar zdenop commented on April 27, 2024

Please use tesseract user forum for support[1]. Also do not forget to search forum for you topic before asking for help.
[1] https://github.com/tesseract-ocr/tesseract/wiki/FAQ#rules-and-advice

from tesseract.

suraj990 avatar suraj990 commented on April 27, 2024

open
raise ValueError("bad mode %r" % mode)
ValueError: bad mode 'sign.png'

what to do i am helpless

from tesseract.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.