Comments (6)
Hi,
for me it helps often to upscale a image.
in ocrdesktop i resize the image by factor 3 (Bicubic). This brings quite good results.
Cheers
from tesseract.
In case that you use screenshots, please notice, that screenshots usually have 72dpi, which is not sufficient for Tesseract. I admit, that this is not so well known, however, it is mentioned since a long time here https://github.com/tesseract-ocr/tesseract/wiki/FAQ#is-there-a-minimum-text-size-it-wont-read-screen-text
Is there a Minimum Text Size? (It won't read screen text!)
There is a minimum text size for reasonable accuracy. You have to consider resolution as well as point size. Accuracy drops off below 10pt x 300dpi, rapidly below 8pt x 300dpi. A quick check is to count the pixels of the x-height of your characters. (X-height is the height of the lower case x.) At 10pt x 300dpi x-heights are typically about 20 pixels, although this can vary dramatically from font to font. Below an x-height of 10 pixels, you have very little chance of accurate results, and below about 8 pixels, most of the text will be "noise removed".
Here's another useful page:
https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality
I recommend:
- remember never to use JPEG (because this is a lossy compresison format) when saving images with text (use: PNG, or use TIF, or GIF as the last choice)
- resize your screenshots or images to at least 300dpi, or 400dpi, or upscale by 400%:
convert -resize 400% ...
- when you start form a pdf then read http://bertanguven.com/faster-conversions-from-pdf-to-pngjpeg-imagemagick-vs-ghostscript/
density=400
gs -dNOPAUSE -sDEVICE=png16m -sOutputFile=$image -r$density -q $file -c quit
[Posting updated with information which came in later]
from tesseract.
@Wikinaut, why do you link to the old wiki at googlecode instead to the new one at github?
https://github.com/tesseract-ocr/tesseract/wiki/FAQ#is-there-a-minimum-text-size-it-wont-read-screen-text
Here is another useful wiki page.
https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality
[Wikinaut updated his comment.]
from tesseract.
I updated my posting above accordingly. Thanks
from tesseract.
Please use tesseract user forum for support[1]. Also do not forget to search forum for you topic before asking for help.
[1] https://github.com/tesseract-ocr/tesseract/wiki/FAQ#rules-and-advice
from tesseract.
open
raise ValueError("bad mode %r" % mode)
ValueError: bad mode 'sign.png'
what to do i am helpless
from tesseract.
Related Issues (20)
- Tesseract fails to OCR text with very clear hexadecimal digits HOT 5
- Two little bugs for tesseract HOT 1
- multithreaded tesseract causes Linux crash HOT 5
- Linker Error for tesseract53.lib HOT 1
- Add redirect function HOT 1
- Add ICD Codes in english trained Data HOT 2
- Some CI jobs (GitHub Actions) are failing HOT 10
- uuencode-generated text is OCRed with many mistakes HOT 2
- Error! The command "tesseract" was not found. HOT 2
- Error! The command "tesseract" was not found
- unicharset_extractor segfault HOT 31
- Please add the API call to translate the language code to the full language name HOT 3
- Warning: LSTMTrainer deserialized an LSTMRecognizer! Error, data/eng/eng_num_vert.lstm is an integer (fast) model, cannot continue training HOT 7
- Add the NN for a 'random' ASCII language HOT 1
- "min_characters_to_try" parameter does not work HOT 2
- phonetic symbols and special characters HOT 1
- inform where we can find tesseract.exe HOT 1
- Native Crash in otsuthr.cpp HOT 2
- CI: vcpkg failure due to missing xz tarball HOT 4
- link error LNK1120 with text2image.exe
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tesseract.