Giter Site home page Giter Site logo

tesseract-samples's Introduction

Samples that demonstrate using Tesseract .Net Wrapper.

ConsoleDemo

A short demo that ocr's an image and prints out the results using both Page.GetText() and a result iterator.

Web Demo

A simple demonstration of using Tesseract from within ASP.NET.

Warning: To keep things simple the sample will create a new instance of the TesseractEngine each time a image is processed. However this is not performant as creating a new TesseractEngine is expensive and would be a good candiate for pooling to allow a single engine instance to be reused.

License

Copyright 2012-2015 Charles Weld.

Licensed under the [Apache License, Version 2.0][apache2] (the "License"); you may not use this software except in compliance with the License. You may obtain a copy of the License at:

[http://www.apache.org/licenses/LICENSE-2.0][apache2]

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

tesseract-samples's People

Contributors

charlesw avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

tesseract-samples's Issues

System.Runtime.InteropServices.SEHException: 'External component has thrown an exception.'

In the console demo, running with no changes, I immediately get the error in the title on line 24 in Program.cs
var text = page.GetText();

stacktrace:
Fatal error. System.Runtime.InteropServices.SEHException (0x80004005): External
component has thrown an exception.
at InteropRuntimeImplementer.TessApiSignaturesInstance.TessApiSignaturesImplementation.BaseApiRecognize(System.Runtime.InteropServices.HandleRef, System.Runtime.InteropServices.HandleRef)
at Tesseract.Page.Recognize()
at Tesseract.Page.GetText()
at Tesseract.ConsoleDemo.Program.Main(System.String[])

Failed to find library "liblept1753.dll" for platform x64

When I try to deploy the solution to server, I get following issue.

image

This is the event trace I got. Already tried adding CustomSearchPath to bin x64 and x86 folders, even tried copying those dll to the bin folder.

Following line is causing the above error.
TesseractEngine engine = new TesseractEngine(tesseractDataFolder, "eng", EngineMode.Default);

This works perfectly fine on local machine, even copied server dll to local and tried, then again works fine.

Local machine - Windows 10 and server side - Windows 7 (No idea if there's any link to this issue)

Any suggestions to solve this?

Strong Name

Previous versions V3 etc where built and signed with a strong name (framework versions)
Can we get a V5 built with a strong name

How to Train special character.

I need to get the currency symbol (৳) of Bangladesh. Please see the image below:
image

output is:
image

expected values are ৳00 instead of 400, ৳15 instead of 615.....
OR
expected values are 00 instead of 400, 15 instead of 615...

Failed to load image

I am using the sample code provided in a console application. For some sample files, i am able to get the results printed while in some case it throws an exception. I tried to see the difference between the working and non working images but the meta data seems very similar.

I am using magick.net library with ghost script to convert PDF to TIF/PNG. I use those images in the tesseract engine to validate the results.

using (var img = Pix.LoadFromFile(testImagePath)) : Line 26

Exception:
System.IO.IOException: Failed to load image '\Snakeware.Page1.tif'.
at Tesseract.Pix.LoadFromFile(String filename)
at ConsoleApp3.utils.Ocr.performOCR() in \Ocr.cs:line 26

Input pattern?

Is it possible to use an user patterns file for TesseractEngine.Process?

Failed to initialise tesseract engine..

Hi
I am facing the error while initializing the TesseractEngine. Below is the stacktrace
at Tesseract.TesseractEngine.Initialise(String datapath, String language, EngineMode engineMode, IEnumerable1 configFiles, IDictionary2 initialValues, Boolean setOnlyNonDebugVariables)
at Tesseract.TesseractEngine..ctor(String datapath, String language, EngineMode engineMode, IEnumerable1 configFiles, IDictionary2 initialOptions, Boolean setOnlyNonDebugVariables)
at Tesseract.TesseractEngine..ctor(String datapath, String language, EngineMode engineMode)
at ConsoleApplication.Program.Main(String[] args) i

error finding Default.aspx

Almost certianly due to my own stupidity - but I keep getting "The requested URL /Default.aspx was not found on this server"
Any suggestions?
I've set this page as the startup page, etc. - but still get this error.

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.