Giter Site home page Giter Site logo

tf_cv_ocr_project's Introduction

OCR (Optical Character Recognition) Android App.

OCR is the process of recognizing characters from images using computer vision and machine learning techniques. The current implementation uses the EAST detection model and the Keras-OCR text recognition model as a pipeline to recognize texts.

Requirements

  • Android Studio 4.2 (installed on a Linux, Mac or Windows machine)
  • An Android device, or an Android Emulator

Build and run

Step 1. Clone the TensorFlow examples source code

Clone the TF_CV_OCR_PROJECT GitHub repository to your computer to get the demo application.

git clone https://github.com/claumoncada/TF_CV_OCR_PROJECT

Step 2. Import the sample app to Android Studio

Open the TensorFlow source code in Android Studio. To do this, open Android Studio and select Import Projects (Gradle, Eclipse ADT, etc.), setting the folder to TF_CV_OCR_PROJECT

Step 3. Run the Android app

Connect the Android device to the computer and be sure to approve any ADB permission prompts that appear on your phone. Select Run -> Run app. Select the deployment target in the connected devices to the device on which the app will be installed. This will install the app on the device.

To test the app, open the app called TFL OCR on your device. Re-installing the app may require you to uninstall the previous installations.

For gradle CLI, running ./gradlew build can create APKs for both solutions under app/build/outputs/apk.

Download the APK-DEMO

Feel free to download the OCR demo application directly from google drive at: https://drive.google.com/file/d/1YxZNUN5SP73qWeoGEgN1mYVyykc9ckaI/view This apk returns the same results as the source code presented in this repository.

Limitations

  • The current text recognition model is an improvement of the original keras-ocr model trained with English letters and numbers. In the same way as the original, the new model was trained using synthetic data but adding new characters to the alphabet.

  • The current model, similar to the original, is not general enough for OCR in the wild (say, random images taken by a smartphone camera in a low lighting condition).

References

tf_cv_ocr_project's People

Contributors

ziyeqinghan avatar tensorflower-gardener avatar lintian06 avatar khanhlvg avatar wangtz avatar markdaoust avatar nealwu avatar yashk2810 avatar abhinavsp0730 avatar tfboyd avatar lamberta avatar vincentvanhoucke avatar lu-wang-g avatar jdduke avatar paultr avatar valeriyvan avatar mhyttsten avatar daisuke-t-jp avatar elibixby avatar martinwicke avatar mingxingtan avatar itsmeolivia avatar saryongkang avatar ehennis avatar xunkai55 avatar tanjinprity4 avatar ozbek avatar noranglemon avatar raskutti avatar jaeyoo avatar

Watchers

Claudia Moncada avatar

Forkers

saulfigue

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.