Giter Site home page Giter Site logo

arabicocr's Introduction

Arabic OCR

This repository utilizes a number of algorithms and builds on top of them in order to find the best Algorithm for Arabic Optical Character Recognition.

Difficulties

Since most algortihms such as Word Segementation and Character Segmentation are available and finetuned to more mainstream languages, while very little attention is on the Arabic langauge in this subject. Arabic has been one of the most growing languages in the last decade and still going. The research on Arabic OCR is very little.

Some of the challenges we face with our OCR is character overlapping, some characters in Arabic can cause overlapping which results in bad segmentation. Some fonts can change the character structure fully, which forces us to increase the classes size, Arabic OCR usually contains 110-115 classes. Other languages may have 26-50 classes.

Solution

Using state of the art Computer Vision algoritms to detect characters and recognize them, by developing our own algorithms that fit the Arabic text properly. We follow the basic OCR flow to maintain a good accuracy. Check out the image below




Below is the result of Line Segmentation on a page. Where region with the most populated black pixels is considered a splitting region.



Below image is the before and after applying Word Segmentation on a sentence from the page.

Before


After


arabicocr's People

Contributors

s4lm-xi avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.