Giter Site home page Giter Site logo

varsha359 / handwritten-text-recognition-1 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from psarpei/handwritten-text-recognition

0.0 0.0 0.0 11.44 MB

As part of the project we examine several approaches for recognizing text in images and predicting the whole digital text. All aproaches follow the method to break the image down into the smaller parts like lines, words or characters.

License: MIT License

Python 4.30% Jupyter Notebook 95.70%

handwritten-text-recognition-1's Introduction

Handwritten-Text-Recognition

As part of the project we examine several approaches for handwriting text recognition based on convolutional neural networky and long short-term memories.

All aproaches follow the method to break the image down into the smaller parts:

  • lines
  • words
  • characters

The two best approaches are explained in the written elaboration (only available in german), that you can find between the source code folders of this repository. On top of that there is a explanation of the object detection approach YOLOv1, and the End-to-End Trainable Neural Network for Image-based-Sequence Recognition which are used in all approaches.

General information

Instructors

Institutions

Project team

Tools

  • Python 3
  • PyTorch
  • Pillow
  • OpenCV

Project

Dataset

We only use the data of the IAM Handwriting Database for training and testing.

The database consists of:

  • 657 writers contributed samples of their handwriting
  • 1'539 pages of scanned text
  • 5'685 isolated and labeled sentences
  • 13'353 isolated and labeled text lines
  • 115'320 isolated and labeled words

All form, line and word images are provided as PNG files and the corresponding form label files, including segmentation information and variety of estimated parameters, are included in the image files as meta-information in XML format which is described in XML file and XML file format (DTD).



Results

We compare our best approach with the state-of-the-art CRNN approach by the character error rate (cer).

Approach CER %
CRNN 5.7
Our best 10.64

Source Code

The source code of all approaches are available in the .pynb Python formats in the way of google-colab

Open In Colab

handwritten-text-recognition-1's People

Contributors

psarpei avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.