Giter Site home page Giter Site logo

vtd's Introduction

Scene Text Detection and Recognition


The VTD project is on scene text detection and recognition, based on `EAST', 'CTPN' and 'CRNN'.

RoadMap

Scene Text Detection is a fast evolving field with new techniques and architectures being published frequently.
The goal of this project is facilitating the development of such techniques and applications. While constantly improving the quality of code and readme.
The main functions and models of VTD are as follows:

  • Detection: EAST, CTPN and my designed network
  • Recognition: CRNN
  • GPU_Tracker: track your gpu usage

Usage

First of all, the data package is a preprocess lib to resize or extract the text area.
'Detection' includes these fancy models, how to use them? start with the main.py script of each model. And do not foget to change the data path for your own data.
Recognition only includes the CRNN model, and start with main.py to train your network. For details, you can read the arguments of DataGenerator.py.

Demo

Comparison

To verify the effectiveness of our proposed framework, we did a series of comparative experiments on the ICDAR2013, ICDAR2015 and ICDAR2017 data sets. These experiments mainly focus on evaluating two essential methods in our model: Channel Attention Block, and Spatial Attention Block. All the experiments are based on single model and single-scale, the scales of testing images are set as 720 for three benchmark datasets.

Author

  • Shuai Li (ๆŽๅธ…) - all work

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.