View Code? Open in Web Editor NEW

Scene Text Detection with Fully Convolutional Neural Networks

Python 4.67% CMake 1.22% Makefile 0.27% HTML 0.08% CSS 0.10% Jupyter Notebook 57.59% C++ 33.01% Shell 0.30% Cuda 2.36% MATLAB 0.36% Dockerfile 0.03%

wsrtcb's Introduction

1. Introduction

This project includes the text detection source code and trained model about the word stroke region and text center block.

2. Installation

Clone the repo

git clone [email protected]:zdlcaffe/WSRTCB.git

Then you can do as follow:

cd ${WSRTCB_root/Train_WSR_TCB/caffe}  
make –j  
make pycaffe

3. Testing

3.1 Generate WSR/TCB score map

Download the TD_MKEI_Word.caffemodel, trained on KAIT dataset.
Then you can do as follow:

cd ${WSRTCB_root/}  
mkdir snapshot  
mkdir pre_model

put TD_MKEI_Word.caffemodel to the fold of ${WSRTCB_root/Train_WSR_TCB/ snapshot}.
Suppose you have downloaded the MSRA-TD500 dataset, execute the following commands to test the model on MSRA-TD500. Then you can do as follow:

cd ${WSRTCB_root/Train_WSR_TCB/demo}  
python Demo.py

3.2 There are some samples:

3.3 Threshold WSR/TCB maps:

You can do as follow:

cd ${WSRTCB_root/Text_Demo}  
python fuse_thred	.py

3.4 Generate detection results

You can do as follow:

cd ${WSRTCB_root/Text_Demo}  
python Demo_region_word.py

3.5 There are some samples:

4. Training

Download the pretrained model vgg16convs.caffemodel, and put it to ${WSRTCB_root/Train_WSR_TCB/pre_model}

4.1 Generate your map

Scripts for generating ground truth have been provided in the label_generate directory. It not hard to write a converting script for your own dataset.

4.2 Train your own model

Modify ${WSRTCB_root/Train_WSR_TCB/model/TD_MKEI_Word.py} to configure your dataset name and dataset path like:
......
data_params['root'] = 'data/MKEIWord'
data_params['source'] = "MKEI_Word.lst"
......

4.3 Start training

You can do as follow:

cd ${WSRTCB_root/Train_WSR_TCB}  
sh ./train.sh

Citation

Use this bibtex to cite this repository:

@article{liu2019scene,
  title={Scene text detection with fully convolutional neural networks},
  author={Liu, Zhandong and Zhou, Wengang and Li, Houqiang},
  journal={Multimedia Tools and Applications},
  pages={1--23},
  year={2019},
  publisher={Springer}
}

Acknowlegement

Recommend Projects

zdlcaffe / wsrtcb Goto Github PK

wsrtcb's Introduction

1. Introduction

2. Installation

3. Testing

3.1 Generate WSR/TCB score map

3.2 There are some samples:

3.3 Threshold WSR/TCB maps:

3.4 Generate detection results

3.5 There are some samples:

4. Training

4.1 Generate your map

4.2 Train your own model

4.3 Start training

Citation

Acknowlegement

wsrtcb's People

Contributors

Watchers

Forkers

Recommend Projects

Recommend Topics

Recommend Org