Giter Site home page Giter Site logo

r-net's Introduction

R-Net

Description

This is an implement of "R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection". R-Net is a direct regression based method for multi-oriented scene text detection, which outperforms parallel methods by a large margin. The paper is avaliabel at Paperlink(TMM2020).

Result

Model recall precision F-measure TIoU-R TIoU-P TIoU-F
This implementation (IC15) 84.5 88.1 86.3 60.1 67.9 63.7

The released model is pre-trained on SynthText (1 epoch) and then finetune on ICDAR2015 (batchsize=8 on 1 gpu). Reasults on other datasets (e.g. MSRA-TD500, ICDAR2013, MLT, etc.) can be easily obtained by following our training setting.

Updates

2020/5/18 We have updated the code.

Requirement

  • Python3
  • PyTorch-0.4.1
  • torchvision-0.2.1
  • shapely-1.6.4.post2
  • lanms-1.0.2
  • opencv-python(4.1.0.25)

Installation

Data

Download data images and annotions from ICDAR 2015 Challenge 4. Prepare data as:

./dataset/train/imgs
./dataset/train/txt
./dataset/test/imgs
./dataset/test/txt

Pre-trained Model and Our trained Model.

Download pre-trained VGG16 (must rename as vgg16.pth) and our traind model (passward:oowz) Put both models into output/

Test

CUDA_VISIBLE_DEVICES=0 python test_rnet.py

Train

CUDA_VISIBLE_DEVICES=0,1 python train_rnet.py

Evaluation

We use online tool to evaluate our results.

Speed

Run

CUDA_VISIBLE_DEVICES=0 python speed_eval.py.

We add the model prediction time and the NMS time as the inference speed.

Citation

If you find our method useful for your reserach, please cite

@ARTICLE{rnet2020wang,
  author={Y. {Wang} and H. {Xie} and Z. {Zha} and Y. {Tian} and Z. {Fu} and Y. {Zhang}},
  journal={IEEE Transactions on Multimedia}, 
  title={R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection}, 
  year={2020},
  pages={1-1},}

Feedback

Suggestions and discussions are greatly welcome. Please contact the authors by sending email to [email protected]

r-net's People

Contributors

wangyuxin87 avatar

Stargazers

ppzze avatar  avatar  avatar LI Wentong avatar  avatar  avatar wuxiaolian avatar TIAN Xin avatar  avatar  avatar Ming Qi avatar yangxue avatar Kanak Manjari avatar sudo avatar Mike avatar  avatar  avatar  avatar Yanshuang avatar Kalo avatar Hesuo Zhang avatar sql avatar Qing avatar  avatar  avatar  avatar hylrh avatar DL avatar LateBird avatar 爱可可-爱生活 avatar fangxuwei avatar  avatar Phan Hoang avatar Zuming Huang avatar Kira Kravets avatar  avatar Ekrem Alper Kesen avatar Thananop Kobchaisawat avatar Khanh Tran avatar  avatar Chun Chet Ng avatar fireae avatar  avatar 无限未来 avatar  avatar Ruitong XU avatar sdp avatar  avatar Li Jiaming avatar  avatar JY avatar  avatar  avatar  avatar  avatar  avatar Alex Lee avatar Saicoco avatar  avatar jerome avatar colorr avatar  avatar

Watchers

Mike avatar Saicoco avatar Yangchun W avatar  avatar

r-net's Issues

trained weights on Google Drive

Good day! Can you provide a google drive link to download the trained weights on Google Drive. I found it hard to download using the Baidu link. Thank you

cannot import name 'get_rotate_mat'

@wangyuxin87
when running
CUDA_VISIBLE_DEVICES=0 python test_rnet.py
I get error:

(final) home@home-desktop:~/p3/R-Net$ CUDA_VISIBLE_DEVICES=0 python test_rnet.pyTraceback (most recent call last):
  File "test_rnet.py", line 6, in <module>
    from dataset import get_rotate_mat
ImportError: cannot import name 'get_rotate_mat'

paper pdf

你好,偶然间发现您这个仓库,但paper链接没办法下载pdf,作者可以发我一个pdf文件吗?email: [email protected],谢谢!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.