Semantic-Aware-Video-Text-Detection

Introduction

This is a PyTorch implemntation of the CVPR 2021 paper Semantic-Aware-Video-Text-Detection.

Installation

The code is based on the mmdetection(2.11.0) framework.

Requirements:

Python3.6+
PyTorch 1.3+ and torchvision that matches the Pytorch installation.
CUDA 9.2+
GCC 5+
MMCV

# install the mmcv
pip install mmcv-full==1.3.9
# clone our model
git clone https://github.com/zjb-1/Semantic-Aware-Video-Text-Detection.git
# install the cocoapi
cd Semantic-Aware-Video-Text-Detection/cocoapi/PythonAPI
python setup.py build_ext install
# install our model
cd ../../
pip install -r requirements.txt
pip install -v -e .

Models

If you need a pre-trained model or a trained model, you can contact me.

Datasets

The video datasets format is as follows:

dataset
├─Video1
│    ├─1.jpg
│    ├─1.txt
│    ├─2.jpg
│    ├─2.txt
│    └─...
├─Video2
│    ├─1.jpg
│    ├─1.txt
│    ├─2.jpg
│    ├─2.txt
│    └─...
├─ ...

The txt file format is as follows(Coordinate points arranged clockwise, text, id):

x1,y1,x2,y2,x3,y3,x4,y4 text id

Then, you need to run the train_label_gen.py / test_label_gen.py to generate the label file.(Remember to modify the file path in the file).

Training

Before training, you need to modify the profile(mask_track_rcnn_r50_fpn.py) and shell file(train.sh).

# training
bash train.sh

Evaluation

Before evaluation, you need to modify the test shell file(test.sh).

# test
bash test.sh

You will get visual results.

zjb-1 / semantic-aware-video-text-detection Goto Github PK

semantic-aware-video-text-detection's Introduction

Semantic-Aware-Video-Text-Detection

Introduction

Installation

Requirements:

Models

Datasets

Training

Evaluation

semantic-aware-video-text-detection's People

Contributors

Stargazers

Watchers

Forkers

semantic-aware-video-text-detection's Issues

Need a train model

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent