faceboxes

introduction

A tensorflow 2.0 implement faceboxes.

CAUTION: this is the tensorflow2 branch, if you need to work on tensorflow1, please switch to tf1 branch

And some changes has been made in RDCL module, to achieve a better performance and run faster:

input size is 512 (1024 in the paper), then the first conv stride is 2, kernel size 7x7x12.
replace the first maxpool by conv 3x3x24 stride 2
replace the second 5x5 stride2 conv and maxpool by two 3x3 stride 2 conv
anchor based sample is used in data augmentaion.

codes like below

    with tf.name_scope('RDCL'):
     net = conv2d(net_in, 12, [7, 7], stride=2,activation_fn=tf.nn.relu, scope='init_conv1')
     net = conv2d(net, 24, [3, 3], stride=2, activation_fn=tf.nn.crelu, scope='init_conv2')
    
     net = conv2d(net, 32, [3, 3], stride=2,activation_fn=tf.nn.relu,scope='conv1x1_before1')
     net = conv2d(net, 64, [3, 3], stride=2, activation_fn=tf.nn.crelu, scope='conv1x1_before2')
     
     return net

I want to name it faceboxes++ ,if u don't mind

Pretrained model can be download from:

baidu disk (code zn3x )
google drive

Evaluation result on fddb

fddb
0.96

Speed: it runs over 70FPS on cpu (i7-8700K), 30FPS (i5-7200U), 140fps on gpu (2080ti) with fixed input size 512, tf2.0, multi thread. And i think the input size, the time consume and the performance is very appropriate for application :)

Hope the codes can help you, contact me if u have any question, [email protected] .

requirment

tensorflow2.0
tensorpack (data provider)
opencv
python 3.6

useage

train

download widerface data from http://shuoyang1213.me/WIDERFACE/ and release the WIDER_train, WIDER_val and wider_face_split into ./WIDER,
download fddb, and release FDDB-folds into ./FDDB ， 2002,2003 into ./FDDB/img
then run python prepare_data.pyit will produce train.txt and val.txt

(if u like train u own data, u should prepare the data like this: ...../9_Press_Conference_Press_Conference_9_659.jpg| 483(xmin),195(ymin),735(xmax),543(ymax),1(class) ...... one line for one pic, caution! class should start from 1, 0 means bg)
then, run:

python train.py

and if u want to check the data when training, u could set vis in train_config.py as True

finetune

(if u like train u own data, u should prepare the data like this: ...../9_Press_Conference_Press_Conference_9_659.jpg| 483(xmin),195(ymin),735(xmax),543(ymax),1(class) ...... one line for one pic, caution! class should start from 1, 0 means bg)

set config.MODEL.pretrained_model='./model/detector/variables/variables', in train_config.py, and the model dir structure is :

./model/
├── detector
│   ├── saved_model.pb
│   └── variables
│       ├── variables.data-00000-of-00001
│       └── variables.index

adjust the lr policy
python train.py

evaluation

    python test/fddb.py [--model [TRAINED_MODEL]] [--data_dir [DATA_DIR]]
                          [--split_dir [SPLIT_DIR]] [--result [RESULT_DIR]]
    --model              Path of the saved model,default ./model/detector
    --data_dir           Path of fddb all images
    --split_dir          Path of fddb folds
    --result             Path to save fddb results

example python model_eval/fddb.py --model model/detector --data_dir 'FDDB/img/' --split_dir FDDB/FDDB-folds/ --result 'result/'

jnulzl / faceboxes-tensorflow Goto Github PK

faceboxes-tensorflow's Introduction

faceboxes

introduction

requirment

useage

train

finetune

evaluation

visualization

reference

FaceBoxes: A CPU Real-time Face Detector with High Accuracy

faceboxes-tensorflow's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent