naviocean / faster_rcnn_sku110

VoVNet, MobileNet, ShuffleNet, HarDNet, GhostNet, EfficientNet backbone networks and SKU-110K dataset for detectron2

faster_rcnn_sku110's Introduction

This repository is based on VoVNet-v2

Faster R-CNN on SKU-110K dataset

Note

We measure the inference time of all models with batch size 1 on the same RTX2080Ti GPU machine.

  • PyTorch 1.4.0
  • CUDA 10.2
  • cuDNN 7.3
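
As a rough illustration of how a per-image latency like the ones reported in the tables below could be measured, the following sketch times a callable one input at a time (batch size 1) after a few warm-up calls. The helper name mean_latency and its parameters are hypothetical and not part of this repository. On a GPU one would additionally need to synchronize the device (for example with torch.cuda.synchronize()) before reading the clock, which this CPU-only sketch leaves out.

```python
import time
from typing import Any, Callable, Iterable


def mean_latency(predict: Callable[[Any], Any], inputs: Iterable[Any], warmup: int = 2) -> float:
    """Return the mean wall-clock seconds per call of predict(x), one input per call."""
    items = list(inputs)
    if len(items) <= warmup:
        raise ValueError("need more inputs than warm-up iterations")
    # Warm-up calls are excluded from the timing (lazy initialization, caches, etc.).
    for x in items[:warmup]:
        predict(x)
    timed = items[warmup:]
    start = time.perf_counter()
    for x in timed:
        predict(x)  # batch size 1: a single input per call
    elapsed = time.perf_counter() - start
    return elapsed / len(timed)


def fake_model(x: int) -> int:
    """Stand-in for a real predictor so the sketch runs without a GPU or detectron2."""
    return x * 2


if __name__ == "__main__":
    print(f"{mean_latency(fake_model, range(20)):.6f} s per call")
```

Excluding the warm-up calls matters because the first few forward passes of a real model usually include one-off setup costs that would inflate the average.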

Lightweight with FPNLite

Backbone            Param.  lr sched  inference time (s)  AP     AP75   AP50   download
MobileNetV2-0.5-64  N/A     1x        0.033               43.31  44.66  78.08  model | metrics
MobileNetV2-0.5     N/A     1x        0.037               42.93  44.27  77.31  model | metrics
MobileNetV2         3.5M    3x        0.031               52.11  58.72  85.98  model | metrics
MobileNetV2         3.5M    1x        0.031               51.20  56.93  85.71  model | metrics
MobileNetV2-FLGC    N/A     1x        0.030               50.59  56.05  85.21  model | metrics
ShuffleNetV2-0.5    N/A     1x        0.039               48.24  52.95  82.10  model | metrics
ShuffleNetV2        N/A     1x        0.028               52.60  59.55  86.19  model | metrics
V2-19               11.2M   1x        0.034               41.46  44.97  71.32  model | metrics
V2-19-DW            6.5M    1x        N/A                 N/A    N/A    N/A    model | metrics
V2-19-Slim          3.1M    1x        0.027               47.68  51.47  82.36  model | metrics
V2-19-Slim-DW       1.8M    3x        N/A                 N/A    N/A    N/A    model | metrics
  • The -64 suffix denotes FPN.OUT_CHANNELS = 64.
  • DW and Slim denote depthwise separable convolution and a thinner model with half the channel size, respectively.
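
To make the DW and Slim notes concrete, here is a small worked example of weight counts for a single convolution layer. It is only arithmetic on one layer with illustrative sizes (bias terms ignored, function names hypothetical), not the actual V2-19 layer configuration, so it will not reproduce the Param. column above.

```python
def conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def dw_separable_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a depthwise k x k conv followed by a 1 x 1 pointwise conv (bias ignored)."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    k, c_in, c_out = 3, 64, 128  # illustrative sizes only
    print(conv_params(k, c_in, c_out))            # 73728
    print(dw_separable_params(k, c_in, c_out))    # 8768
    print(conv_params(k, c_in // 2, c_out // 2))  # 18432
```

Halving both channel counts cuts a standard convolution's weights to a quarter, while the depthwise separable variant replaces the single product k * k * c_in * c_out with the sum of two much smaller terms.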

FPN

Backbone   Param.  lr sched  inference time (s)  AP     AP75   AP50   download
V2-19-FPN  37.6M   3x        N/A                 N/A    N/A    N/A    model | metrics
R-50-FPN   51.2M   3x        N/A                 N/A    N/A    N/A    model | metrics
V2-39-FPN  52.6M   3x        0.071               51.47  57.5   85.5   model | metrics

These numbers were obtained with the following command, using --num-gpus 1:

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/<config.yaml> --eval-only --num-gpus 1 MODEL.WEIGHTS <model.pth>

Installation

This repository is implemented as an extension on top of detectron2 (in the style of detectron2/projects), so you only need to install detectron2 by following its INSTALL.md.

Prepare the SKU-110K dataset:

  • To download the dataset, please visit here
  • Extract the downloaded file to datasets/sku110/images
  • Extract datasets/sku110/Annotations.zip; it contains two folders, Annotations and ImageSets
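
The preparation steps above can be sketched in Python. This is a minimal sketch that assumes Annotations.zip unpacks the Annotations and ImageSets folders directly into datasets/sku110 (the README does not say where they should end up); the function name prepare_annotations is hypothetical.

```python
import zipfile
from pathlib import Path
from typing import List


def prepare_annotations(root: str = "datasets/sku110") -> List[str]:
    """Extract Annotations.zip under root and return the expected folders that are missing."""
    root_dir = Path(root)
    archive = root_dir / "Annotations.zip"
    if archive.is_file():
        with zipfile.ZipFile(archive) as zf:
            # Assumption: both folders sit at the top level of the archive.
            zf.extractall(root_dir)
    expected = ["images", "Annotations", "ImageSets"]
    return [name for name in expected if not (root_dir / name).is_dir()]


if __name__ == "__main__":
    missing = prepare_annotations()
    print("missing folders:", missing or "none")
```

An empty result means all three folders are in place; any names returned point at a step that still needs to be done.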

Training

To train a model, run

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/<config.yaml>

For example, to launch end-to-end Faster R-CNN training with VoVNetV2-39 backbone on 8 GPUs, one should execute:

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --num-gpus 8

Evaluation

Model evaluation can be done similarly:

python /path/to/sku110/train_net.py --config-file /path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --eval-only MODEL.WEIGHTS <model.pth>
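
For scripting repeated runs, the training and evaluation commands above can be assembled programmatically. The sketch below only builds the argument list from the flags shown in this README (--config-file, --num-gpus, --eval-only, and trailing KEY VALUE overrides such as MODEL.WEIGHTS); the helper name build_command and the model.pth path are hypothetical placeholders.

```python
import shlex
import sys
from typing import Dict, List, Optional


def build_command(script: str, config: str, num_gpus: Optional[int] = None,
                  eval_only: bool = False,
                  overrides: Optional[Dict[str, str]] = None) -> List[str]:
    """Build the argv list for train_net.py using only the flags shown in the README."""
    cmd = [sys.executable, script, "--config-file", config]
    if eval_only:
        cmd.append("--eval-only")
    if num_gpus is not None:
        cmd += ["--num-gpus", str(num_gpus)]
    # Config overrides go last, as KEY VALUE pairs (e.g. MODEL.WEIGHTS model.pth).
    for key, value in (overrides or {}).items():
        cmd += [key, str(value)]
    return cmd


if __name__ == "__main__":
    cmd = build_command(
        "/path/to/sku110/train_net.py",
        "/path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml",
        num_gpus=1,
        eval_only=True,
        overrides={"MODEL.WEIGHTS": "model.pth"},  # placeholder checkpoint path
    )
    print(" ".join(shlex.quote(part) for part in cmd))
    # To actually launch it: subprocess.run(cmd, check=True)
```

Keeping the command as a list rather than a single string avoids shell quoting problems when paths contain spaces.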

Visualization

To visualize the results, run

python /path/to/sku110/demo.py --config-file /path/to/sku110/configs/faster_rcnn_V_39_FPN_3x.yaml --input image.jpg --output image.jpg MODEL.WEIGHTS <model.pth>

Citing VoVNet

If you use VoVNet, please use the following BibTeX entry.

@inproceedings{lee2019energy,
  title = {An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection},
  author = {Lee, Youngwan and Hwang, Joong-won and Lee, Sangrok and Bae, Yuseok and Park, Jongyoul},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops},
  year = {2019}
}
