This project forked from gustavz/mobile_mask_rcnn

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow for Mobile Deployment

Mobile Mask R-CNN

This is a Mask R-CNN implementation that uses MobileNet V1/V2 as the backbone architecture, so that the model can be deployed on mobile devices such as the Nvidia Jetson TX2. The major changes to the original matterport project are:

  • Add MobileNet V1 and V2 as backbone options (besides ResNet 50 and 101) + dependencies in the model
  • Make the whole project py2 / py3 compatible (original only works on py3)
  • Investigate training setup for MobileNet V1 and implement it in coco_train.py
  • Add a speed hack to the mold / unmold image functions
  • Make the project lean and focused on COCO + direct training on passed class names (IDs before)
  • Include more speed-up options in the model (Light-Head R-CNN)
  • Release a trained Mobile_Mask_RCNN Model
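As a rough illustration of "direct training on passed class names (IDs before)", the sketch below resolves class names to category IDs from a COCO-style category list. It is not the project's code: the helper `class_ids_for` and the three-entry `CATEGORIES` list are hypothetical, and only the `id` / `name` fields follow the COCO annotation format.

```python
# Hypothetical excerpt of the "categories" section of a COCO annotation file.
CATEGORIES = [
    {"id": 1, "name": "person"},
    {"id": 2, "name": "bicycle"},
    {"id": 3, "name": "car"},
]


def class_ids_for(names, categories):
    """Map class names to category IDs, preserving the order of `names`."""
    by_name = {c["name"]: c["id"] for c in categories}
    unknown = [n for n in names if n not in by_name]
    if unknown:
        # Fail early instead of silently training on fewer classes.
        raise ValueError("unknown class names: {}".format(unknown))
    return [by_name[n] for n in names]


print(class_ids_for(["car", "person"], CATEGORIES))
```

Failing on unknown names is a design choice made here for the example; a real training script might prefer to warn and continue.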

Getting Started

  • install the required packages (mostly via pip)
  • clone this repository
  • download and set up the COCO dataset: setup_coco.py
  • inside coco.py, subclass Config (defined in config.py) and change the model parameters to your needs
  • train Mobile Mask R-CNN on COCO with: train_coco.py
  • evaluate your trained model with: eval_coco.py
  • do both interactively with the notebook train_coco.ipynb
  • if your kernel gets killed due to memory errors, use bash train.sh for infinite training
  • visualize / monitor training with TensorBoard: cd into your current log dir and run:
    tensorboard --logdir="$(pwd)"
  • inspect your model with the notebooks in notebooks/:
    inspect_data.ipynb, inspect_model.ipynb, inspect_weights.ipynb, detection_demo.ipynb
  • convert the Keras .h5 model to a TensorFlow .pb model file; in notebooks/ run:
    export_model.ipynb
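The "subclass Config" step above might look roughly like the following sketch. To keep it runnable on its own, it defines a small stand-in `Config` class; the real base class lives in config.py and has more settings. The attribute names follow the matterport convention, and the values (including the backbone string "mobilenetv1" and the subclass name `MobileCocoConfig`) are assumptions, so check config.py for the names this fork actually uses.

```python
class Config(object):
    """Hypothetical stand-in for the base class defined in config.py."""
    NAME = None
    BACKBONE = "resnet101"
    NUM_CLASSES = 1          # background only
    IMAGE_MAX_DIM = 1024
    GPU_COUNT = 1
    IMAGES_PER_GPU = 2

    def __init__(self):
        # Derived value: effective batch size across all GPUs.
        self.BATCH_SIZE = self.IMAGES_PER_GPU * self.GPU_COUNT


class MobileCocoConfig(Config):
    """Example overrides for training a MobileNet backbone on COCO."""
    NAME = "coco_mobile"
    BACKBONE = "mobilenetv1"  # assumed identifier for the MobileNet V1 backbone
    NUM_CLASSES = 1 + 80      # background + 80 COCO classes
    IMAGE_MAX_DIM = 512       # matches the 512x512 input size used in Performance
    IMAGES_PER_GPU = 1        # smaller batches reduce memory pressure


config = MobileCocoConfig()
print(config.NAME, config.BACKBONE, config.BATCH_SIZE)
```

Overriding class attributes keeps every experiment's settings in one small subclass, and lowering `IMAGES_PER_GPU` is usually the first thing to try when training runs out of memory.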

Performance

Mobile Mask R-CNN trained on 512x512 input size

  • 100 Proposals: 0.22 mAP (VOC) @ 250ms
  • 1000 Proposals: 0.25 mAP (VOC) @ 330ms
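To put the two operating points in perspective, the short calculation below converts the reported latencies into frames per second and compares the accuracy gain with the added latency. It assumes the timings are per-image inference times; the source does not state the hardware or batch size.

```python
# (mAP, latency in ms) per number of proposals, taken from the list above.
RESULTS = {100: (0.22, 250.0), 1000: (0.25, 330.0)}


def fps(latency_ms):
    """Convert a per-image latency in milliseconds to frames per second."""
    return 1000.0 / latency_ms


for proposals in sorted(RESULTS):
    m_ap, latency_ms = RESULTS[proposals]
    print("{:>4} proposals: {:.2f} mAP, {:.2f} FPS".format(
        proposals, m_ap, fps(latency_ms)))

# Trade-off when going from 100 to 1000 proposals.
map_gain = RESULTS[1000][0] - RESULTS[100][0]
extra_ms = RESULTS[1000][1] - RESULTS[100][1]
print("+{:.2f} mAP for +{:.0f} ms".format(map_gain, extra_ms))
```

Under that assumption, 250 ms corresponds to 4 frames per second and 330 ms to about 3, so the extra 0.03 mAP costs roughly one frame per second.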

Requirements

  • numpy
  • scipy
  • Pillow
  • cython
  • matplotlib
  • scikit-image
  • tensorflow>=1.3.0
  • keras>=2.1.5
  • opencv-python
  • h5py
  • imgaug
  • IPython[all]
  • pycocotools
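Before training, it can help to check which of these packages are missing from the current environment. The sketch below is one way to do that in Python 3 without importing the packages; the helper `find_missing` is hypothetical, and the table maps each pip name to the module name it is imported under. It does not check the minimum versions listed for tensorflow and keras.

```python
import importlib.util

# pip package name -> top-level module name used in import statements
MODULES = {
    "numpy": "numpy",
    "scipy": "scipy",
    "Pillow": "PIL",
    "cython": "Cython",
    "matplotlib": "matplotlib",
    "scikit-image": "skimage",
    "tensorflow": "tensorflow",
    "keras": "keras",
    "opencv-python": "cv2",
    "h5py": "h5py",
    "imgaug": "imgaug",
    "IPython[all]": "IPython",
    "pycocotools": "pycocotools",
}


def find_missing(modules):
    """Return the pip names whose module cannot be found, sorted by name."""
    return sorted(
        pip_name
        for pip_name, module_name in modules.items()
        if importlib.util.find_spec(module_name) is None
    )


missing = find_missing(MODULES)
print("missing packages:", ", ".join(missing) if missing else "none")
```

Using `find_spec` only locates each module rather than importing it, so the check stays fast even when heavy packages such as tensorflow are installed.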
