Giter Site home page Giter Site logo

hand-seg-tpv's Introduction

Hand Segmentation in Third-Person Point of View

  • The code is modified from CSAILVision GitHub Repository.
  • EgoHands dataset is utilised to train this neural network.
  • ROS node for this module can be found here, and it is implemented in this paper.
  • GTX 1080 is used to train the network whereas the inference is running on GTX 1060.

Network Architecture

Pyramid Scene Parsing Network (PSPNet) is implemented to train the model. This paper can be found in this link.

Pretrained Model

Download the pretrained model from Google Drive and place the model in ckpt/egohands-resnet50dilated-ppm_deepsup

Quick Start: Running Inference

  1. Simple Demo
python3 run_inference.py --cfg config/egohands-resnet50dilated-ppm_deepsup.yaml --visualise
  1. To change resolution
python3 run_inference.py --cfg config/egohands-resnet50dilated-ppm_deepsup.yaml \
                         --resolution 720p --visualise

Offline Testing

  1. Generate an annotation file
python3 generate_odgt.py --folder [FOLDER] --out testing.odgt
  1. To test on a folder of images
python3 test.py --cfg config/egohands-resnet50dilated-ppm_deepsup.yaml

lib.segmentation package

lib.segmentation.module_init(cfg)
Parameters:
cfg (class 'yacs.config.CfgNode') – configuration file
Returns:
segmentation_module (class 'models.models.SegmentationModule') – segmentation module

lib.segmentation.hand_segmentation(frame, segmentation_module, save)
Parameters:
frame (numpy.ndarray) – Input frame
segmentation_module (class 'models.models.SegmentationModule') – segmentation module
save (boolean) – a flag to save the numpy array
Returns:
pred (numpy.ndarray) – prediction

hand-seg-tpv's People

Contributors

junwenkwan avatar tpatten avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.