phyllish / pmfnet

This project forked from bobwan1995/pmfnet

Implementation of "Pose-aware Multi-level Feature Network for Human Object Interaction Detection" (ICCV 2019 Oral)

License: MIT License

Python 79.13% MATLAB 0.17% Shell 0.38% Cuda 5.44% C 4.59% C++ 0.24% Jupyter Notebook 10.06%

Pose-aware Multi-level Feature Network for Human Object Interaction Detection

Official implementation of "Pose-aware Multi-level Feature Network for Human Object Interaction Detection" (ICCV 2019 Oral).

This code follows the implementation architecture of roytseng-tw/mask-rcnn.pytorch.

Getting Started

Requirements

Tested under Python 3.

  • python packages
    • pytorch==0.4.1
    • torchvision==0.2.2
    • pyyaml==3.12
    • cython
    • matplotlib
    • numpy
    • scipy
    • opencv
    • packaging
    • ipdb
    • pycocotools — for COCO dataset, also available from pip.
    • tensorboardX — for logging the losses in Tensorboard
  • An NVIDIA GPU and CUDA 8.0 or higher. Some operations only have a GPU implementation.
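
Before compiling, it can help to confirm that the packages listed above are importable. The following is a minimal sketch and not part of the repository: the helper name `missing_modules` and the list `REQUIRED_MODULES` are hypothetical, and the import names are assumed from the usual pip-to-module mapping (pyyaml is imported as `yaml`, opencv as `cv2`, cython as `Cython`). It only checks that each module can be found; it does not verify the pinned versions.

```python
import importlib.util

# Import names assumed for the packages listed above; some differ from the
# pip names (pyyaml -> yaml, opencv -> cv2, cython -> Cython).
REQUIRED_MODULES = [
    "torch", "torchvision", "yaml", "Cython", "matplotlib", "numpy",
    "scipy", "cv2", "packaging", "ipdb", "pycocotools", "tensorboardX",
]


def missing_modules(names):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

`importlib.util.find_spec` locates a top-level module without importing it, so the check stays fast and does not initialize CUDA.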

Assume the project is located at $ROOT.

Compilation

Compile the NMS code:

cd $ROOT/lib 
sh make.sh

Data and Pretrained Model Preparation

Create a data folder under the repo:

cd $ROOT
mkdir data
  • COCO: Download the COCO images and annotations from the COCO website.

  • Our data: Download our dataset annotations and detection/keypoint proposals from our data.

  • Pose estimation: We use the repo pytorch-cpn to train our pose estimator. We have released our keypoint predictions for the V-COCO dataset in our data.

    Make sure to arrange the files in the following structure:

    data
    ├───coco
    │   ├─images
    │   │  ├─train2014
    │   │  ├─val2014 
    │   │
    │   ├─vcoco
    │      ├─annotations
    │      ├─annotations_with_keypoints
    │      ├─vcoco
    │
    ├───cache
    │   ├─addPredPose
    │
    ├───pretrained_model
        ├─e2e_faster_rcnn_R-50-FPN_1x_step119999.pth
        ├─vcoco_best_model_on_test.pth
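
The layout above can be checked with a short script before training or testing. This is a minimal sketch under stated assumptions, not part of the repository: the helper `missing_paths` and the list `EXPECTED_PATHS` are hypothetical names, the relative paths are copied from the tree above, and reading the project location from a `ROOT` environment variable is an assumption made for illustration.

```python
import os
from pathlib import Path

# Relative paths under the data folder, copied from the directory tree above.
EXPECTED_PATHS = [
    "coco/images/train2014",
    "coco/images/val2014",
    "coco/vcoco/annotations",
    "coco/vcoco/annotations_with_keypoints",
    "coco/vcoco/vcoco",
    "cache/addPredPose",
    "pretrained_model/e2e_faster_rcnn_R-50-FPN_1x_step119999.pth",
    "pretrained_model/vcoco_best_model_on_test.pth",
]


def missing_paths(data_dir, expected=EXPECTED_PATHS):
    """Return the expected entries that do not exist under data_dir."""
    base = Path(data_dir)
    return [rel for rel in expected if not (base / rel).exists()]


if __name__ == "__main__":
    # Assumption: ROOT is set in the environment; otherwise use the
    # current working directory.
    data_dir = Path(os.environ.get("ROOT", ".")) / "data"
    for rel in missing_paths(data_dir):
        print("missing:", data_dir / rel)
```

The script prints nothing when every expected file and folder is present. It checks only that the entries exist, not their contents.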
    
    

Training

cd $ROOT
sh script/train_vcoco.sh

Test

cd $ROOT
sh script/test_vcoco.sh

Our pretrained model vcoco_best_model_on_test.pth achieves 52.05 AP on the V-COCO test set.

Contributors

bobwan1995
