Giter Site home page Giter Site logo

crowddetection's Introduction

Detection in Crowded Scenes: One Proposal, Multiple Predictions

A pytorch implementation is available on https://github.com/Purkialo/CrowdDet now.

This is the MegEngine implementation of our paper "Detection in Crowded Scenes: One Proposal, Multiple Predictions", https://arxiv.org/abs/2003.09163, published in CVPR 2020.

Our method aiming at detecting highly-overlapped instances in crowded scenes.

The key of our approach is to let each proposal predict a set of instances that might be highly overlapped rather than a single one in previous proposal-based frameworks. With this scheme, the predictions of nearby proposals are expected to infer the same set of instances, rather than distinguishing individuals, which is much easy to be learned. Equipped with new techniques such as EMD Loss and Set NMS, our detector can effectively handle the difficulty of detecting highly overlapped objects.

The network structure and results are shown here:

Citation

If you use the code in your research, please cite:

@article{chu2020detection,
	title={Detection in Crowded Scenes: One Proposal, Multiple Predictions},
	author={Chu, Xuangeng and Zheng, Anlin and Zhang, Xiangyu and Sun, Jian},
	journal={arXiv preprint arXiv:2003.09163},
	year={2020}
}

Run

  1. Requirements:

    • python3.6.9, MegEngine 0.3.1, cuda10.0
  2. CrowdHuman data:

    • CrowdHuman is a benchmark dataset to better evaluate detectors in crowd scenarios. The dataset can be downloaded from http://www.crowdhuman.org/. The path of the dataset is set in config.py.
  3. Compile gpu_nms library:

    cd lib/layers
    sh ./setup.sh
    
    • Please note that in some environments you may need to modify setup.sh!
  4. Steps to run:

    • Step1: training. More training and testing settings can be set in config.py.
    python3 train.py
    
    • Step2: testing. If you have multiple GPUs, you can use -d 2 to use more GPUs.
    python3 test.py -r 30
    
    • Step3: evaluating.
    python3 .evaluate/compute_APMR.py --detfile ./model/crowd_emd_simple/outputs/eval_dump/dump-30.json --target_key 'box'
    python3 .evaluate/compute_JI.py --detfile ./model/crowd_emd_simple/outputs/eval_dump/dump-30.json --target_key 'box'
    

Models

We use pre-trained model from MegEngine ModelHub: https://data.megengine.org.cn/models/weights/resnet50_fbaug_76254_4e14b7d1.pkl. (or resnet50_fbaug_76254_4e14b7d1.pkl)

All models are based on ResNet-50 FPN.

AP MR JI Model
FPN Baseline 0.8662 0.4227 0.7957 meg_fpn_baseline.pkl
EMD Simple 0.8996 0.4171 0.8223 meg_emd_simple.pkl
EMD with RM 0.9034 0.4110 0.8273 meg_emd_refine.pkl

Contact

If you have any questions, please do not hesitate to contact Xuangeng Chu ([email protected]).

crowddetection's People

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.