Giter Site home page Giter Site logo

chkswiftly / clustered-object-detection-in-aerial-image Goto Github PK

View Code? Open in Web Editor NEW

This project forked from haobabuhaoba/clustered-object-detection-in-aerial-image

0.0 0.0 0.0 9.79 MB

小目标检测的新思路

License: Apache License 2.0

CMake 3.45% Makefile 0.06% Python 92.32% MATLAB 3.55% C++ 0.33% Cuda 0.20% Dockerfile 0.09%

clustered-object-detection-in-aerial-image's Introduction

Clustered-Object-Detection-in-Aerial-Image

The repo is about our recent work on object detection in aerial image, the paper of the work "Clustered Object Detection in Aerial Image" (ICCV2019) and its supplementatry are available here. If you think this project is useful for you, feel free to leave a star. (^^)

Installing codebase

  1. The work is implemented based on Caffe2 , please install it according to the corresponding instruction.
  2. Clone the repo
git clone https://github.com/fyangneil/Clustered-Object-Detection-in-Aerial-Image.git.

Please follow the instruction in Detectron to install the repo.

Generating cluster region ground truth

Here, we use VisDrone dataset as an example to demonstrate the process to generate cluster region ground truth.

  1. run "./detectron/ops/add_cluster_annotation.m" to generate cluster ground truth and add it to original object annotation files.
  2. run "./detectron/ops/visdrone2cocoformat.m" to convert VisDrone format annotation to COCO format.

Train CPNet and global detector

cd $ROOT_DIR/ClusDet
python ./tools/train_net.py \
    --cfg ./configs/e2e_faster_rcnn_R-50-FPN_CPNet_1x_1GPU.yaml \
    OUTPUT_DIR ./trainedmodel/faster_rcnn_R-50-FPN_CPNet_1x_1GPU

Inference CPNet to produce cluster regions on global image

python tools/test_net.py \
    --cfg ./configs/e2e_faster_rcnn_R-50-FPN_CPNet_1x_1GPU.yaml \
    TEST.WEIGHTS ./trainedmodel/faster_rcnn_R-50-FPN_CPNet_1x_1GPU/train/coco_2014_train/generalized_rcnn/model_final.pkl \
    NUM_GPUS 1

crop cluster regions by running

python detectron/ops/crop_cluster_proposals.py

Please Change the corresponding path when used on your computer.

Train detector on global images and cropped cluster chips

python ./tools/train_net.py \
  --cfg ./configs/e2e_faster_rcnn_R-50-FPN_1x_1GPU.yaml \
  OUTPUT_DIR ./trainedmodel/faster_rcnn_R-50-FPN_1x_1GPU

Inference detector on global images and cropped cluster chips

python tools/test_net.py \
    --cfg ./configs/e2e_faster_rcnn_R-50-FPN_1x_1GPU.yaml \
    TEST.WEIGHTS ./trainedmodel/faster_rcnn_R-50-FPN_1x_1GPU/train/coco_2014_train/generalized_rcnn/model_final.pkl \
    NUM_GPUS 1

Fuse the detections from global images and cluster chips

run "./detectron/ops/fuse_global_cluster_detections.m"

Note

If you want to crop the image evenly, run "./detectron/ops/evenly_image_partition.m"

If you have any questions regarding the code, please feel free to contact me [email protected]

Cite

@InProceedings{Yang_2019_ICCV, author = {Yang, Fan and Fan, Heng and Chu, Peng and Blasch, Erik and Ling, Haibin}, title = {Clustered Object Detection in Aerial Images}, booktitle = {The IEEE International Conference on Computer Vision (ICCV)}, month = {October}, year = {2019} }

clustered-object-detection-in-aerial-image's People

Contributors

fyangneil avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.