Giter Site home page Giter Site logo

Comments (6)

bearpaw avatar bearpaw commented on May 18, 2024

Hi @HuAndrew , same question here.

I am playing with this code recently and was also wondering about how did you generate the detection part.

To be more specific, I am detecting humans from the COCO val 2017 keypoints images (5000 images) from the person_keypoints_val2017.json. I try to use Yolo v3 detector and keep only the bounding boxes regarding humans. Then I dump the JSON file which is similar to this repo's.

However, the size of the generated JSON is quite small compared with theirs (~1.3MB vs 16.4MB). Also, when I run cocoEval and use person_keypoints_val2017.json as groundtruth, I can only get about 40 AP.

Any suggestions? Thank you in advance :)

from human-pose-estimation.pytorch.

yurymalkov avatar yurymalkov commented on May 18, 2024

I have the same question. Can you please share your detector or give a link to a similar one?

from human-pose-estimation.pytorch.

namheegordonkim avatar namheegordonkim commented on May 18, 2024

👍 . Related papers keep mentioning of the "person detector used in Simple Baseline..." but it's nowhere to be found

from human-pose-estimation.pytorch.

Odaimoko avatar Odaimoko commented on May 18, 2024

Hi @HuAndrew , same question here.

I am playing with this code recently and was also wondering about how did you generate the detection part.

To be more specific, I am detecting humans from the COCO val 2017 keypoints images (5000 images) from the person_keypoints_val2017.json. I try to use Yolo v3 detector and keep only the bounding boxes regarding humans. Then I dump the JSON file which is similar to this repo's.

However, the size of the generated JSON is quite small compared with theirs (~1.3MB vs 16.4MB). Also, when I run cocoEval and use person_keypoints_val2017.json as groundtruth, I can only get about 40 AP.

Any suggestions? Thank you in advance :)

Well the author said 56.4 AP on person category. I have used Detectron's model . In End-to-End Faster & Mask R-CNN Baselines, the entry X-101-64x4d-FPN with 42.4 box AP can get 55.7 AP on person cat. I think this is competitive.

from human-pose-estimation.pytorch.

HuAndrew avatar HuAndrew commented on May 18, 2024

@bearpaw @Odaimoko Hello, I test multi detector, like mask, cascade_RCNN , and the detector vis and other preds' results are as follows:

vis samples

image

preds samples

256x192_pose_resnet_50_d256d256d256 total person detect AP keypoint
ground truth 11004 XXXXX 72.4
faster author 104125 56.4 70.5
mask rcnn_0.7 13167 48.6 68.1
mask rcnn_0.5 15530 49.5 68.6
mask rcnn_0.3 15796 49.6 68.7
Cascade_RCNN 73597 53.0 70.0

Then
From the test results, something can be found:

  • In order to achieve the purpose of rescore tricks, the author let detector gives multiple detection boxes for every person instance(rescore operation refer to COCO17-Keypoints-TeamOKS). And rescore could amend pred results.
  • Then if we want to get multi bboxs, we can adjust NMS postprocess.
  • As long as the detector position is very correct like gt bbox, the prediction results are also very good.
  • Top-down methods, the detector is very import to improve preds results.
  • But I use multi bbox, preds results are worse. So I guess the author used the byte bboxs and the NMS operation together amend the detectors performance.
  • Other detectors: maskrcnn-benchmark, yolov3.

Welcome to Join pose forum www.ilovepose.com

from human-pose-estimation.pytorch.

wmcnally avatar wmcnally commented on May 18, 2024

Evaluated using the Detectron2 repo:

  • Faster R-CNN with ResNeXt-101 FPN backbone gets 56.6 AP for the person category on COCO val2017.
  • Faster R-CNN with ResNet-101 FPN backbone gets 55.7 AP for the person category on COCO val2017.

https://github.com/facebookresearch/detectron2/blob/main/MODEL_ZOO.md

from human-pose-estimation.pytorch.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.