Giter Site home page Giter Site logo

Training code release? about dex-ycb-toolkit HOT 6 CLOSED

nvlabs avatar nvlabs commented on August 15, 2024
Training code release?

from dex-ycb-toolkit.

Comments (6)

ychao-nvidia avatar ychao-nvidia commented on August 15, 2024 1

For 6D object pose, we have released the code for PoseCNN, DeepIM, and PoseRBPF:
https://github.com/NVlabs/PoseCNN-PyTorch#training-and-testing-on-the-dexycb-dataset
https://github.com/NVlabs/DeepIM-PyTorch#training-and-testing-on-the-dexycb-dataset
https://github.com/NVlabs/PoseRBPF#testing-on-the-dexycb-dataset

As mentioned in the paper Sec. 5.3, we did not retrain PoseRBPF.

from dex-ycb-toolkit.

ychao-nvidia avatar ychao-nvidia commented on August 15, 2024 1

"I understood that the 3D models and poses annotation of the existing YCB and YCB (bop) is slightly different, am I correct?"
-> Correct.

Q1. Can you please clarify with regards to your dataset(ycbdex) pose annotation and 3d models related to bop dataset?? (ex, dexycb annotation follows the original YCB or YCB(bop))
-> We provide both.

  • The original 6D pose annotation we provide in pose.npz (e.g., loaded in the dex-ycb-toolkit API, such as in this example) uses the YCB-Video models. If you download the DexYCB dataset, you can also find a copy of these models under models/.
  • In addition, we also provide the 6D pose annotation using the YCB-V (BOP) models. This is stored under bop/. In fact, bop/ is a copy of the full DexYCB dataset under the BOP format, which you can directly use with any methods that consume this format, e.g. CosyPose. If you look at bop/models/ and bop/models_eval/, you'll find that these models are directly copied from YCB-V (BOP).

Q2. I want to check performance on dexycb dataset using pretrained CosyPose or PoseCNN trained from YCB or YCB(bop) without training dexycb dataset. Have you already tried this?? If yes, Can you share the results?? It would be really helpful.
-> We don't have pre-trained results. It should be possible to get that for PoseCNN using their released repo. You need to regenerate the results with the pre-trained model and then run eval (see here).

Q3. Is there any reason a large clamp was removed compared to the YCB dataset??
-> We did not include 051_large_clamp since it it sufficiently similar to 052_extra_large_clamp.

from dex-ycb-toolkit.

ychao-nvidia avatar ychao-nvidia commented on August 15, 2024

No, there is no plan for that.

from dex-ycb-toolkit.

taeyeopl avatar taeyeopl commented on August 15, 2024

I am curious if there is any plan to provide training codes related to other estimations (2D detection, 6d object pose estimation).

from dex-ycb-toolkit.

taeyeopl avatar taeyeopl commented on August 15, 2024

I have some simple question about the dataset and toolkit!
I observed your toolkit used the bop toolkit and I observed something in bop datasets.
They mentioned that the ground-truth poses are transformed by converted 3D models.
I understood that the 3D models and poses annotation of the existing YCB and YCB (bop) is slightly different, am I correct?
Screenshot 2021-09-09 at 16 43 00

Q1. Can you please clarify with regards to your dataset(ycbdex) pose annotation and 3d models related to bop dataset?? (ex, dexycb annotation follows the original YCB or YCB(bop))

Q2. I want to check performance on dexycb dataset using pretrained CosyPose or PoseCNN trained from YCB or YCB(bop) without training dexycb dataset. Have you already tried this?? If yes, Can you share the results?? It would be really helpful.

Q3. Is there any reason a large clamp was removed compared to the YCB dataset??

from dex-ycb-toolkit.

taeyeopl avatar taeyeopl commented on August 15, 2024

Thanks for the detailed reply!

from dex-ycb-toolkit.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.