DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses

PyTorch implementation of "DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses" (CVPR 2024)

[project page] [paper]

Setup Dependencies

conda create -n dvmnet python=3.8 cmake=3.14.0
conda activate dvmnet
bash ./install.sh

Download the pretrained CroCo model:

wget https://download.europe.naverlabs.com/ComputerVision/CroCo/CroCo_V2_ViTBase_BaseDecoder.pth -P ./croco/

Data Preparation

Please follow the instructions provided in 3DAHV to download and preprocess CO3D, Objaverse, and LINEMOD.

Test pretrained model

We provide a model pretrained on the CO3D training set. Please download it here. By default, the pretrained model is stored at ./models/checkpoint_co3d.ckpt. Run the following command to evaluate it:

python ./test_co3d_dvmnet.py
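Before launching the evaluation, it can help to confirm that both downloaded files are where the steps above place them. The following is a minimal sketch, not part of the repository: the helper name `missing_files` is hypothetical, and the two paths are simply the defaults mentioned in this README (the CroCo weights under ./croco/ and the checkpoint under ./models/).

```python
from pathlib import Path

# Default locations taken from this README; adjust them if you store the files elsewhere.
EXPECTED_FILES = (
    "croco/CroCo_V2_ViTBase_BaseDecoder.pth",
    "models/checkpoint_co3d.ckpt",
)


def missing_files(repo_root="."):
    """Return the expected files that are not present under repo_root (hypothetical helper)."""
    root = Path(repo_root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = missing_files()
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("All expected files found.")
```

Run it from the repository root. An empty result only means the files exist, not that they are complete or uncorrupted.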

Note that the reproduced results might differ slightly from those reported in the paper, because the image pairs used during testing are randomly sampled in the RelPose++ implementation.
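To illustrate why random pair sampling changes the numbers from run to run, here is a small sketch of seeded versus unseeded sampling. It is not the RelPose++ sampler: `sample_pairs` and its parameters are hypothetical, and whether the evaluation script exposes a seed is not stated here.

```python
import random


def sample_pairs(num_frames, num_pairs, seed=None):
    """Draw num_pairs pairs of distinct frame indices (hypothetical helper).

    With seed=None every call may return different pairs, so metrics computed
    on them fluctuate. With a fixed seed the same pairs are drawn each time.
    """
    rng = random.Random(seed)  # local generator, leaves the global RNG untouched
    return [tuple(rng.sample(range(num_frames), 2)) for _ in range(num_pairs)]


if __name__ == "__main__":
    print(sample_pairs(20, 3))          # may change between runs
    print(sample_pairs(20, 3, seed=0))  # identical on every run
```

Fixing a seed makes repeated runs comparable with each other, but it does not reproduce the exact pairs behind the published results.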

Training

CO3D

python ./train_dvmnet_co3d.py

Objaverse

python ./train_dvmnet_objaverse.py

LINEMOD

python ./train_dvmnet_linemod.py

We also implement a 6D pose estimation model, DVMNet_6D. Its translation estimation module is borrowed from RelPose++.
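A 6D pose combines a 3D rotation with a 3D translation. As a rough illustration of what a relative 6D pose between two views means, the sketch below composes it from two absolute camera poses and reports the rotation angle. This is not DVMNet_6D's code: all function names are hypothetical, and it assumes the world-to-camera convention x_cam = R x_world + t, which this README does not specify.

```python
import math


def matmul(a, b):
    """3x3 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]


def transpose(a):
    """3x3 matrix transpose (the inverse of a rotation matrix)."""
    return [[a[j][i] for j in range(3)] for i in range(3)]


def matvec(a, v):
    """3x3 matrix times 3-vector."""
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]


def relative_pose(R1, t1, R2, t2):
    """Pose mapping camera-1 coordinates to camera-2 coordinates.

    Assumes x_cam = R @ x_world + t for both views, which gives
    R_rel = R2 @ R1^T and t_rel = t2 - R_rel @ t1.
    """
    R_rel = matmul(R2, transpose(R1))
    rotated_t1 = matvec(R_rel, t1)
    t_rel = [t2[i] - rotated_t1[i] for i in range(3)]
    return R_rel, t_rel


def rotation_angle_deg(R):
    """Rotation angle of R in degrees, from trace(R) = 1 + 2 cos(theta)."""
    cos_theta = (R[0][0] + R[1][1] + R[2][2] - 1.0) / 2.0
    cos_theta = max(-1.0, min(1.0, cos_theta))  # guard against rounding error
    return math.degrees(math.acos(cos_theta))


def rot_z(deg):
    """Rotation about the z-axis by deg degrees, used only to build an example."""
    a = math.radians(deg)
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]
```

For example, `relative_pose(rot_z(10), [0, 0, 0], rot_z(40), [1, 2, 3])` yields a rotation of about 30 degrees around the z-axis and a translation of `[1, 2, 3]`.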

Citation

If you find the project useful, please consider citing:

@article{zhao2024dvmnet,
  title={DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses},
  author={Zhao, Chen and Zhang, Tong and Dang, Zheng and Salzmann, Mathieu},
  journal={arXiv preprint arXiv:2403.13683},
  year={2024}
}
