Giter Site home page Giter Site logo

About the jointattn layer about wonder3d HOT 2 CLOSED

xxlong0 avatar xxlong0 commented on September 17, 2024
About the jointattn layer

from wonder3d.

Comments (2)

flamehaze1115 avatar flamehaze1115 commented on September 17, 2024

Hi Xiaoxiao,

thanks for your great work!

I have a question about the jointattn layer for normal and rgb images. I found that you didn't use any jointattn layer in the released model, which is different from the paper. Just curious why this still works without the jointattn layer and will the performance becomes better without it?

Hello, thanks for your interetes in our work. Due to some constraints of company, we cannot provide the complete model weights and training codes at this time.
Although the joint cross-domain attention is not activated, the incomplete model still works but is less better than the complete model with cross-domain attention in some cases. Due to our domain switcher design and joint training, the two domains can be roughly aligned in the learned latent space, which should be the potentials of diffusion models. With a cross-domain attention, the consistency of the two domains can be further improved.

from wonder3d.

yf1019 avatar yf1019 commented on September 17, 2024

Hi

Hi Xiaoxiao,
thanks for your great work!
I have a question about the jointattn layer for normal and rgb images. I found that you didn't use any jointattn layer in the released model, which is different from the paper. Just curious why this still works without the jointattn layer and will the performance becomes better without it?

Hello, thanks for your interetes in our work. Due to some constraints of company, we cannot provide the complete model weights and training codes at this time. Although the joint cross-domain attention is not activated, the incomplete model still works but is less better than the complete model with cross-domain attention in some cases. Due to our domain switcher design and joint training, the two domains can be roughly aligned in the learned latent space, which should be the potentials of diffusion models. With a cross-domain attention, the consistency of the two domains can be further improved.

Hi Xiaoxiao,
thanks for your great work!
I have a question about the jointattn layer for normal and rgb images. I found that you didn't use any jointattn layer in the released model, which is different from the paper. Just curious why this still works without the jointattn layer and will the performance becomes better without it?

Hello, thanks for your interetes in our work. Due to some constraints of company, we cannot provide the complete model weights and training codes at this time. Although the joint cross-domain attention is not activated, the incomplete model still works but is less better than the complete model with cross-domain attention in some cases. Due to our domain switcher design and joint training, the two domains can be roughly aligned in the learned latent space, which should be the potentials of diffusion models. With a cross-domain attention, the consistency of the two domains can be further improved.

Got it. Thanks so much for your prompt response~

from wonder3d.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.