Giter Site home page Giter Site logo

Comments (8)

thuwzy avatar thuwzy commented on July 28, 2024 1

Thank you for your interest in our CRM paper!

  1. The decoder cannot directly work at resolution 512x3,072. However, I will upload the model trained on this resolution 512x3,072 if you can get the upsampling image work.
  2. The CCM is for better geometry and cannot be skipped. I have conducted ablation study on Figure 10 in my paper. CRM without CCM input has worse geometry.
  3. Actually depth map can be equivalently transformed to CCM in my framework. So I think it is highly likely to work.
    By the way, I think the resolution of CCM is not very important. I think a good pipeline may be generate 256*1536 image and CCM, and then use neural network to upsample the image and simply resize the CCM to be in the resolution of 512x3,072.

from crm.

mr-lab avatar mr-lab commented on July 28, 2024 1

original model render:
image
couple of Re-renders
image
image
your work is a blessing to us, those are Re-renders of the RGB to retexture the mesh . more consistency is needed .
will move to depth map after that , good depth comes from good RGB.
cheers.

from crm.

mr-lab avatar mr-lab commented on July 28, 2024

thank you very much will be waiting for that model
I will explore more Point 3 .

from crm.

mosvlad avatar mosvlad commented on July 28, 2024

original model render: image couple of Re-renders image image your work is a blessing to us, those are Re-renders of the RGB to retexture the mesh . more consistency is needed . will move to depth map after that , good depth comes from good RGB. cheers.

Awesome!!! Can you share your result with code?

from crm.

zz7379 avatar zz7379 commented on July 28, 2024

original model render: image couple of Re-renders image image your work is a blessing to us, those are Re-renders of the RGB to retexture the mesh . more consistency is needed . will move to depth map after that , good depth comes from good RGB. cheers.

is this a up-scaled rgb? or rendered mesh?

from crm.

mosvlad avatar mosvlad commented on July 28, 2024

I'm trying to upscale RGB from stage1

This:

CRM/run.py

Line 152 in 3e677cb

stage1_images = rt_dict["stage1_images"]

and this:
stage1_images = self.stage1_sample(pixel_img, prompt, scale=scale, step=step)

For upscale i'm used BSRGAN.
https://github.com/cszn/BSRGAN

  1. Generate images (256x1536) by stage 1
  2. Upscale it by BSRGAN (x2 or 4x)
  3. Resize images to (256x1536)
  4. Use upscaled and resized images for generate3d

изображение

This steps not make quality improvement like @mr-lab comments.

Another way i tried make upscale for every image generated in step1:

  1. Generate image (256x256)
  2. Upscale it by BSRGAN (x2 or x4)
  3. Resize to original size (256x256)
  4. Use upscaled and resized images for stage2

изображение
The results are not very good either

Maybe @mr-lab share some more information about his research....

from crm.

mr-lab avatar mr-lab commented on July 28, 2024

@mosvlad
we need a decoder that can process higher resolutions 512*3,072
@thuwzy is probably working on that .
Now we are working on an alternative ,Transfer CRM results to a 3d blob representing the shape of the subject ,
then remodel that blob into a model by moving vertex pos until they match target...still long way to see any good results .
CRM is the only True 3d generator , times and times again proven to provide consistent multi-view shots , no other project can do . will continue to prepare for a larger decoder .

from crm.

snowflakewang avatar snowflakewang commented on July 28, 2024

@thuwzy Hello, I am interested in upscaling the resolution of RGBs to get high-resolution textured meshes. You mentioned that you are working on 512-level decoders. I am curious about the maximum resolution that GPUs (maybe A100/A800) can handle. Is 1024 an acceptable resolution?

from crm.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.