Comments (1)
Hi @bertinma,
Thank you for your interest in our work!
Regarding experiments at a resolution of 1024x768, I must admit that we did not test the training process procedure at that specific resolution, so I may not be able to provide you with precise guidance in that regard.
At a resolution of 512x384, although the loss behavior appeared to be similar, we did notice improvements in the metrics as the training progressed. Did you notice the same improvements in the metrics during training??
Regarding the batch size, did you try to set the --gradient_accumulation_step parameter such that batch_size * gradient_accumulation_parameters is equal to the desired batch size?
Another point: if you want to achieve optimal performance you need to use the flag --train_inversion_adapter during VTO training.
If you have any more questions or need further insights, please feel free to ask.
Best regards,
Alberto
from ladi-vton.
Related Issues (20)
- What is your GPU type and How long you training
- try on tattoos
- How to get image "image-parse-v3" HOT 1
- Out of memory when training the emasc module
- how to using the Stable Diffusion VAE without using the denoising network HOT 1
- Could I ask you for some advice?
- How can i run this project with m1 mac
- Ask for training gpu spec HOT 1
- What data will affect inference results of VITON-HD? HOT 1
- How to make inference on only one photo not just the whole dataset? HOT 4
- Question about ablation study
- How about to maintain details when warping?
- VAE with intermediate features takes up more GPU memory than original VAE HOT 3
- Bad Result on custom image from DressCode Dataset HOT 5
- How to use this on custom dataset?
- hub
- 'CLIPTextTransformer' object has no attribute '_build_causal_attention_mask' HOT 1
- Batch images not being processed by AutoEncoder
- DressCode Garment Reconstruction
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ladi-vton.