Comments (7)
Can you please provide a more detailed explanation of your question? 😺
from diffbir.
No description provided.
Thanks for the reply!
There is torchtext in my local environment. When using DiffBIR and adapting to the inference environment, I found that pytorch_lightning has a dependency on torchtext, /opt/conda/lib/python3.8/site-packages/pytorch_lightning/utilities/apply_func.py. You can also run the code without torchtext installed. I uninstalled torchtext, and finally used the provided sample code to successfully run the results, but the quality of the recovered images was very poor, and I don't know why.
The degradation of the input is not very complex, but the results are strange.
from diffbir.
This issue is unrelated to torchtext. Your result is reasonable and falls under DiffBIR's bad cases. Currently, the stage1 model produces over-smoothed results, causing details like text and small faces to be erased in the first stage, making it impossible for the second stage to perform generation. Additionally, the image you've shown appears to be from a low-resolution video with video compression artifacts. In our experiments, we have also observed that DiffBIR tends to generate some noisy textures when dealing with such compression noise. These are known issues with DiffBIR at the moment, and we are actively working on improvements to enhance its performance. Thank you for bringing this issue to our attention!
from diffbir.
You can improve DiffBIR's performance by tuning inference parameters, such as:
- set
sr_scale
to a value larger than 1 to alleviate the "over-smoothed" problem. - set
prompt guidance scale
to a value larger than 1 to enable the negative prompt, which can reduce bad cases and improve the image quality.
Due to the stochastic nature of the diffusion model, you can also change the random seed to select a good result for yourself (we will never perform this action in our experiments!).
We hope these suggestions are helpful for you. Here are two examples with video compression artifacts:
from diffbir.
You can improve DiffBIR's performance by tuning inference parameters, such as:
- set
sr_scale
to a value larger than 1 to alleviate the "over-smoothed" problem.- set
prompt guidance scale
to a value larger than 1 to enable the negative prompt, which can reduce bad cases and improve the image quality.Due to the stochastic nature of the diffusion model, you can also change the random seed to select a good result for yourself (we will never perform this action in our experiments!).
We hope these suggestions are helpful for you. Here are two examples with video compression artifacts:
Thanks for the reply, I will try to adjust the parameters
from diffbir.
When I installed xformers==0.0.16, it prompted that torch requires 1.13. Have you encountered such a problem?
from diffbir.
Yes, I've encountered this problem as well. xformers has strict requirements for both PyTorch and CUDA versions. I recommend you to create a separate environment as shown in the README.md and then install the PyTorch version that matches with xformers.
from diffbir.
Related Issues (20)
- 使用更大规模的数据训练,会有更好的效果吗? HOT 16
- about how to show low quality image HOT 1
- set w/o restoration module HOT 1
- Can this network do the work of image deblurring? HOT 1
- 为什么要用在低清图上finetune的encoder? HOT 2
- AMD GPU support?
- 请问如何使用 diffbir v2 中的训练结果进行测试呢 HOT 1
- 使用tiled对大图像执行SR时失败了 HOT 1
- 第二张图芯片的那个是怎么跑出来的?我跑不下来这个效果 HOT 3
- 有关测试需要的显存问题 HOT 1
- How to train DiffBIR for face retouching? HOT 5
- 爆显存的问题
- optimizer selection HOT 6
- 有关restoration module的问题 HOT 1
- !!! Exception during processing!!! [Errno 2] No such file or directory
- About the Training Time HOT 1
- only train and infer stage 2 model HOT 1
- about feature modulation. HOT 2
- onnx
- Training problem
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from diffbir.