Comments (4)
It is possible that selected unreliable images and their pseudo masks are too noisy to learn. To check this, could you calculate the Dice of pseudo masks of reliable and unreliable images respectively?
from st-plusplus.
Thanks for your advise.
I calculated the dice of pseudo mask and gt, for reliable and unreliable data respectively. I found that some samples with high DICE could be chosen as unreliable and some samples with low DICE could be chosen as reliable. The mean DICE of reliable and unreliable are 0.76 and 0.58 respectively. Your thought is right maybe. And could your give me some advise how to improve this situation? What should I do to improve the result of the second retraining?
Thanks again!
Looking forward to your reply.
from st-plusplus.
Hi, it may be unavoidable that the automatic partition of reliable and unreliable samples is not perfect.
To pursue higher performance, my tips, which may be not elegant, are that
- You do not need to strictly follow the 50%-50% partition, you can select top 75% samples as reliable ones. The remaining 25% unreliable ones are directly abandoned. (The average dice 0.58 may be caused by a small portion of extremely bad samples.)
- You can slightly change the training procedures. For example, you can first train on all labeled and pseudo labeled samples, and then finetune the model with labeled and reliable unlabeled samples.
- You can additionally use a pixel-level threshold, e.g., 0.95, to filter out unconfident pixels in unreliable images. In this case, you use all pixels in reliable images and only confident pixels in unreliable images for training.
I currently only think of these modifications. I will update this if I come up with better solutions.
from st-plusplus.
I will close this issue. Please feel free to re-open it if you have further questions.
from st-plusplus.
Related Issues (20)
- pseudo- labels生成之后,更新labelled data的问题 HOT 2
- Why "Re-initialize S" in Algorithm2? HOT 2
- 作者你好,我想问一下unlable.txt的文件怎么来的。unlable的标签不应该是没有的吗? HOT 4
- Experiments Question HOT 1
- 有关于灰度图像 HOT 1
- torch.version? HOT 1
- Using the best model to do pseudo labeling HOT 2
- 最终的IOU HOT 3
- 监督训练过程 HOT 3
- 训练过程中的报错 HOT 1
- 数据集的文件放置 HOT 2
- @yyamx 你好,你使用的是Pasca VOC数据集吗?
- 测试验证集
- RuntimeError: cuDNN error: CUDNN_STATUS_BAD_PARAM HOT 2
- 关于 checkpoints 程式码和论文的叙述,似乎不同
- How to test our own custom data set? HOT 3
- 测试集评估代码 HOT 4
- 预训练模型选择
- mode test
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from st-plusplus.