Comments (3)
Thanks for your efforts in exploring it. There are mainly two factors contributing to the segmentation of the tiny 'gaps' or fine 'structures': 1) relatively "high" resolution of the feature map, 2) global and local feature extraction capabilities of network modules. The first factor, relatively "high" resolution, is easy to understand. As we know, the higher the resolution is, the more details we can perceive. Although the resolution of 320x320 is not that high, the tiny structure is recognizable if you view the tiny structures in zoomed-in view. But when we focus on the tiny 'gaps' of fine 'structures', we usually ignore the important fact: we are actually inferencing the tiny gaps from a large scale/global contexture. For example, we can recognize the hairs because we know that they are hairs of a girl, if the face and body or the girl are covered by other stuffs, it will be difficult for us to recognize the hairs (it might not be a good example). So only high resolution are not enough and large or global information are important in segmenting these tiny 'gaps'. Most of other networks only have one receptive in each stage. Although they can also provide high resolution feature maps just before the prediction. But the global contexture info are missing because of small convolution filters in relatively high resolution feature maps. The RSU blocks in each stage of the encoder and decoder is able to achieve both global and local contexture information so that they enable the segmenting of tiny 'gaps'. Another advantage of the RSU blocks against PSP or inception-like blocks is that it downsamples the feature maps to achieve larger scale info and upsamples to recover the resolution. Both downsample and upsample operations are gradually conducted, which avoids degradation of features by drastic downsampling and upsampling.
from u-2-net.
High level information is needed to for low level processing to extract more fine features. You're right, this does happen in a regular unet but the operations are very far apart. This reminds me of cross-scale connections used in EfficientDet(biFPN) and Path Aggregation networks where the idea is very similar but implemented differently. The same global features are propagated whereas here, each block has its own local/global features. How do you think this would differ from that? How do you contrast this with HRNet which has concurrent multi-resolution pathways so all levels can talk to each other?
from u-2-net.
@Nathanua Would appreciate your thoughts
from u-2-net.
Related Issues (20)
- 能否麻烦将整个项目打个包(包含运行文件和预训练模型),集成一个 .bat 文件,点击运行即可使用? HOT 2
- 请问该模型只能输出二分类结果么?可以输出为多分类么?比如三分类 HOT 1
- Is it support person segmentation now? HOT 2
- 如何添加评价指标
- ImportError: cannot import name 'U2NET' from 'model' HOT 2
- Can' Access Human Segmentation Model Weights HOT 1
- Usage on cross platform mobile devices using tenser flow or pytourch ( Special on Flutter )
- 请问大家,我需要前景而非mask,该怎样修改代码呢,感谢
- /alueror: At least one stride in the given numpy aray is mepative, ad tensors with nepative strides are not curently suported. (You can probably work around this by making a copy of your array with array.copy( ).)
- u2net_train.py returning ValueError HOT 1
- the effect of replacing background with U-2-Net is not as expected, anyone can offer a help
- can tell me , label mask must be 0 or 1,if 0 or 1,why binary is smooth, if dataset mask edge in (0,1),it can work? HOT 1
- The results are not good when the object has small holes. HOT 1
- 有什么办法只保留抠图部分尺寸,不需要原图尺寸大小,谢谢 HOT 1
- num_workers=1
- u2net for semantic segmentation
- 自己数据训练的u2net,归一化尺寸320,但是抠图效果边缘锯齿明显
- Paddlehub is no longer under maintenance, what should I do if an error message is installed
- How can i use my own GPU to train
- 如何实现多种分类
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from u-2-net.