Why did you choose Layer Attention instead of normal Channel Attention? Task-inter

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

<a class="user-mention notranslate" data-hovercard-type="user" data-hover

Layer Attention instead of Channel Attention? about tood HOT 5 CLOSED

fcjian commented on June 14, 2024

Layer Attention instead of Channel Attention?

from tood.

Comments (5)

fcjian commented on June 14, 2024

The tasks of object classification and localization have different targets, and thus focus on different types of features (e.g. different levels or receptive fields). The N interactive layers in T-head have different effective receptive fields, which allow them to capture multiple levels of semantics. The layer attention is designed to make full use of this rich information by computing more meaningful features from those layers.

from tood.

iumyx2612 commented on June 14, 2024

The tasks of object classification and localization have different targets, and thus focus on different types of features (e.g. different levels or receptive fields). The N interactive layers in T-head have different effective receptive fields, which allow them to capture multiple levels of semantics. The layer attention is designed to make full use of this rich information by computing more meaningful features from those layers.

I understand the intuition of Layer Attention. However, Layer Attention is just a special type of Channel Attention if we conduct Channel Attention on the concatenated feature maps from N interactive layers. And Channel Attention can also capture multi-level semantic features no? 🤔

from tood.

iumyx2612 commented on June 14, 2024

@fcjian hello can we discuss about this when you have free time?

from tood.

fcjian commented on June 14, 2024

@iumyx2612 The typical channel-wise attention is applied on a single layer. We hold the view that conducting channel-wise attention on multi-layers can be seen as the combination of the typical channel-wise attention and layer attention. So It can also capture multi-level semantic features but requires more parameters and FLOPs.

from tood.

iumyx2612 commented on June 14, 2024

@iumyx2612 The typical channel-wise attention is applied on a single layer. We hold the view that conducting channel-wise attention on multi-layers can be seen as the combination of the typical channel-wise attention and layer attention. So It can also capture multi-level semantic features but requires more parameters and FLOPs.

Thank you very much, understood

from tood.

Recommend Projects

Layer Attention instead of Channel Attention? about tood HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent