acheun9 / pytorch-implementation-of-mobile-former Goto Github PK

View Code? Open in Web Editor NEW

109.0 109.0 16.0 92 KB

Simple implementation of Mobile-Former on Pytorch

Python 100.00%

pytorch-implementation-of-mobile-former's People

Contributors

Stargazers

Watchers

Forkers

ichbinkk congheng pangyanhua mdahao cv-ip ming1993li 121644048 kyrie-zhao llz-lian fyting katherine121 pugangqiang beandkay 13136983989 zfy9822 xurui-joei

pytorch-implementation-of-mobile-former's Issues

some question

class Mobile(nn.Module):
def init(self, ks, inp, hid, out, se, stride, dim, reduction=4, k=2):

hi, call you tell me, the k value why equal to 2, What is it used for

two issues

Great job!
However, I think there are probably two tiny issues in you code.

The first one is in bridge.py(line 24 & line 53). I think there are some differences in the following two lines of code

x = x.reshape(b, c, h*w).transpose(1,2).unsqueeze(1)
x = x.contiguous().view(b, h * w, c).unsqueeze(1)

May be the first line is correct?

The second one is in config.py.Accroding to the original paper, in page 13,

Figure 7. Visualization of cross attention on the two-way bridge: Mobile→Former and Mobile←Former. Mobile-Former-294M is used,which includes 6 tokens (each corresponds to a column) and 11 Mobile-Former blocks (block 2–12) across 4 stages. Each block has two attention heads that are visualized in two rows. Attention in Mobile→Former (left half) is normalized over pixels, showing the focused region per token. Attention in Mobile←Former (right half) is normalized over tokens showing the contribution per token at each pixel.

But in config.py, there are some stages with only one head.

I'm not sure whether the above is correct. Looking forward to your reply!

Preweights?

Hi, I want to extend the model on my own task, will you release pre-trained weights?

Great job！
However, i try the training on imagent and it does not converge.
I also try another implement https://github.com/slwang9353/MobileFormer and it does not converge either.
Does anyone successfully reproduce the results in the paper?

acheun9 / pytorch-implementation-of-mobile-former Goto Github PK

pytorch-implementation-of-mobile-former's People

Contributors

Stargazers

Watchers

Forkers

pytorch-implementation-of-mobile-former's Issues

some question

two issues

Preweights?

Model

配置

Not converge

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent