Hi, I just found that you do not turn on the ema mode in the released code. However, t

Please refer to the following link. ( <div

Feedback on the issue of ema coefficient. about tadaconv HOT 5 CLOSED

PeiqinZhuang commented on May 28, 2024

Feedback on the issue of ema coefficient.

from tadaconv.

Comments (5)

huang-ziyuan commented on May 28, 2024

Thanks for the feedback. Could you specify which model/config are you referring to?

from tadaconv.

PeiqinZhuang commented on May 28, 2024

Please refer to the following link. (

TAdaConv/configs/pool/base.yaml

Line 65 in 75b7839

ENABLE: false

) As mentioned, all configure files are based on the base configure file. However, the ema mode is turned off in the base configure file. If we want to execute the setting of tadaformer_b16_ssv2_16f, it is supposed to explicitly turn on the ema mode. BTW, the ema decay coefficient is different from what you reported in your work, e.g. 0.99996 v.s. 0.9996. Maybe you can check it later.

from tadaconv.

huang-ziyuan commented on May 28, 2024

Yes, if you wish to turn on EMA, you will have to manually enable ema and set the ema decay factor in your config file to run. We indeed used EMA during training TAdaFormer, but we actually observe negligible performance differences for the ema models and the original model. Therefore, turning on ema is not a must for training TAdaFormers.

from tadaconv.

PeiqinZhuang commented on May 28, 2024

As mentioned in your paper, turning on the ema mode may prevent the model from over-fitting problems. I am curious how important it is for training.

from tadaconv.

huang-ziyuan commented on May 28, 2024

It affects the top-1 accuracy by 0.2~0.4 for large models.

from tadaconv.

Feedback on the issue of ema coefficient. about tadaconv HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent