question about "samples_per_bn" (openunreid, closed)

valencebond commented on May 27, 2024

question about "samples_per_bn"

from openunreid.

Comments (3)

yxgeee commented on May 27, 2024

When samples_per_bn is smaller than samples_per_gpu, sync BN cannot work, so it is disabled. Note that the effective samples_per_bn must be at least samples_per_gpu (samples_per_bn = N x samples_per_gpu, N >= 1), so if samples_per_bn < samples_per_gpu, the effective samples_per_bn defaults to samples_per_gpu.
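To make the rule above concrete, here is a minimal sketch in plain Python. The helper `resolve_samples_per_bn` is hypothetical and is not taken from the OpenUnReID code base. Rejecting values that are not a multiple of samples_per_gpu is also an assumption, since the thread only states that samples_per_bn = N x samples_per_gpu.

```python
from typing import Tuple


def resolve_samples_per_bn(samples_per_bn: int, samples_per_gpu: int) -> Tuple[int, int]:
    """Return (effective samples per BN, number of GPUs sharing BN statistics).

    Hypothetical helper for illustration only; not part of OpenUnReID.
    """
    if samples_per_bn <= 0 or samples_per_gpu <= 0:
        raise ValueError("both values must be positive")
    if samples_per_bn < samples_per_gpu:
        # A BN layer always sees the full per-GPU batch, so the effective
        # value falls back to samples_per_gpu and sync BN is not used.
        return samples_per_gpu, 1
    if samples_per_bn % samples_per_gpu != 0:
        # Assumption: non-multiples are rejected rather than rounded.
        raise ValueError("samples_per_bn must be a multiple of samples_per_gpu")
    # N = samples_per_bn / samples_per_gpu GPUs share statistics (N >= 1).
    return samples_per_bn, samples_per_bn // samples_per_gpu


print(resolve_samples_per_bn(8, 16))   # (16, 1): request raised, no sync
print(resolve_samples_per_bn(16, 16))  # (16, 1): one GPU per BN, no sync
print(resolve_samples_per_bn(16, 8))   # (16, 2): two GPUs sync their BNs
```

A second element of 1 means each GPU normalizes over its own batch only; anything larger means sync BN would be needed across that many GPUs.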

In source-domain pre-training, the best setup is a batch size of 64 with global sync BN (samples_per_bn=64). In target-domain fine-tuning on 4 GPUs, the best setup is a batch size of 64 with no sync BN (samples_per_bn=16, 16x4=64). However, when running experiments on 8 GPUs (a batch size of 8 on each GPU), sync BN is activated, since every two GPUs need to sync their BNs to reach samples_per_bn=16.
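The three setups above can be sketched as groups of GPU ranks that share BN statistics. The function `bn_sync_groups` is hypothetical, and pairing contiguous ranks is an assumption, since the thread does not say how GPUs are grouped. The 4-GPU count for source-domain pre-training is likewise assumed for illustration.

```python
from typing import List


def bn_sync_groups(num_gpus: int, samples_per_gpu: int, samples_per_bn: int) -> List[List[int]]:
    """Partition GPU ranks into groups that would share BN statistics.

    Hypothetical helper; contiguous grouping of ranks is an assumption.
    """
    # At least one GPU per group, even if samples_per_bn < samples_per_gpu.
    gpus_per_group = max(1, samples_per_bn // samples_per_gpu)
    if num_gpus % gpus_per_group != 0:
        raise ValueError("num_gpus must be divisible by the group size")
    return [
        list(range(start, start + gpus_per_group))
        for start in range(0, num_gpus, gpus_per_group)
    ]


# Source-domain pre-training, assumed 4 GPUs x 16 samples, samples_per_bn=64.
print(bn_sync_groups(4, 16, 64))  # [[0, 1, 2, 3]] -> global sync BN
# Target-domain fine-tuning, 4 GPUs x 16 samples, samples_per_bn=16.
print(bn_sync_groups(4, 16, 16))  # [[0], [1], [2], [3]] -> no sync BN
# Target-domain fine-tuning, 8 GPUs x 8 samples, samples_per_bn=16.
print(bn_sync_groups(8, 8, 16))   # [[0, 1], [2, 3], [4, 5], [6, 7]]
```

In PyTorch, groups like these could in principle be turned into process groups with `torch.distributed.new_group` and passed to `torch.nn.SyncBatchNorm.convert_sync_batchnorm(module, process_group=...)`; whether OpenUnReID does exactly this is not confirmed in this thread.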

valencebond commented on May 27, 2024

Hi @yxgeee, thanks for the detailed explanation. But why is training without sync BN the best setup in target-domain fine-tuning? I am not familiar with domain adaptation, but this goes against common practice in general CNN training.

yxgeee commented on May 27, 2024

This optimal setup (16 samples per BN and 64 samples per mini-batch) was found empirically, as re-ID datasets are sensitive to the batch size.
