Comments (1)
Hi @ywdong,
Here is the config for the ViT-B/4, which can be run on 48 AWS p4d-24xlarge machines. We don't have a ViT-S/8 run with the current version of MSN, but will release all the configs shortly and let you know once they're available!
criterion:
ent_weight: 0.0
final_sharpen: 0.25
me_max: true
memax_weight: 1.0
num_proto: 1024
start_sharpen: 0.25
temperature: 0.1
batch_size: 3
use_ent: true
use_sinkhorn: true
data:
color_jitter_strength: 0.5
pin_mem: false
num_workers: 0
image_folder: imagenet_full_size/061417/
label_smoothing: 0.0
patch_drop: 0.7
rand_size: 224
focal_size: 96
rand_views: 1
focal_views: 10
root_path: /datasets/
logging:
folder: /path_to_save_vitb4_logs/
write_tag: msn
meta:
bottleneck: 1
copy_data: false
drop_path_rate: 0.0
hidden_dim: 2048
load_checkpoint: false
model_name: deit_base_p4
output_dim: 256
read_checkpoint: null
use_bn: true
use_fp16: false
use_pred_head: false
optimization:
clip_grad: 3.0
epochs: 400
final_lr: 1.0e-06
final_weight_decay: 0.4
lr: 0.001
start_lr: 0.0002
warmup: 15
weight_decay: 0.04
from msn.
Related Issues (20)
- linear_eval.py #L327 HOT 1
- pretrained weights are probably incorrect HOT 7
- Could you please give the version of cyanure you used? HOT 1
- About the 1% In1k semi-sup evaluation HOT 1
- module 'cyanure' has no attribute 'preprocess' HOT 4
- The detail setting for 1% evaluation HOT 1
- Include full checkpoint HOT 1
- Custom dataset Linear Eval
- Lambda for logistic regression evaluation
- Performance when small batch size
- why did not take block-wise mask strategy? HOT 1
- vit-b-16 config
- can you release the time complexity of training ViT B/4 and ViT L/7
- Using just the encoder
- How to change MSN loss to PMSN loss? (from paper "The Hidden Uniform Cluster Prior in Self-Supervised Learning") HOT 1
- What is the use of "AllReduce"?
- The loss has converged at early stage?
- No such file HOT 1
- Duplicate GPU detected error HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from msn.