xiangning-chen / smoothdarts
Code for our ICML'2020 paper "Stabilizing Differentiable Architecture Search via Perturbation-based Regularization"
Hi, Xiangning,
I am very interested in your paper and your code. Great job!
My question is how Figure 1 in your paper was drawn.
I look forward to your explanation.
Dear authors,
Thanks for providing the code of SDARTS.
I noticed that in sota/cnn/train_imagenet.py the batch_size is set to 1024. Does that mean the ImageNet results of SDARTS in the paper were obtained with 8-GPU training, i.e., the same setup used by PC-DARTS / P-DARTS?
Thanks!
Hi, Xiangning.
Thanks for your great work and open-source code.
I evaluated (training for 300 epochs) the initial architecture and the architectures obtained after searching for 50 epochs with random and pgd, respectively. The results are odd, as summarized below.
- The architecture does not improve after searching in 4 runs.
- The random architecture seems better than your searched result.
I also evaluated your results, shown in the last line. Every initial architecture is better than your random architecture, and one of the initial architectures is better than your pgd architecture.
- I evaluated the architecture after training for 50 epochs. Did you use early stopping?
Could you please help me figure it out? Thanks!
BR
Hi,
Thanks for your good work.
I'm trying to run this model, but I have a question about the PyTorch and torchvision versions.
Could you please tell me which PyTorch and torchvision versions were used in the experiments?
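When reporting or comparing environments for a question like this, it can help to print the locally installed versions first. The snippet below is a minimal sketch using only the Python standard library; the helper name `installed_version` is hypothetical, and the output says nothing about which versions the authors actually used.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Report the two packages the question asks about.
for name in ("torch", "torchvision"):
    print(name, installed_version(name) or "not installed")
```

Including this output in the issue makes it easier for the maintainers to spot a version mismatch.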
Hi, thanks for releasing the code.
I ran the search and evaluation experiments on s1~s4 on CIFAR-10, but the results fall short of those reported in the paper.
Could you share more about the hyperparameter settings for these? Thanks!
Hi, would you please explain why you bypass softmax in the forward computation while applying projected gradient descent to maximize the loss with respect to arch_parameters? How does the performance compare to updateType='alpha'?
Thanks!
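For readers unfamiliar with the setup being asked about, here is a toy sketch of one reading of the question: softmax is applied once to the architecture logits, and an L-infinity projected gradient ascent then perturbs the resulting mixing weights directly, so the attack's forward pass does not apply softmax again. This is not the repository's code. The function names, the fixed toy gradient, and all numeric values are assumptions chosen for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sign(x):
    return (x > 0) - (x < 0)


def pgd_linf(weights, grad_fn, epsilon, step_size, steps):
    """Maximize a loss over delta subject to ||delta||_inf <= epsilon.

    grad_fn(w) must return the gradient of the loss at the point w.
    """
    delta = [0.0] * len(weights)
    for _ in range(steps):
        grad = grad_fn([w + d for w, d in zip(weights, delta)])
        # Signed-gradient ascent step, then projection onto the L-inf ball.
        delta = [
            max(-epsilon, min(epsilon, d + step_size * sign(g)))
            for d, g in zip(delta, grad)
        ]
    return [w + d for w, d in zip(weights, delta)]


# Toy setup: a linear loss whose gradient w.r.t. the weights is constant.
toy_grad = lambda w: [1.0, -2.0, 0.0]
mixing_weights = softmax([0.0, 0.0, 0.0])  # softmax applied once, up front
perturbed = pgd_linf(mixing_weights, toy_grad, epsilon=0.1, step_size=0.05, steps=4)
print(perturbed)
```

In this sketch the perturbed weights no longer need to sum to one. That is one consequence of attacking in the post-softmax space rather than on the logits.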
Hi, thank you for sharing your code. I have a question about the unrolling.
SmoothDARTS/sota/cnn/train_search.py
Line 52 in 83710f1
This flag is false by default, and the execution command presented in README.md is, for example, SDARTS-RS: cd sota/cnn && python train_search.py --search_space=s1 --perturb_alpha=random. Did you use unrolled in the main experiments?
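The behaviour described above can be reproduced with a small argparse sketch. It assumes the flag is declared as a `store_true` option named `--unrolled`. The parser below is a hypothetical reconstruction, not the actual contents of train_search.py, and the default values for `--search_space` and `--perturb_alpha` are placeholders.

```python
import argparse


def build_parser():
    # Hypothetical reconstruction: only the flag names come from the thread.
    parser = argparse.ArgumentParser("train_search sketch")
    parser.add_argument("--search_space", type=str, default="s1")
    parser.add_argument("--perturb_alpha", type=str, default="none")
    # A store_true flag stays False unless it is passed explicitly.
    parser.add_argument("--unrolled", action="store_true", default=False)
    return parser


# Arguments from the README command quoted above (no --unrolled).
readme_args = build_parser().parse_args(
    ["--search_space=s1", "--perturb_alpha=random"]
)
# The same command with the flag added.
with_flag = build_parser().parse_args(
    ["--search_space=s1", "--perturb_alpha=random", "--unrolled"]
)

print(readme_args.unrolled)
print(with_flag.unrolled)
```

Under that assumption, the README command as written runs without unrolling, and it is only enabled when `--unrolled` is appended.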
Hi.
Thanks for releasing the code. Great job!
However, when I ran the experiments on s1 (CIFAR-10), the results were completely different from those in the paper: whether I use the random or the pgd_linf method, almost all operations selected when the search finishes are still skip_conn. Are there any special tricks or hyperparameter settings used in the experiments that are not mentioned?
I look forward to your reply. Thanks.