Light

marsaki / nasnet Goto Github PK

View Code? Open in Web Editor NEW

42.0 1.0 13.0 22.81 MB

reimplementation of "Learning Transferable Architectures for Scalable Image Recognition" using mnist dataset, include controller

Python 99.35% Shell 0.65%

nasnet automl image-classification neural-architecture-search pytorch-implementation

nasnet's Introduction

NasNet 2018

This code is the reimplementation of "Learning Transferable Architectures for Scalable Image Recognition", including the training process of controller. This code contains three algorithms to search model, Random Search, Policy Gradient and PPO.

Requirements

Python >= 3.6.7, PyTorch == 0.4.0

Architecture Search

python train_search.py --cutout --algorithm RS  #use random search
python train_search.py --cutout --algorithm PG  #use policy gradient
python train_search.py --cutout --algorithm PPO #use PPO

Note the validation performance in this step does not indicate the final performance of the architecture. One must train the obtained genotype/architecture from scratch using full-sized models. Also the default setting is training with 20 processes and 3 GPU. Change the processes to 10:

python train_search.py --cutout --episodes 10

or modify code in random_search.py, policy_gradient.py and PPO.py .

Architecture Evaluation

Because of the limitation of time and computation resource, I didn't train the candidate genotypes/architectures from scratch.

Results

python draw.py

We can see RL search is better than random search, also PPO is more stable and faster.

nasnet's People

Contributors

Stargazers

Watchers

Forkers

zhangrj91 pinglmlcv johnzhjw abadianfatemeh iksooman eronhou cscylei yintianan xueliu8617112 dakies 13015517713 afzalxo jqjin123

nasnet's Issues

What are node decoders?

policy_gradient.py's problems

hello.
I have two questions in policy_gradient.py to ask you.
first:
def cal_loss(self, actions_p, actions_log_p, worker, baseline):
reward = worker.acc - baseline
policy_loss = -1 * torch.sum(actions_log_p * reward)
entropy = -1 * torch.sum(actions_p * actions_log_p)
entropy_bonus = -1 * entropy * self.entropy_weight
what does entory mean?
second:
for episode in range(self.episodes):
worker = results_queue.get()
# worker.actions_p = torch.Tensor(worker.actions_p).to(self.device)
worker.actions_index = torch.LongTensor(worker.actions_index).to(self.device)
workers.append(worker)

worker.actions_index = torch.LongTensor(worker.actions_index).to(self.device) ,the worker.actions_index does't change?

What the step means in controller

What does the step mean in the controller?

for node in range(self.steps):

I think it's a number of blocks in each cell. In the original paper, there are 5 blocks in either normal cell or reduction cell.

why it is 4?

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.