1. Working Group Meeting (June 9, 2018)
Attendees: Timo Denk, Samed Güner
Tracks
The research focus is on the defense track. However, state-of-the-art knowledge about attacks is required in order to validate new defenses. We need to do research on both topics and come up with something new on the defense track.
- Defense: Find a function X -> W, where X is the set of all images and W is the set of classes. The function is parametrized by theta.
- Attack: Find a function (x1, w, theta) -> x2, which takes an image x1 of class w and, using the weights theta of the given model, finds another image x2 such that |x2 - x1| is minimal and x2 is misclassified.
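To make the attack definition concrete, here is a minimal sketch on a toy linear model (the `predict`, `attack`, `step`, and `max_steps` names are illustrative assumptions, not part of the challenge API): it nudges x1 toward the runner-up class until the model misclassifies, keeping |x2 - x1| small.

```python
import numpy as np

def predict(theta, x):
    """Class scores of a toy linear model; argmax gives the label."""
    return theta @ x

def attack(x1, w, theta, step=0.01, max_steps=1000):
    """Untargeted attack sketch: take small steps that shrink the score
    margin of the true class w until the model misclassifies. For a
    linear model the steepest direction is theta[r] - theta[w], where
    r is the runner-up class."""
    x2 = x1.astype(float).copy()
    for _ in range(max_steps):
        scores = predict(theta, x2)
        if np.argmax(scores) != w:
            return x2  # misclassified: adversarial example found
        r = np.argsort(scores)[-2]  # runner-up class
        direction = theta[r] - theta[w]
        x2 += step * direction / np.linalg.norm(direction)
    return x2
```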
Evaluation Criterion
- Let M be the model and S be the set of samples.
- We apply the five best untargeted attacks to M for each sample in S. (Open questions: do we sample from training or test data? Do targeted attacks come into play as well?)
- For each sample we record the minimum adversarial L2 distance (MAD) across the attacks. L2 can behave in unintuitive ways in high dimensions (curse of dimensionality); our own validation should therefore also use the L2 distance.
- If the model misclassifies a sample, the minimum adversarial distance for that sample is recorded as zero.
- The final model score is the median MAD across all samples.
- The higher the score, the better.
Our deployment pipeline should perform validation in a very similar manner. In particular L2 distance and median.
Example (Python sketch):

    import numpy as np

    def model_score(model, samples, attacks):
        """Median minimum adversarial L2 distance (MAD) over all samples."""
        d = []
        for x, label in samples:
            if model(x) != label:
                d.append(0.0)  # already misclassified: MAD is zero
                continue
            perturbed = [attack(model, x, label) for attack in attacks]
            d.append(min(np.linalg.norm(p - x) for p in perturbed))
        return np.median(d)
Deadlines
- June 25th, 2018: Challenge begins
- November 1st, 2018: Final submission date
- November 15th, 2018: Winners announced
Research
We need to do research on both topics. Relevant papers need to be identified as soon as possible.
Papers
Ideas
Linear combinations of inputs (for evaluation): determine the distance from an image to the first misclassified input when linearly approaching an image of another class. Analyze how noisy the classifications along the line are.
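A minimal sketch of this interpolation idea, assuming a `classify` function that returns a label (an illustrative interface, not an agreed one): walk linearly from an image toward one of another class and record the L2 distance to the first misclassified point.

```python
import numpy as np

def first_misclassification_distance(classify, x_a, label_a, x_b, steps=100):
    """Walk the line from x_a (class label_a) toward x_b; return the L2
    distance from x_a to the first sampled point the classifier no
    longer labels as label_a, or None if none is found."""
    for t in np.linspace(0.0, 1.0, steps + 1):
        x = (1.0 - t) * x_a + t * x_b
        if classify(x) != label_a:
            return np.linalg.norm(x - x_a)
    return None
```

Counting how often the label flips along the sampled line would give the "noisiness" measure mentioned above.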
Derivative penalties: regularize training with penalties for high first- and second-order derivatives with respect to input changes.
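A sketch of the first-order part of such a penalty, approximating the input gradient of an assumed scalar-output model `f` with central finite differences (real training would use autodiff instead):

```python
import numpy as np

def gradient_penalty(f, x, eps=1e-4):
    """Squared norm of df/dx at x, approximated by central differences.
    Added to the training loss, this penalizes outputs that change
    sharply under small input perturbations."""
    grad = np.zeros_like(x, dtype=float)
    for i in range(x.size):
        e = np.zeros_like(x, dtype=float)
        e[i] = eps
        grad[i] = (f(x + e) - f(x - e)) / (2.0 * eps)
    return float(np.sum(grad ** 2))
```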
Growing filters of the CNN, similar to the progressive growing of GANs. I have not seen any research going in this direction, but it might work. New filters would be faded in slowly.
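A rough sketch of how the fading could work, borrowing the alpha-ramp from progressive GAN growing (the function name, shapes, and the alpha schedule are assumptions):

```python
import numpy as np

def grow_filters(kernels, new_kernels, alpha):
    """Append newly added conv kernels scaled by alpha in [0, 1];
    ramping alpha from 0 to 1 during training fades the new filters
    in slowly instead of adding them abruptly.
    kernels, new_kernels: arrays of shape (n, h, w)."""
    return np.concatenate([kernels, alpha * new_kernels], axis=0)
```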
Fisher information matrix for network size reduction (see "Overcoming Catastrophic Forgetting in Neural Networks"). The matrix contains information on how relevant certain weights are for classification.
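A sketch of the diagonal Fisher approximation used in that paper, assuming a user-supplied `grad_log_lik(theta, x)` helper that returns the per-sample gradient of the log-likelihood with respect to the weights (an assumed helper, not an existing API):

```python
import numpy as np

def diagonal_fisher(grad_log_lik, theta, samples):
    """Diagonal Fisher information estimate: average of squared
    per-sample log-likelihood gradients wrt the weights. Large
    entries mark weights that matter most for classification."""
    return np.mean([grad_log_lik(theta, x) ** 2 for x in samples], axis=0)
```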
Dropout at the kernel level / additive noise on the kernels of higher layers. This might be common already; we have to do research on that.
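A sketch of what kernel-level dropout with additive noise could look like at training time (names, shapes, and default rates are assumptions):

```python
import numpy as np

def perturb_kernels(kernels, drop_prob=0.1, noise_std=0.01, rng=None):
    """Training-time regularization sketch: add Gaussian noise to every
    kernel, then zero out whole kernels with probability drop_prob
    (dropout at the kernel level). kernels: shape (n, h, w)."""
    rng = np.random.default_rng() if rng is None else rng
    noise = rng.normal(0.0, noise_std, size=kernels.shape)
    keep = rng.random(kernels.shape[0]) >= drop_prob
    return (kernels + noise) * keep[:, None, None]
```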
Deployment and Infrastructure
Deployment on AWS or GCP (Azure is not an option, and never was).
Funding through the free-tier budget and our own money; subsequently, and in the long run, sponsoring by the SAP Machine Learning Foundation. There might also be sponsored computing power available.
Miscellaneous