Improving models of the ventral stream using human behavioral data

behavioralthor's People

Stargazers

Watchers

behavioralthor's Issues

Dropout network

Because @daseibert and I have not been able to get access to a running model that we can use for our projects, and because we are interested having the ability to quickly train a neural network with regularization new data (instead of having to run an HMO screen, which we also cannot do) he and I are planning to work on implementing one of the following existing pieces of code:

Mostly CUDA

http://cs.nyu.edu/~wanli/dropc/ drop connect from nyu
https://github.com/dnouri/cuda-convnet forked from alex's code

Mostly python

https://github.com/mdenil/dropout theano implementation
https://github.com/Philip-Bachman/NN-Dropout i don't know what this is

I have not yet decided which would be best to attempt, but it seems the theano implementation would most closely integrate with out existing codebase. @yamins81 Is this true, and which one would you recommend?

Directions forward

@yamins81
There are 2 main goals

A screening set that is representative of the difficulty in the 1000-way categorization task, for creating a challenge submission
A screening set that is representative of the difficulty that humans are good at in all of imagenet, for getting better neural fits

re 1)
We should use random L3 models (5 sets of features, one from each random model) and find a set of images that is hard to separate on average for the model class. This would mean extracting #N1 images from each synset, then getting margins for all 2-ways for each image. Then, we could just take the mean of the set of negative margins for each image as a score, and take the #N2 lowest scoring images.

re 2)
We should find the largest negative margins as above, but then for each of these margins, test it in humans. This means that we will have ranked list of tuples ranked by margin (most negative first):
(image, distractor_synset, margin)

And we will search through this set of image tuples using psychophysics to find the first (going down the ordered list) #N2 tuples that have a performance above some threshold.

Here are some training curve results for MCC2 classification
The results for linearsvc are still being calculated (takes about 210 minutes to generate one of these curves.)

Immediate points of action:

Deciding how many images per synset to extract (#N1), then extracting them.
Deciding the size of the screening set (#N2)

N1 seems to be around 400 given the training curve (saturation around 300-350, need 50-100 test examples)

If you agree with this decision for #N1, then I will create a new dataset called PixelHardSynsets which you should then extract

import imagenet
dataset = imagenet.dldataset.PixelHardSynsets

Recommend Projects

ardila / behavioralthor Goto Github PK

behavioralthor's Introduction

behavioralthor's People

Stargazers

Watchers

Forkers

behavioralthor's Issues

Dropout network

Mostly CUDA

Mostly python

Directions forward

N1 seems to be around 400 given the training curve (saturation around 300-350, need 50-100 test examples)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent