Giter Site home page Giter Site logo

behavioralthor's Introduction

Improving models of the ventral stream using human behavioral data

behavioralthor's People

Stargazers

Tom Wallis avatar

Watchers

James Cloos avatar Diego Ardila avatar  avatar

Forkers

dtbinh

behavioralthor's Issues

Dropout network

Because @daseibert and I have not been able to get access to a running model that we can use for our projects, and because we are interested having the ability to quickly train a neural network with regularization new data (instead of having to run an HMO screen, which we also cannot do) he and I are planning to work on implementing one of the following existing pieces of code:

Mostly CUDA

  1. http://cs.nyu.edu/~wanli/dropc/ drop connect from nyu
  2. https://github.com/dnouri/cuda-convnet forked from alex's code

Mostly python

  1. https://github.com/mdenil/dropout theano implementation
  2. https://github.com/Philip-Bachman/NN-Dropout i don't know what this is

I have not yet decided which would be best to attempt, but it seems the theano implementation would most closely integrate with out existing codebase. @yamins81 Is this true, and which one would you recommend?

Directions forward

@yamins81
There are 2 main goals

  1. A screening set that is representative of the difficulty in the 1000-way categorization task, for creating a challenge submission

  2. A screening set that is representative of the difficulty that humans are good at in all of imagenet, for getting better neural fits

re 1)
We should use random L3 models (5 sets of features, one from each random model) and find a set of images that is hard to separate on average for the model class. This would mean extracting #N1 images from each synset, then getting margins for all 2-ways for each image. Then, we could just take the mean of the set of negative margins for each image as a score, and take the #N2 lowest scoring images.

re 2)
We should find the largest negative margins as above, but then for each of these margins, test it in humans. This means that we will have ranked list of tuples ranked by margin (most negative first):
(image, distractor_synset, margin)

And we will search through this set of image tuples using psychophysics to find the first (going down the ordered list) #N2 tuples that have a performance above some threshold.

Here are some training curve results for MCC2 classification
The results for linearsvc are still being calculated (takes about 210 minutes to generate one of these curves.)

screen shot 2013-09-30 at 5 22 44 pm

Immediate points of action:

  1. Deciding how many images per synset to extract (#N1), then extracting them.
  2. Deciding the size of the screening set (#N2)

N1 seems to be around 400 given the training curve (saturation around 300-350, need 50-100 test examples)

If you agree with this decision for #N1, then I will create a new dataset called PixelHardSynsets which you should then extract

import imagenet
dataset = imagenet.dldataset.PixelHardSynsets

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.