Giter Site home page Giter Site logo

emap_emnlp2020's Introduction

What's in here?

This repo contains (a very short amount of) code from: Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think! The bibtex is:

@inproceedings{hessel2020emap,
	title={Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!},
	author={Hessel, Jack and Lee, Lillian},
	booktitle={EMNLP},
	year={2020}
}

what does this code do?

The code here implements a minimal version of Empirical Multimodally-Additive Projections (EMAP), as described in our EMNLP paper.

TL;DR: if you have a multimodal classification task, EMAP can provide insight in whether or not your model is using conditional, cross-modal interactions (like cross-modal attention) to make more accurate predictions (or not). For many multimodal tasks, we imagine that our algorithms are doing the types of inferences that we do, carefully comparing different aspects of images and text and then making an informed decision based on that interaction. But are they? EMAP can tell you! (see the paper for more details!)

how do you use it?

Say you had N evaluation datapoints consisting of text/visual pairings ti, vi. The emap function in emap.py assumes as input a dictionary that maps from indices i,j \in {1...N} to the output logits of your predictor f evaluated on ti,vj for each class. The function returns the projected predictions of your predictor's EMAP, which are as-close-as-possible to your original model's predictions, but only have additive structure.

emap_emnlp2020's People

Contributors

jmhessel avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.