tf-models's Introduction

The goal of this repository is to collect reproducible benchmarks for NLP tasks using standard data sets. Unless otherwise noted, all models are implemented in TensorFlow.

Why?

Reproducibility is a major problem in NLP research. I have often talked to researchers who could not reproduce the numbers reported by others. Even though the Deep Learning community is more open and transparent than most other communities, many researchers don't publish their code. Even when code is published, it is often difficult to compare techniques directly, because the implementation framework or the data pre-processing pipeline differs. This repository therefore implements popular Deep Learning models and evaluates them on standard data sets. The benchmarks and models here can serve as an entry point for new researchers to implement and compare their own models.

Data Sets

  • Movie Reviews (MR): Movie reviews from Rotten Tomatoes. 5331 positive and 5331 negative processed sentences / snippets. (Source)
  • Stanford Sentiment Treebank (SST): A variation on the MR dataset with individual subphrases labeled on Amazon Mechanical Turk. The Stanford Parser was used to parse all 10,662 sentences; in approximately 1,100 cases it split the snippet into multiple sentences. The resulting 215,154 phrases were then labeled on Amazon Mechanical Turk. This dataset was first used in Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank. (Source)
  • IMDB Movie Reviews (IMDB): TODO (Source)
  • 20 Newsgroups: TODO (Source)
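
As a rough illustration of how the MR data described above might be read into labeled examples, here is a minimal Python sketch. It is not code from this repository. The function name `load_polarity`, the one-snippet-per-line file layout, and the `latin-1` default encoding are all assumptions; adjust them to match the files you actually download.

```python
from typing import List, Tuple


def load_polarity(
    pos_path: str, neg_path: str, encoding: str = "latin-1"
) -> Tuple[List[str], List[int]]:
    """Read one snippet per line; label positives 1 and negatives 0.

    Hypothetical helper: the file layout and default encoding are
    assumptions, not something this repository specifies.
    """
    texts: List[str] = []
    labels: List[int] = []
    for path, label in ((pos_path, 1), (neg_path, 0)):
        with open(path, encoding=encoding) as handle:
            for line in handle:
                snippet = line.strip()
                if snippet:  # skip blank lines
                    texts.append(snippet)
                    labels.append(label)
    return texts, labels
```

If the files match the description above, counting the labels should give 5331 examples per class, which makes for a quick sanity check before training anything.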

Models

tf-models's People

Contributors

dennybritz
