Giter Site home page Giter Site logo

xiaolong-yun / emnlp2017_doc Goto Github PK

View Code? Open in Web Editor NEW

This project forked from leishu02/emnlp2017_doc

0.0 1.0 0.0 18 KB

code for our EMNLP 2017 paper "DOC: Deep Open Classification of Text Documents"

License: MIT License

Jupyter Notebook 72.71% Python 27.29%

emnlp2017_doc's Introduction

EMNLP2017_DOC

code for our EMNLP 2018 paper "DOC: Deep Open Classification of Text Documents"

DOC's experiment setting is huge. I trimmed them into one file containing every function from pre-processing till evaluating. In paper, I use google-new pretrained embedding. This code does not use pretrained embedding. If you want to fully re-produce the result, you may need to randomly sample 10 times seen-unseen classes split and load the pretrained embedding.

20NewsGroup: please download 20news-18828.tar.gz from http://qwone.com/~jason/20Newsgroups/ (preprocess: use every word inside)

50EleReviews.json contains 50-product original reviews.( 25% of 50 classes: 12 seen classes, 75% of 50 classes: 37 seen classes). please download from https://drive.google.com/file/d/1Kgtqbp0B67S-f4W7ULfG-_YP2kOdZ-Do/view?usp=sharing

DOC_emnlp17.py or .ipynb (ipython notebook, it has running results) contains code.

We have one continual project which solves UNSEEN CLASS DISCOVERY IN OPEN-WORLD CLASSIFICATION https://arxiv.org/pdf/1801.05609.pdf. It shows that DOC also works well on image.

We have one meta-learning based continuing work recently accepted at the web conference (WWW) 2019:Open-world Learning and Application to Product Classification (code and data is available, see link in paper) https://www.cs.uic.edu/~liub/publications/WWW-2019-camera-ready.pdf

library: python 2.7 keras 2.1.2 scipy json numpy sklearn jupyter (if you want to use .ipynb file)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.