Giter Site home page Giter Site logo

liangliangbupt / jacana Goto Github PK

View Code? Open in Web Editor NEW

This project forked from xuchen/jacana

0.0 1.0 0.0 168 MB

Automatically exported from code.google.com/p/jacana

Python 5.37% Shell 0.77% Assembly 10.93% Perl 6 5.11% Batchfile 0.01% Java 57.32% HTML 1.05% Scala 15.04% CSS 0.08% JavaScript 0.33% Perl 0.39% Makefile 0.14% C 3.44% C++ 0.02%

jacana's Introduction

Note: Original repo was at Google Code (https://code.google.com/p/jacana/). It was too big and after conversion, commit history was lost. Also, the data files had to be separated. Please download the data files and untar it to the same folder as the project files.

Jacana (/dʒəˈkɑːnə/) consists of three parts:

  • jacana-qa, a Question Answering engine for TREC-style questions, written in Java.
  • jacana-align, a monolingual word aligner for English, written in Java and Scala.
  • jacana-freebase, a Question Answering engine for web-style questions on Freebase, written in Java and Scala.

download:

[http://www.cs.jhu.edu/~xuchen/packages/jacana-align.bin.20141029.tar.bz2 jacana-align.bin.20141029.tar.bz2] (103M, just the binary version of the aligner)

[http://www.cs.jhu.edu/~xuchen/packages/jacana.20141029.tar.bz2 jacana.20141029.tar.bz2] (660MB, everything (qa+align+freebase, source code only, no compiled jars), mostly data files and lexical resources)

jacana-qa

contains:

  • a ranker for ranking whether a sentence contains an answer for a given question,
  • an answer extractor for extracting exact answer segments from sentences.

It provides software implementation and dataset for the following two papers:

[http://cs.jhu.edu/~xuchen/paper/yao-jacana-qa-naacl2013.pdf Answer Extraction as Sequence Tagging with Tree Edit Distance]. Xuchen Yao, Benjamin Van Durme, Peter Clark and Chris Callison-Burch. Proceedings of NAACL 2013.

[http://cs.jhu.edu/~xuchen/paper/yao-jacana-ir-acl2013.pdf Automatic Coupling of Answer Extraction and Information Retrieval]. Xuchen Yao, Benjamin Van Durme and Peter Clark. Proceedings of ACL 2013, short papers.

HOWTO: JacanaQA

jacana-align

is a token-based aligner described in the following paper:

[http://cs.jhu.edu/~xuchen/paper/yao-jacana-wordalign-acl2013.pdf A Lightweight and High Performance Monolingual Word Aligner]. Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch and Peter Clark. Proceedings of ACL 2013, short papers.

[http://jacana.clsp.jhu.edu/ online demo]

HOWTO: JacanaAlign

jacana-freebase

is a question answering engine that extracts answers from Freebase:

Information Extraction over Structured Data: Question Answering with Freebase. Xuchen Yao and Benjamin Van Durme. Proceedings of ACL 2014.

HOWTO: JacanaFreebase

Change Log

2015-08-18:

  • imported from Google Code. Original repo exceeded GitHub limit (less 1G repo, single file less than 100MB). Had to separate data from main repo and lost commit history.

2014-10-29:

  • jacana-align: is now thread-safe, model is also updated/re-trained

2014-10-2

  • jacana-align: removed word2vec since it's too slow
  • jacana-align: added parallel support (by default), use the "--single" parameter to disable it

2014-3-19

  • released jacana-freebase

2013-11-3

  • added more lexical resources: word2vec, wiktionary, better PPDB support
  • feature tuning for token alignment
  • full UTF-8 support in AJAX demo

2013-8-5

  • retrained model on larger alignment corpus
  • now compiles with Scala-2.10.2
  • fixed Windows compatability issue of several lexical models
  • demo pre-loads model first

2013-5-28

  • Initial release

jacana's People

Contributors

xuchen avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.