wbj0110 / adam_qas

This project forked from 5hirish/adam_qas

ADAM - A Question Answering System. Inspired by IBM Watson.

Home Page: http://www.shirishkadam.com/

License: GNU General Public License v3.0

Python 99.26% Shell 0.27% Dockerfile 0.47%

ADAM -- Question Answering System

A question answering system that extracts answers from Wikipedia to questions posed in natural language, inspired by IBM Watson and START. We are currently focused on improving the accuracy of the extracted answers. Follow the creator's blog at shirishkadam.com for progress updates.

Getting Started

Elasticsearch is used to store and index the scraped and parsed texts from Wikipedia. The Elasticsearch 7.X installation guide can be found in the Elasticsearch documentation. You might have to start the Elasticsearch service before running ADAM.
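Before running ADAM it can help to confirm that Elasticsearch is actually reachable. The following is a minimal sketch, not part of the project: it assumes Elasticsearch listens on its default address `http://localhost:9200` without authentication, and the helper name `elasticsearch_is_up` is hypothetical.

```python
import json
import urllib.error
import urllib.request


def elasticsearch_is_up(url="http://localhost:9200", timeout=2.0):
    """Return True if a GET on `url` yields JSON that reports a version number.

    Assumption: a default, unsecured Elasticsearch node answers on its root
    endpoint with a JSON document containing {"version": {"number": ...}}.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            info = json.load(response)
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, HTTP error, or a non-JSON reply:
        # treat all of these as "not running".
        return False
    if not isinstance(info, dict):
        return False
    version = info.get("version")
    return isinstance(version, dict) and "number" in version
```

Calling `elasticsearch_is_up()` returns `False` when nothing answers at that address, which usually means the service still has to be started.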

$ git clone https://github.com/5hirish/adam_qas.git
$ cd adam_qas
$ pip install -r requirements.txt
$ python -m qas.adam -vv "When was linux kernel version 4.0 released ?"

Note: The above installation downloads the best-matching default English language model for spaCy. To improve accuracy you can install other models as well. Read more in the spaCy docs.

$ python -m spacy download en_core_web_md
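To ask several questions in a row, the command shown above can be wrapped in a small script. This is only a sketch built around the documented `python -m qas.adam -vv "<question>"` invocation: the helper names `build_adam_command` and `ask` are hypothetical, and it assumes the script is run from the `adam_qas` directory. Whether the answer is written to stdout or to the log output is not stated here, so both streams are returned.

```python
import subprocess
import sys


def build_adam_command(question, verbosity="-vv"):
    """Build the argument list for the documented `python -m qas.adam` call."""
    return [sys.executable, "-m", "qas.adam", verbosity, question]


def ask(question):
    """Run ADAM once and return (stdout, stderr) as text.

    Assumption: the current working directory is the cloned adam_qas
    repository and its requirements are installed.
    """
    completed = subprocess.run(
        build_adam_command(question),
        capture_output=True,
        text=True,
        check=False,  # do not raise if ADAM exits with a non-zero status
    )
    return completed.stdout, completed.stderr
```

For example, `ask("When was linux kernel version 4.0 released ?")` runs the same command as above and hands back its output for further processing.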

Running with Docker

$ git clone https://github.com/5hirish/adam_qas.git
$ cd adam_qas
$ docker-compose up

Both containers are now up and running. The next step is to enter the Python container and run ADAM:

$ docker exec -it $(docker ps -a -q  --filter ancestor=adam_qas_adam) bash
$ python -m qas.adam -vv "When was linux kernel version 4.0 released ?"

References

Find more in-depth documentation about the system, along with its research paper and system architecture, here

Requirements

Python package dependencies are listed in requirements.txt.

Upgrading Elasticsearch 6.X:

  • Rolling Update 6.2 to 6.8 > ref
  • Rolling Update 6.8 to 7.1 > ref

Features

  • Extract information from Wikipedia
  • Classify questions with regular expressions (default)
  • Classify questions with an SVM (optional)
  • Vector space model used for answer extraction
  • Rank candidate answers
  • Merge top 5 answers into one response
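The classification, ranking, and merging steps listed above can be illustrated with a toy example. This is a sketch under stated assumptions, not the project's implementation: the regular expressions, the answer-type labels, the TF-IDF weighting, and the function names are all hypothetical, and the optional SVM classifier is not shown.

```python
import math
import re
from collections import Counter

# Hypothetical patterns: the first one that matches decides the answer type.
QUESTION_PATTERNS = [
    (re.compile(r"^\s*(when|what year)\b", re.I), "DATE"),
    (re.compile(r"^\s*who\b", re.I), "PERSON"),
    (re.compile(r"^\s*where\b", re.I), "LOCATION"),
    (re.compile(r"^\s*how (many|much)\b", re.I), "QUANTITY"),
]


def classify_question(question):
    """Rule-based question classification with regular expressions."""
    for pattern, label in QUESTION_PATTERNS:
        if pattern.search(question):
            return label
    return "OTHER"


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(token_lists):
    """Turn token lists into sparse TF-IDF vectors (dicts of term -> weight)."""
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        vectors.append({
            # Smoothed inverse document frequency keeps every weight positive.
            term: count * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def answer(question, sentences, top_k=5):
    """Rank candidate sentences against the question and merge the top_k."""
    vectors = tfidf_vectors([tokenize(question)] + [tokenize(s) for s in sentences])
    query, candidates = vectors[0], vectors[1:]
    ranked = sorted(
        zip(sentences, candidates),
        key=lambda pair: cosine(query, pair[1]),
        reverse=True,
    )
    return " ".join(sentence for sentence, _ in ranked[:top_k])
```

Here the question is embedded in the same vector space as the candidate sentences, candidates are ordered by cosine similarity, and the best `top_k` (5 by default, matching the list above) are joined into a single response.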

Current Project State ?

GitHub Issue #36: Invalid Answers

TODO

  • Replace Wikipedia APIs with custom scraper
  • Store extracted data in a database (Elasticsearch)
  • SQLite test input data storage
  • Anaphora resolution in both questions and answers
  • Machine learning query constructor rather than rule-based
  • Improve vector space language model for answer extraction

Contributions

Please see our contributing documentation for some tips on getting started.

Maintainers
