Giter Site home page Giter Site logo

biquad's Introduction

BiQuAD

prerequisites

The data can be used as is, for generation (or use on custom datasets) Python >= 3.7 is required. The bundeled datalog interpreter used for answer retrieval requires node.js >= v15.5.

create a virtual environment and install required Python packages

# create environment
python3 -m venv .venv

# activate environment
source .venv/bin/activate

# install dependencies
pip install -r requirements.txt

Install node.js dependencies

cd dltools/nodeinterp/
npm install .

download ESDB

The sqlite database can be obtained via Kaggle. It should be placed in the data directory as database.sqlite.

scripts

  • output_rulecount.py Makes sure the transformations in data/transformations can be loaded properly.
  • kbtransform.py Contains most of the transformation logic and can be invoked directly to syntax check the rulesets. The syntax check also outputs all placeholders or "fillers" defined per ruleset.
  • execdatalog.py Runs the datalog interpreter for answer retrieval, can be directly invoked to test communication.
  • soccerdb.py Contains the data model and SQL queries for matches, players, and events. Can be invoked directly to show the event dump of a random match in the database.
  • write_textrep.py Applies textual representations to matches and events, output is written to data/generated/reports/
  • write_datalog.py Similar to the above, generates datalog representations.
  • write_questions.py Generates questions in textual and datalog form, the datalog query is then tested against the knowledge base. Output is written to data/generated/questions.
  • generate_unanswerable.py Generates adverserial unanswerable questions in addition to those in data/generated/questions, output is saved to data/generated/questionsfull.
  • export_squad.py Generates a new squad-like 60-20-20 split from the generated content.

other data

  • data/generated-content/ The fully generated train/dev/test splits.
  • data/transformations/ The rulesets for transformations of match reports and questions into text and datalog.

biquad's People

Contributors

frankgrimm avatar

Watchers

 avatar  avatar  avatar  avatar

Forkers

osanchezgr

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.