Giter Site home page Giter Site logo

trokhymovych / wikicheck Goto Github PK

View Code? Open in Web Editor NEW
22.0 2.0 6.0 634 KB

Implementation for WikiCheck API, an open-source Wikipedia-based fact-checking API. The project is done in cooperation with Wikimedia Foundation and Ukrainian Catholic University.

Home Page: https://nli.wmcloud.org/

License: MIT License

Python 3.22% Jupyter Notebook 96.70% HTML 0.08%
api end-to-end fact-checking nlp search

wikicheck's Introduction

Site ResearchGate Wikipedia

WikiCheck API

Repository with the implementation of WikiCheck API, end-to-end open source Automatic Fact-Checking based on Wikipedia.

The research was published in CIKM2021 applied track:

  • Trokhymovych, Mykola, and Diego Saez-Trumper. WikiCheck: An End-to-End Open Source Automatic Fact-Checking API Based on Wikipedia. Proceedings of the 30th ACM International Conference on Information & Knowledge Management, Association for Computing Machinery, 2021, pp. 4155–4164, CIKM ’21. DOI:10.1145/3459637.3481961

  • The preprint WikiCheck: An End-to-End Open Source Automatic Fact-Checking API Based on Wikipedia: DOI:10.48550/arXiv.2109.00835

We encourage you to test the WikiCheck API by yourself: Website

Installation and Usage:

The project consists of modules directory with the implementation of modules used for inference along with the script for NLI models training.

The configs directory includes configuration files for training and inference.

The notebooks directory includes .ipynb notebooks with experiments done during the research.

If you want to get access to our fine-tuned models, you can load them from Zenodo DOI

Also, you can train your model by running the modules/model_trainer.py script.

API setup and run

  • Clone the official WikiCheck repo and cd into it

git clone https://github.com/trokhymovych/WikiCheck.git

cd WikiCheck

  • Create and activate virtualenv:

virtualenv -p python venv

source venv/bin/activate

  • Install requirements from requirements.txt:

pip install -r requirements.txt

  • Load pretrained models:

    • Loading models from Zenodo DOI
  • Run the API:

python run.py --config configs/inference/sentence_bert_config.json

Citation

If you find this work is useful, please cite our paper:

WikiCheck: An End-to-End Open Source Automatic Fact-Checking API Based on Wikipedia.

@inproceedings{10.1145/3459637.3481961,
author = {Trokhymovych, Mykola and Saez-Trumper, Diego},
title = {WikiCheck: An End-to-End Open Source Automatic Fact-Checking API Based on Wikipedia},
year = {2021},
isbn = {9781450384469},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3459637.3481961},
doi = {10.1145/3459637.3481961},
booktitle = {Proceedings of the 30th ACM International Conference on Information & Knowledge Management},
pages = {4155–4164},
numpages = {10},
keywords = {applied research, nlp, nli, wikipedia, fact-checking},
location = {Virtual Event, Queensland, Australia},
series = {CIKM '21}
}

wikicheck's People

Contributors

trokhymovych avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.