Giter Site home page Giter Site logo

elco's Introduction

The application consists of clients to call Entity Linking services (EL) in English, and modules to operate on the results. Implements the Entity Linking System Combination described in our *SEM 2015 paper.

The EL services currently supported are:

Requirements

  • Python 2.7
  • lxml
  • MySQL-python (aka MySQLdb)
  • nltk
  • pyspotlight
  • requests

To call TagMe and Babelfy, you need to request a key: Tagme, Babelfy. The application's config module has variables to enter the keys.

Modules

  • analysis: Parses client responses. Computes entity-cooccurrence tables.
  • clients: Clients to call the services
  • config: Configuration
  • main: Example how to use. Creates runners and calls them for each service
  • model: Data types and some methods for them
  • readers: To preprocess input before calling a client
  • runners: Classes here use a reader, client and writer to create an annotation workflow
  • utils: General tools useful for several modules
  • writers: To postprocess the annotations and output them (to a file etc)

Usage

  • activate the services to call in config.py

  • call main.py

     usage: App to work with Entity Linking [-h] [-i MYINPUT] [-o MYOUT]
                                        [-s MYSKIPLIST] [-c CORPUS_NAME]
    
     optional arguments:
       -h, --help            show this help message and exit
       -i MYINPUT, --input MYINPUT
                             Input file, directory or text. A default can be set in
                             config.py (default: /path/to/some/default/input)
       -o MYOUT, --output MYOUT
                             Output file or files. Default names are created 
                             dynamically by code in writers.py module (default: None)
       -r MYOUTRESPS, --resp_output MYOUTRESPS
                             Output directory for client responses. A default is
                             created dynamically by code in writers.py module
                             (default: None)
       -s MYSKIPLIST, --skip_list MYSKIPLIST
                             File with filenames to skip (default:
                             /path/to/some/default/list)
       -c CORPUS_NAME, --corpus CORPUS_NAME
                             Name of the corpus (for output files etc.). A default
                             can be set in config.py (default: SOME_DEFAULT_NAME)
    

elco's People

Contributors

pruizf avatar

Stargazers

Andreas Asprou avatar

Watchers

James Cloos avatar  avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.