
umakabulk's Introduction

Compile instructions:

  1. Install jdk 1.8.*

  2. Install sbt: www.scala-sbt.org

  3. To compile, run: sbt compile

Optional

  1. Install scala-ide: www.scala-ide.org (this is a modified version of Eclipse)

  2. To generate an Eclipse project, run: sbt eclipse

The generated project can be imported into Eclipse later with File/Import... Existing projects into workspace.

How to use

Configuration

Define environment variables in script/config.sh. At least two variables must be defined; the commands below refer to BULKDOWNLOADS_DIR and CSV_FILE_PATH.
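A minimal sketch of what script/config.sh might contain, assuming the required variables are the two that the commands below reference (BULKDOWNLOADS_DIR and CSV_FILE_PATH). The paths are hypothetical placeholders, not values taken from the project.

```shell
# script/config.sh (sketch only; the variable names come from the usage text
# below, the values are hypothetical placeholders)

# Root directory under which per-endpoint data is stored
export BULKDOWNLOADS_DIR="${HOME}/umakabulk/bulkdownloads"

# CSV file listing the endpoints to process when no ENDPOINT is given
export CSV_FILE_PATH="${HOME}/umakabulk/endpoints.csv"
```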

Retrieve Bulk-Download data

./script/download.sh [ENDPOINT]

This command downloads data into the directory ${BULKDOWNLOADS_DIR}/[ENDPOINT]. If you do not specify ENDPOINT, the tool downloads the data of all endpoints listed in CSV_FILE_PATH.

Extract RDF file

./script/extract.sh [ENDPOINT]

This command extracts RDF files into the directory ${BULKDOWNLOADS_DIR}/[ENDPOINT]/extractions. If you do not specify ENDPOINT, the tool extracts the data of all endpoints listed in CSV_FILE_PATH.

NOTE: The following process detects RDF files by file extension. You have to rename an extracted file if its extension is not one of "rdf", "rdfs", "owl", "xml", "nt", "ttl", "n3", "trix", "trig", "brf", "nq", "jsonld", "rj", "xhtml" and "html".
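To spot files that would need renaming, a small check along these lines could help. It is only a sketch: the function names has_rdf_extension and list_unrecognized are hypothetical, and it assumes the extension must match exactly in lower case, which the note above does not state.

```shell
# Extensions listed in the note above
RDF_EXTENSIONS="rdf rdfs owl xml nt ttl n3 trix trig brf nq jsonld rj xhtml html"

# has_rdf_extension FILE: succeeds if FILE ends in one of the listed extensions
# (hypothetical helper; exact lower-case matching is an assumption)
has_rdf_extension() {
  _name=$(basename "$1")
  case "$_name" in
    *.*) _ext="${_name##*.}" ;;   # text after the last dot
    *)   return 1 ;;              # no extension at all
  esac
  for _candidate in $RDF_EXTENSIONS; do
    [ "$_ext" = "$_candidate" ] && return 0
  done
  return 1
}

# list_unrecognized DIR: print every file under DIR that would need renaming
list_unrecognized() {
  find "$1" -type f | while IFS= read -r _file; do
    has_rdf_extension "$_file" || printf '%s\n' "$_file"
  done
}
```

For example, list_unrecognized "${BULKDOWNLOADS_DIR}/[ENDPOINT]/extractions" would print the extracted files whose names the next step is likely to skip.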

Extract Candidates of Prefixes

./script/create_prefix.sh [ENDPOINT]

This command reads the RDF files, extracts candidate prefixes and writes them to ${BULKDOWNLOADS_DIR}/[ENDPOINT]_prefixes.csv. If you do not specify ENDPOINT, the tool creates a prefix file for each endpoint listed in CSV_FILE_PATH.
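The three steps above can be chained in the order they are described. The wrapper below is a sketch, not part of the project: run_pipeline is a hypothetical function name, and it assumes each script exits with a non-zero status on failure.

```shell
# run_pipeline SCRIPT_DIR [ENDPOINT]
# Runs download, extract and prefix creation in order and stops at the first
# step that fails (hypothetical wrapper; assumes non-zero exit on failure).
run_pipeline() {
  _script_dir=$1
  shift
  for _step in download.sh extract.sh create_prefix.sh; do
    echo "Running ${_step} $*"
    "${_script_dir}/${_step}" "$@" || {
      echo "${_step} failed" >&2
      return 1
    }
  done
}
```

Calling run_pipeline ./script [ENDPOINT] processes a single endpoint; leaving ENDPOINT out passes no argument to the scripts, so each one falls back to the endpoints in CSV_FILE_PATH.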

Contributors

yayamamo, takuodoi
