Giter Site home page Giter Site logo

raudikko's Introduction

Raudikko

Raudikko is Java library for performing morphological analysis on Finnish language. Raudikko uses Voikko's morphology files and is based on Voikko, but unlike Voikko's Java-interface, it is implemented purely in Java and needs no native libraries. Raudikko is also quite a bit faster than Voikko's Java-interface.

Usage

Add dependency to your build file:

implementation("fi.evident.raudikko:raudikko:0.1.1")

Create an analyzer and use it to produce Analysis-objects for given words:

// First load and compile the morphology. This is a relatively expensive operation 
// that should be done only once. The loaded morphology is immutable and can be shared.
Morphology morphology = Morphology.loadBundled();

// Create an analyzer from the morphology. Creating an analyzer is a cheap operation. 
// Analyzers have some cached state and can't be shared between different threads. 
Analyzer analyzer = morphology.newAnalyzer();

// Analyze some words
System.out.println(analyzer.analyze("kissoittansa"));
System.out.println(analyzer.analyze("kahdellakymmenelläseitsemällä"));

Compatibility with Voikko

Raudikko is carefully tested against Voikko, making sure that the output of Raudikko and Voikko agree for all inputs. The below table lists the version of Voikko that Raudikko is tests against.

Raudikko version Voikko version Morphology
0.1.1 4.3 9f0d3d

Acknowledgements

Most of Raudikko's code is based heavily on Voikko and the used morphology files come directly from Voikko. None of this would be possible without the great work of Harri Pitkänen and other contributors to Voikko.

Copyright and license information

Raudikko is available under GNU General Public License version 3, or (at your option) any later version. Apart from the morphology in directory resources/morpho, all the content in this repository is also licensed under Mozilla Public License 2.0, which you may choose to use instead.

Raudikko is a port of Voikko, and therefore Voikko's original copyright holders hold copyrights on parts of Raudikko. See Voikko's LICENSE and CONTRIBUTORS for details.

Apart from the original copyright holders of Voikko, Evident Solutions Oy holds copyright on Raudikko.

raudikko's People

Contributors

komu avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.