Giter Site home page Giter Site logo

irenawasikplay / aalto-xml Goto Github PK

View Code? Open in Web Editor NEW

This project forked from fasterxml/aalto-xml

0.0 1.0 0.0 3.66 MB

Ultra-high performance non-blocking XML processor (Stax API + extensions)

Home Page: http://wiki.fasterxml.com/AaltoHome

License: Other

Java 100.00%

aalto-xml's Introduction

Overview

Aalto XML processor is an ultra-high performance next generation Stax XML processor implementation, implementing both basic Stax API (javax.xml.stream) and Stax2 API extension (org.codehaus.woodstox.stax2). In addition, it also implements SAX2 API.

In additional to standard Java XML interfaces, one unique feature not implemented by any other Java XML parser that we are aware is so-called non-blocking (asynchronous) XML parsing: ability to parse XML without using blocking I/O, necessary for fully asynchronous processing such as those with Akka framework. Aalto non-blocking API is a minimalistic extension above Stax/Stax2 API to allow indication of "not yet available" token (EVENT_INCOMPLETE) as well as feeding of input (since InputStream can not be used as it blocks)

Aalto is licensed under Apache License 2.0

Status

Build Status

Documentation

JDK Compatibility

Aalto 1.x:

  • Can be used on JDK versions 6 (1.6) and up
  • Needs JDK 8 or higher to build (as of Aalto 1.2.0)
  • Contains Java 9 Module definitions starting with version 1.2.0

Usage

Blocking XML parsing (Stax, SAX)

Blocking XML parsing is done using one of standard interfaces:

  • Stax (javax.xml.stream) interface -- countless tutorials exist.
    • Also implements Stax2 extension
    • StaxMate is a good companion library for more convenient access

Non-blocking ("async") XML parsing

Non-blocking parsing interface is extension of basic Stax (and Stax2) API, with extensions defined in 'com.fasterxml.aalto' package:

  • AsyncXMLInputFactory offers factory methods for creating non-blocking parsers
  • AsyncXMLStreamReader is extended type that non-blocking parsers implement
    • AsyncXMLStreamReader.EVENT_INCOMPLETE (value 257; just outside range reserved by Stax API) is used to denote "not yet available" (without more data)
    • Method getInputFeeder() is used to access object of type 'AsyncInputFeeder' used to feed input non-blocking way
  • AsyncInputFeeder contains methods for feeding input.

Typical usage pattern is one where block of input is fed to parser, and zero or more complete events are read using basic 'XMLStreamReader.next()' method; and once 'EVENT_INCOMPLETE' is returned, more input needs to be given. AsyncXMLStreamReader itself does not buffer input beyond a single block; caller is responsible for additional buffering, if any. See Async parsing for details.

Construction of AsyncXMLInputFactory is simple; instance may be constructed with or without initial content to parse:

AsyncXMLInputFactory f = new InputFactoryImpl();
// IF there is content use this:
AsyncXMLStreamReader<AsyncByteArrayFeeder> parser = f.createAsyncFor(byteArray);
// If NO CONTENT yet available, just use this:
AsyncXMLStreamReader<AsyncByteArrayFeeder> parser = f.createAsyncForByteArray();

and more content is feed via AsyncInputFeeder when getting EVENT_INCOMPLETE via parser.next():

parser.getInputFeeder().feedInput(b, offset, dataLength);

or, if no more input available, indicate end-of-content with

parser.getInputFeeder().endOfInput();

See the Full non-blocking parsing example on Wiki for more details.

Aalto Design goals

  • Ultra-high performance parsing by making the Common Case Fast (similar to original RISC manifesto). This may mean limiting functionality, but never compromising correctness. XML 1.0 compliancy is not sacrificed for speed.
  • Allow non-block, asynchronous parsing: it should be possible to "feed" more input and incrementally get more XML events out, without forcing the current thread to block on I/O read operation.

Dependency

Aalto dependency is usually added via Maven repository, so something like:

<dependency>
    <groupId>com.fasterxml</groupId>
    <artifactId>aalto-xml</artifactId>
    <version>1.1.0</version>
</dependency>

Related

aalto-xml's People

Contributors

adamretter avatar cowtowncoder avatar jakeri avatar prb avatar simonetripodi avatar stebulus avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.