pgatk-io

About pgatk-io

The pgatk-io library is a Java framework for manipulating mass spectrometry and proteomics file formats. It has a special focus on novel file formats for proteomics, such as Apache Spark Parquet and JSON.

Support Matrix

This table summarizes the current level of support for each feature across the different file formats. See discussion below for details on each feature.

Feature MGF APL (Maxquant) mzXML mzML PRIDE Json Pep Avro
Random Access ✔️ ✔️ ✔️ ✔️ ✔️
Fast Iterable Access ✔️ ✔️ ✔️
Gzip Support
Numpress Support

File formats

License

pgatk-io is licensed under Apache License 2.0.

Main Features

  • All parsers are based on a custom-built class that efficiently parses text files line by line, so they can handle arbitrarily large files in minimal memory. This allows easy and efficient processing of peak list files using the Java programming language.

  • For every implementation, a Random Access reader and an Iterable Access reader are provided.

    • With Random Access, developers can retrieve any individual spectrum by its identifier or by its index.
    • With Iterable Access, developers can step through the spectra one at a time using the next function.
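
The two access modes described above can be illustrated with a minimal, self-contained sketch. It is not the pgatk-io API: the class names (`SpectrumAccessSketch`, `MgfIterator`, `MgfIndex`), the method names, and the sample data are all hypothetical. It assumes a simplified MGF layout in which each spectrum sits between `BEGIN IONS` and `END IONS`, carries a `TITLE=` line used as its identifier, and has peak lines that start with a digit.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.NoSuchElementException;

// Illustrative sketch only: every class here is hypothetical, not the pgatk-io API.
class SpectrumAccessSketch {

    /** Minimal spectrum: the TITLE value plus the raw "m/z intensity" lines. */
    static final class Spectrum {
        final String id;
        final List<String> peaks;
        Spectrum(String id, List<String> peaks) { this.id = id; this.peaks = peaks; }
    }

    /** Iterable access: parses one BEGIN IONS ... END IONS block per next() call. */
    static final class MgfIterator implements Iterator<Spectrum> {
        private final BufferedReader reader;
        private Spectrum lookahead;

        MgfIterator(BufferedReader reader) {
            this.reader = reader;
            this.lookahead = readNext();
        }

        // Reads line by line, so only the current spectrum is held in memory.
        private Spectrum readNext() {
            try {
                String id = null;
                List<String> peaks = null;
                String line;
                while ((line = reader.readLine()) != null) {
                    line = line.trim();
                    if (line.equals("BEGIN IONS")) {
                        peaks = new ArrayList<>();
                        id = null;
                    } else if (peaks == null || line.isEmpty()) {
                        continue; // outside a spectrum block, or a blank line
                    } else if (line.equals("END IONS")) {
                        return new Spectrum(id, peaks);
                    } else if (line.startsWith("TITLE=")) {
                        id = line.substring("TITLE=".length());
                    } else if (Character.isDigit(line.charAt(0))) {
                        peaks.add(line); // peak line; other KEY=VALUE headers are ignored
                    }
                }
                return null; // end of input
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }

        @Override public boolean hasNext() { return lookahead != null; }

        @Override public Spectrum next() {
            if (lookahead == null) throw new NoSuchElementException();
            Spectrum current = lookahead;
            lookahead = readNext();
            return current;
        }
    }

    /** Random access: look up a spectrum by identifier or by position. */
    static final class MgfIndex {
        private final Map<String, Spectrum> byId = new HashMap<>();
        private final List<Spectrum> byPosition = new ArrayList<>();

        MgfIndex(Iterator<Spectrum> spectra) {
            while (spectra.hasNext()) {
                Spectrum s = spectra.next();
                byId.put(s.id, s);
                byPosition.add(s);
            }
        }
        Spectrum getById(String id) { return byId.get(id); }
        Spectrum getByIndex(int index) { return byPosition.get(index); }
    }

    // Made-up sample data for the demonstration.
    static final String SAMPLE = String.join("\n",
        "BEGIN IONS", "TITLE=scan=1", "PEPMASS=500.25", "CHARGE=2+",
        "100.1 2000", "200.2 1500", "END IONS",
        "BEGIN IONS", "TITLE=scan=2", "PEPMASS=610.30",
        "300.3 900", "END IONS");

    public static void main(String[] args) {
        Iterator<Spectrum> it = new MgfIterator(new BufferedReader(new StringReader(SAMPLE)));
        while (it.hasNext()) {
            Spectrum s = it.next();
            System.out.println(s.id + " has " + s.peaks.size() + " peaks");
        }
        MgfIndex index = new MgfIndex(new MgfIterator(new BufferedReader(new StringReader(SAMPLE))));
        System.out.println(index.getById("scan=2").peaks.get(0));
        System.out.println(index.getByIndex(0).id);
    }
}
```

For brevity, this index keeps whole spectra in memory. A random-access reader meant for arbitrarily large files would more plausibly store only the file offset of each spectrum and seek to it on demand.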

Getting Help

If you have questions or need additional help, please create an issue in the library's GitHub repository (https://github.com/bigbio/pgatk-io/issues). Please send us your feedback, including error reports, improvement suggestions, new feature requests and anything else you might want to suggest.

Similar libraries:

  • ms-data-core-api Perez-Riverol Y., Uszkoreit J., Sanchez A., Ternent T., Del Toro N., Hermjakob H., Vizcaíno J.A., Wang R. ms-data-core-api: an open-source, metadata-oriented library for computational proteomics. Bioinformatics. 2015 Sep 1;31(17):2903-5.

  • jmzReader Griss J, Reisinger F, Hermjakob H, Vizcaíno JA. jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats. Proteomics. 2012 Mar;12(6):795-8. doi: 10.1002/pmic.201100578.

Contributors

chakrabandla, dependabot[bot], ypriverol


pgatk-io's Issues

Add support for PRIDE JSON streams

Currently, the PRIDE Json reader expects one JSON object per line.
Ideally, the reader should be updated to also support JSON streams as returned by the API directly.
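
One way to see the difference between the two input shapes is a small splitter that does not depend on line breaks. The sketch below is only an illustration under stated assumptions: the class `JsonStreamSplitter` and its `split` method are hypothetical and not part of pgatk-io, the field names in the sample are made up, and the exact shape of the stream returned by the API is not specified here. It assumes the stream holds top-level JSON objects that are either concatenated or wrapped in a single array, and it finds object boundaries by tracking brace depth while skipping braces that occur inside string values.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: hypothetical helper, not the pgatk-io PRIDE Json reader.
class JsonStreamSplitter {

    /** Returns the text of each top-level JSON object found in the stream. */
    static List<String> split(Reader in) throws IOException {
        List<String> objects = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int depth = 0;            // nesting depth of '{' ... '}'
        boolean inString = false; // true while inside a JSON string value or key
        boolean escaped = false;  // true right after a backslash inside a string
        int c;
        while ((c = in.read()) != -1) {
            char ch = (char) c;
            if (depth == 0 && ch != '{') {
                continue; // skip whitespace, commas and array brackets between objects
            }
            current.append(ch);
            if (inString) {
                if (escaped) {
                    escaped = false;
                } else if (ch == '\\') {
                    escaped = true;
                } else if (ch == '"') {
                    inString = false;
                }
            } else if (ch == '"') {
                inString = true;
            } else if (ch == '{') {
                depth++;
            } else if (ch == '}' && --depth == 0) {
                objects.add(current.toString()); // one complete top-level object
                current.setLength(0);
            }
        }
        return objects;
    }

    public static void main(String[] args) throws IOException {
        // Two objects on one line, the first containing a brace inside a string.
        String stream = "{\"id\":\"a\",\"note\":\"x}y\"} {\"id\":\"b\",\"nested\":{\"k\":[1,2]}}";
        for (String object : split(new StringReader(stream))) {
            System.out.println(object);
        }
    }
}
```

Each returned string could then be handed to whatever JSON parser the reader already uses for single-line objects, so the one-object-per-line format would remain a special case of the same code path.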

Use GitHub Actions to Deploy automatically to GitHub Packages

@ypriverol

GitHub now also provides a maven repository which is accessible through GitHub actions.

I suggest moving this package to the GitHub mvn repo so that the latest snapshot version is updated automatically as soon as we have a successful merge into master.

Otherwise, we have to update the mvn package manually which is causing some issues on my side.
