boulund / merge_taxonomic_classifications

Merge Kaiju, Kraken, and CLARK-S classifications of metagenomic reads

License: MIT License

Python 100.00%
metagenomics taxonomic-classifications classification

merge_taxonomic_classifications's Introduction

Merge taxonomic classifications

A small utility for merging Kaiju, Kraken, and CLARK-S classifications of metagenomic reads. Classifications are merged in a user-specified order, and a read's assignment from a later method overwrites any assignment made by methods earlier in the merge order.
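
The overwrite-by-merge-order behaviour can be illustrated with a minimal sketch. This is not the tool's actual code: the function name, the dictionary layout, and the example read IDs and taxids are all hypothetical, and it assumes that only classified reads are passed in for each method.

```python
def merge_classifications(per_tool, merge_order):
    """Merge read -> taxid mappings in the given order (hypothetical helper).

    per_tool: dict mapping a tool name to a dict of read id -> taxid,
              containing classified reads only (an assumption of this sketch).
    merge_order: list of tool names, e.g. ["kraken", "kaiju"].
    """
    merged = {}
    for tool in merge_order:
        # dict.update overwrites assignments made by tools earlier in the order
        merged.update(per_tool.get(tool, {}))
    return merged


# Made-up example data: read ids and taxids are illustrative only
classifications = {
    "kraken": {"read1": 562, "read2": 1280},
    "kaiju": {"read2": 1279, "read3": 9606},
}
merged = merge_classifications(classifications, ["kraken", "kaiju"])
print(merged)
```

With merge order kraken,kaiju, read2 ends up with the Kaiju assignment because Kaiju comes later in the order; reversing the order would keep the Kraken assignment instead.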

merge_taxonomic_classifications's Issues

Summary unclassified read count incorrect when merging with empty file

If merging with an empty file, e.g. merging an empty Kraken output file with a Kaiju output file with merge order kraken,kaiju, the summary unclassified read count is incorrect.

The reason for this is that when reading the kraken file, no classified or unclassified reads are added to the SQLite3 database. Thus, when the kaiju file is read and "merged" with the kraken results, only the classified reads from kaiju are merged. It is assumed by the merge order that unclassified reads from kaiju are unclassified also in kraken, which is incorrect in this special case. This isn't really a problem, but still annoys me a bit :)

One of the earlier implementations did not have this issue, as it stored a complete SQLite3 table for each input file, and THEN merged everything, wasting a lot of time and space.
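
The miscount described above can be reproduced with a small sketch using Python's sqlite3 module. The table schema, the sentinel taxid 0 for unclassified reads, and the function name are assumptions made for illustration; the real tool's database layout may differ. The sketch stores every read from the first file in the merge order and only classified reads from later files, which is the behaviour the issue describes.

```python
import sqlite3

UNCLASSIFIED = 0  # assumed sentinel taxid for unclassified reads


def count_unclassified_after_merge(files):
    """Return the unclassified read count after an incremental merge.

    files: list of lists of (read_id, taxid) tuples, in merge order.
    Hypothetical helper, not the tool's actual implementation.
    """
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE reads (read_id TEXT PRIMARY KEY, taxid INTEGER)")
    for index, records in enumerate(files):
        for read_id, taxid in records:
            # First file: store every read, classified or not.
            # Later files: only classified reads are merged in.
            if index == 0 or taxid != UNCLASSIFIED:
                db.execute(
                    "INSERT OR REPLACE INTO reads VALUES (?, ?)",
                    (read_id, taxid),
                )
    (count,) = db.execute(
        "SELECT COUNT(*) FROM reads WHERE taxid = ?", (UNCLASSIFIED,)
    ).fetchone()
    db.close()
    return count


# Made-up data: one classified and two unclassified Kaiju reads
kaiju = [("r1", 562), ("r2", UNCLASSIFIED), ("r3", UNCLASSIFIED)]
kraken_empty = []

# Empty Kraken file first: the two unclassified Kaiju reads are never stored
print(count_unclassified_after_merge([kraken_empty, kaiju]))  # prints 0
# Kaiju file alone: both unclassified reads are counted
print(count_unclassified_after_merge([kaiju]))  # prints 2
```

In this sketch the count drops from 2 to 0 when the empty file comes first, because the unclassified Kaiju reads are skipped on the assumption that the earlier file already recorded them.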
