Giter Site home page Giter Site logo

chebuu / fastore Goto Github PK

View Code? Open in Web Editor NEW

This project forked from refresh-bio/fastore

0.0 1.0 0.0 276 KB

FaStore - high-performance FASTQ files compressor

License: GNU General Public License v3.0

Makefile 0.70% C++ 93.76% C 2.21% QMake 0.65% Shell 2.37% Python 0.31%

fastore's Introduction

FaStore

GitHub downloads

Overview

FaStore is a high-performance short FASTQ sequencing reads compressor.

The compression process happens over multiple steps and the compressor currently consists of 3 tools:

  • fastore_bin - distribute DNA reads into bins
  • fastore_rebin - re-distribute the DNA reads into bins and clusterize further
  • fastore_pack - compress the DNA reads stored in bins

However, for ease of use, automated scripts are provided in the scripts directory to perform compression and decompression.

Usage

FaStore offers a variety of different compression configurations. In order to simplify selection, we created 4 profiles, namely lossless, reduced, lossy and max. To perform automatic compression and decompression, a pair of scripts fastore_compress.sh and fastore_decompress.sh is provided.

For example, to compress a pair of FASTQ files IN_1.fastq and IN_2.fastq in the lossless mode with reads represented in pared-end mode and using 8 processing threads type:

./fastore_compress.sh --lossless --in IN_1.fastq --pair IN_2.fastq --out COMP --threads 8

the compressed files will be stored as COMP.cmeta and COMP.cdata files.

To decompress the archives generated with any of the above mentioned profiles and using 8 processing threads type:

./fastore_decompress.sh --in COMP --out OUT_1.fastq --pair OUT_2.fastq --threads 8

the decompressed files will be stored as OUT_1.fastq and OUT_2.fastq files.

Building

Prerequisites

FaStore currently provides Makefiles for building on Linux and Mac OSX platforms. However, it should also be able to be compiled on Windows platform.

The only prerequisite is the zlib library.

Compiling

FaStore binaries are by default compiled using g++ version >= 4.8 (reguired for C++11 threading support). To compile FaStore using g++ >= 4.8 with C++11 standard and dynamic linking, use the default Makefile file and in the main directory type:

make

Alternatively, to compile using clang, invoke make using Makefile.clang file:

make -f Makefile.clang

The resulting fastore_bin, fastore_rebin and fastore_pack binaries will be placed in bin subdirectory.

However, to compile each subprogram separately, use the makefile files provided in each of subprograms directory.

fastore's People

Contributors

lrog avatar enormandeau avatar sebastiandeorowicz avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.