jenni-westoby / blueprint

Benchmarking pipeline published in https://genomebiology.biomedcentral.com/articles/10.1186/s13059-018-1571-5

License: GNU General Public License v3.0

Languages: Shell 23.11%, Python 1.55%, Perl 8.29%, R 67.05%

Topics: human-cell-atlas

blueprint's Introduction

Benchmarking pipeline for BLUEPRINT data

Prerequisites:

- virtualenv

- GitHub account

- Reference genome GTF and FASTA files. The Mus musculus Ensembl release 89 genome was used in this study, with ERCC sequences appended (see https://tools.thermofisher.com/content/sfs/manuals/cms_095048.txt).

- Java version 1.8

- R version 3.4.4

To run the pipeline:

Execute ./wrapper.sh path/to/java path/to/ref/fasta path/to/ref/gtf

In practice, your machine is unlikely to have the resources to run the entire pipeline in one go, so you will probably need to split the wrapper script into parts and run them separately.
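Because a full run is expensive, it can help to confirm that the three input paths exist before starting. The following is a minimal sketch in Python, not part of the pipeline itself: the helper names `build_wrapper_command` and `run_wrapper` are hypothetical, and the only thing taken from this README is the argument order (java, reference FASTA, reference GTF).

```python
import subprocess
from pathlib import Path


def build_wrapper_command(java, fasta, gtf, wrapper="./wrapper.sh"):
    """Return the argv list for wrapper.sh after checking that each input path exists."""
    args = {"java": Path(java), "fasta": Path(fasta), "gtf": Path(gtf)}
    missing = [name for name, path in args.items() if not path.exists()]
    if missing:
        raise FileNotFoundError("missing input path(s): " + ", ".join(missing))
    # Argument order follows the README: java, reference FASTA, reference GTF.
    return [wrapper, str(args["java"]), str(args["fasta"]), str(args["gtf"])]


def run_wrapper(java, fasta, gtf):
    """Run wrapper.sh from the current directory and raise if it exits non-zero."""
    subprocess.run(build_wrapper_command(java, fasta, gtf), check=True)
```

`run_wrapper` assumes it is called from the repository root, where `wrapper.sh` lives.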

The pipeline automatically downloads the required data. In addition, a list of SRR accession codes can be found in SRR_Acc_List.txt.
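If you want to inspect the accession list yourself, a minimal Python sketch for loading it is shown here. It assumes that `SRR_Acc_List.txt` holds one accession per line, which this README does not state; `read_accessions` is a hypothetical helper and is not part of the pipeline.

```python
from pathlib import Path


def read_accessions(path="SRR_Acc_List.txt"):
    """Return the non-empty, whitespace-stripped lines of the accession list, in file order."""
    lines = Path(path).read_text().splitlines()
    # Assumption: one accession code per line; blank lines are skipped.
    return [line.strip() for line in lines if line.strip()]
```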

As part of the pipeline, quality control steps are automatically carried out. For reference, these are the statistics used to filter the raw data:

| Statistic | Name of statistic in table | Threshold |
|---|---|---|
| No. of uniquely mapping reads | Unique | >8000000 |
| No. of non-uniquely mapping reads | NonUnique | >350000 |
| No. of alignments | NumAlign | >8200000 |
| No. of reads | NumReads | >4000000 |

These are the statistics used to filter the Polyester simulated data:

| Statistic | Name of statistic in table | Threshold |
|---|---|---|
| No. of non-uniquely mapping reads | NonUnique | >250,000 |
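The thresholds above can be written down as plain data and checked per cell. The sketch below is an illustration in Python, not the pipeline's own QC code. This README does not say whether a cell that exceeds a threshold is kept or removed, so the hypothetical helper `exceeded` only reports which statistics are above their threshold and leaves that decision to the caller.

```python
# Thresholds copied from the raw-data and simulated-data statistics listed above;
# keys are the names of the statistics in the QC table.
RAW_THRESHOLDS = {
    "Unique": 8_000_000,
    "NonUnique": 350_000,
    "NumAlign": 8_200_000,
    "NumReads": 4_000_000,
}
SIMULATED_THRESHOLDS = {"NonUnique": 250_000}


def exceeded(stats, thresholds):
    """Return the names of the statistics whose value is strictly above its threshold."""
    return [name for name, limit in thresholds.items() if stats[name] > limit]
```

For example, with made-up values for a single cell:

```python
cell = {"Unique": 5_000_000, "NonUnique": 400_000, "NumAlign": 6_000_000, "NumReads": 3_000_000}
print(exceeded(cell, RAW_THRESHOLDS))  # ['NonUnique']
```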

In addition, for both the raw and simulated data, the scater package was used to filter cells in which more than 10% of reads mapped to mitochondrial genes.
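The 10% mitochondrial criterion can be illustrated without scater. The following Python sketch is not the scater-based code used in the pipeline; it assumes per-cell counts are available as a mapping from gene name to read count, and the function names and the `mito_genes` set are hypothetical.

```python
def mito_fraction(counts, mito_genes):
    """Fraction of a cell's reads assigned to mitochondrial genes (0.0 for an empty cell)."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    mito = sum(n for gene, n in counts.items() if gene in mito_genes)
    return mito / total


def exceeds_mito_cutoff(counts, mito_genes, cutoff=0.10):
    """True when more than `cutoff` of the cell's reads map to mitochondrial genes."""
    return mito_fraction(counts, mito_genes) > cutoff
```

The comparison is strict, matching the "more than 10%" wording, so a cell at exactly 10% is not flagged.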
