Salmon vs kallisto on GEUVADIS and SEQC datasets

Google Slides associated with this Shiny app

Fragment-level bias modeling for RNA-seq data analysis

A Shiny app to explore differences in transcript abundance estimation on GEUVADIS and SEQC

How to load the Shiny app

Clone or Download this repo from GitHub (either clone or Download ZIP and unzip)
Open geuvadis.R or seqc.R in RStudio. For seqc.R you can change the dataset variable to "A" or "B" for the two different types of samples processed by SEQC.
Click Run App
Clicking on a point on the left side will open a plot on the right side showing the normalized estimated counts for all the samples for the transcript you clicked (green and orange colored points) and for the other transcripts of the gene that had differences. This example shows kallisto with very low normalized estimated counts for the samples from the CNL sequencing center. Green indicates sequencing centers with more fragment sequence bias, while orange indicates less fragment sequence bias. The diagonal lines are y = 2x and x = 2y. The red numbers indicate the number of transcripts in the different thirds of the plot defined by the two lines.

The SEQC quantifications are the same from the Salmon publication. Quantifications were made against the RefSeq gene annotation file and the genome FASTA contained within the hg19 Illumina iGenome using Salmon 0.8.0 with --gcBias --seqBias and kallisto 0.43.0 with --bias. The LRT in DESeq2 was used to generate p-values for each transcript, comparing a model with sequencing center to a model with just the intercept.

The Geuvadis quantifications were made against Gencode v26 CHR transcripts using Salmon 0.8.2 with --gcBias --seqBias and kallisto 0.43.1 with --bias. Limma-voom was used to generate F statistics for each transcript, comparing a full model with sequencing center, sex, and population to a reduced model with only sex and population.

mikelove / salmon_kallisto_diffs Goto Github PK

salmon_kallisto_diffs's Introduction

Salmon vs kallisto on GEUVADIS and SEQC datasets

Google Slides associated with this Shiny app

A Shiny app to explore differences in transcript abundance estimation on GEUVADIS and SEQC

How to load the Shiny app

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent