Giter Site home page Giter Site logo

mvgwas-nf's Introduction

mvgwas-nf

nextflow CI-checks

A pipeline for multi-trait genome-wide association studies (GWAS) using MANTA.

The pipeline performs the following analysis steps:

  • Split genotype file
  • Preprocess phenotype and covariate data
  • Test for association between phenotypes and genetic variants
  • Collect summary statistics

The pipeline uses Nextflow as the execution backend. Please check Nextflow documentation for more information.

Requirements

  • Unix-like operating system (Linux, MacOS, etc.)
  • Java 8 or later
  • Docker (v1.10.0 or later) or Singularity (v2.5.0 or later)

Quickstart (~2 min)

  1. Install Nextflow:

    curl -fsSL get.nextflow.io | bash
    
  2. Make a test run:

    nextflow run dgarrimar/mvgwas-nf -with-docker
    

Notes: move the nextflow executable to a directory in your $PATH. Set -with-singularity to use Singularity instead of Docker.

(*) Alternatively you can clone this repository:

git clone https://github.com/dgarrimar/mvgwas-nf
cd mvgwas-nf
nextflow run mvgwas.nf -with-docker

Important: Since release 22.12.0-edge, DSL1 is not further supported in Nextflow. Until mvgwas-nf is migrated to DSL2, the pipeline should be run using an older Nextflow release. This can be done using NXF_VER before Nextflow commands, e.g. NXF_VER=22.04.0 nextflow run dgarrimar/mvgwas-nf -with-docker.

Pipeline usage

Launching the pipeline with the --help parameter shows the help message:

nextflow run mvgwas.nf --help
N E X T F L O W  ~  version 20.04.1
Launching `mvgwas.nf` [amazing_roentgen] - revision: 56125073b7

mvgwas-nf: A pipeline for multivariate Genome-Wide Association Studies
==============================================================================================
Performs multi-trait GWAS using using MANTA (https://github.com/dgarrimar/manta)

Usage:
nextflow run mvgwas.nf [options]

Parameters:
--pheno PHENOTYPES          phenotype file
--geno GENOTYPES            indexed genotype VCF file
--cov COVARIATES            covariate file
--l VARIANTS/CHUNK          variants tested per chunk (default: 10000)
--t TRANSFOMATION           phenotype transformation: none, sqrt, log (default: none)
--i INTERACTION             test for interaction with a covariate: none, <covariate> (default: none)
--ng INDIVIDUALS/GENOTYPE   minimum number of individuals per genotype group (default: 10)
--dir DIRECTORY             output directory (default: result)
--out OUTPUT                output file (default: mvgwas.tsv)

Input files and format

mvgwas-nf requires the following input files:

  • Genotypes. bgzip-compressed and indexed VCF genotype file.

  • Phenotypes. Tab-separated file with phenotype measurements (quantitative) for each sample (i.e. n samples x q phenotypes). The first column should contain sample IDs. Columns should be named.

  • Covariates. Tab-separated file with covariate measurements (quantitative or categorical) for each sample (i.e. n samples x k covariates). The first column should contain sample IDs. Columns should be named.

Example data is available for the test run.

Pipeline results

An output text file containing the multi-trait GWAS summary statistics (default: ./result/mvgwas.tsv), with the following information:

  • CHR: chromosome
  • POS: position
  • ID: variant ID
  • REF: reference allele
  • ALT: alternative allele
  • F: pseudo-F statistic
  • R2: fraction of variance explained by the variant
  • P: P-value

The output folder and file names can be modified with the --dir and --out parameters, respectively.

Cite mvgwas-nf

If you find mvgwas-nf useful in your research please cite the related publication:

Garrido-Martín, D., Calvo, M., Reverter, F., Guigó, R. A fast non-parametric test of association for multiple traits. Genome Biol 24, 230 (2023). https://doi.org/10.1186/s13059-023-03076-8

mvgwas-nf's People

Contributors

dgarrimar avatar

Stargazers

XinyangYu avatar qianche avatar kcan avatar Beatrice Borsari avatar

Watchers

 avatar

Forkers

kcnd15 kahinabch

mvgwas-nf's Issues

Nextflow DSL1 no longer supported

As of release v22.12.0-edge, Nextflow does not further support DSL1:

N E X T F L O W ~ version 23.04.3
Nextflow DSL1 is no longer supported — Update your script to DSL2, or use Nextflow 22.10.x or earlier

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.