Giter Site home page Giter Site logo

xinl60 / tiga Goto Github PK

View Code? Open in Web Editor NEW

This project forked from unmtransinfo/tiga

0.0 0.0 0.0 7.03 MB

An approach, workflow and web app for GWAS gene-trait evidence aggregation

License: BSD 2-Clause "Simplified" License

Shell 9.75% Python 15.03% R 75.13% HTML 0.10%

tiga's Introduction

TIGA: Target Illumination GWAS Analytics

Genome-wide association studies (GWAS) generate large volumes of associations between genomic variations and phenotypic traits. However, assessing strength, specificity and relevance of associations, and semantically valid aggregation of associations, for applications such as drug target prioritization, is challenging. This project addresses this challenge.

The NHGRI-EBI GWAS Catalog is itself an expertly designed and curated aggregation of GWAS results and metadata, and the primary data source for this project. Our effort builds upon the GWAS Catalog with more specific applications and use cases, focused on protein-coding genes and well defined traits semantically related to disease states relevant to discovery of drugs and druggable targets.

Dependencies

  • R 3.6+; readr, data.table, igraph, muStat, RMySQL (Webapp: shiny, DT, shinyBS, shinysky, plotly)
  • Python 3.7+; pandas, BioClients
  • Java 8+; Jena, IU_IDSL_JENA

GWAS Catalog features

  • GWAS Catalog studies each have a study_accession. Also are associated with a publication (PubMedID), but not uniquely.
  • OR_or_BETA: Reported odds ratio or beta-coefficient associated with strongest SNP risk allele. Note that if an OR <1 is reported this is inverted, along with the reported allele, so that all ORs included in the Catalog are >1. Appropriate unit and increase/decrease are included for beta coefficients.
  • MAPPED_GENE: Gene(s) mapped to the strongest SNP. If the SNP is located within a gene, that gene is listed. If the SNP is intergenic, upstream and downstream genes are listed. May be chromosomal location or range (e.g. "LOC102723594 - LOC285043").
  • Documentation: methods; curation; fileheaders
  • Reference: Buniello, A. et al. (2019) The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res., 47, D1005โ€“D1012.

Issues

  • Beta coefficients require units and thus are not comparable between non-convertible units (e.g. mg vs mm). Nor are beta coefficients comparable with OR, so it is questionable that these values are combined in one field OR_or_BETA. Current TIGA workaround is to use simple count of beta values supporting gene-trait association.

Features of TIGA

  • Protein-coding gene to disease association focus.
  • Evidence assessment based on confirmatory statistics.
  • iCite annotations from iCite API, via PMIDs from GWAS Catalog.
  • Visualization of associations for a given disease by scatter plot of effect size versus meanRankScore, inverse multivariate mean rank of benchmark-validated variables.

TIGA Workflow

See WORKFLOW.md for details describing how to update the TIGA dataset from sources.

TIGA Application

TIGA Downloads

Latest release and archives of full dataset and utility files.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.