Giter Site home page Giter Site logo

genemunge's Introduction

genemunge

Tools for munging genomic data such as:

  • Converting between different types of gene identifiers
  • Searching for terms in the Gene Ontology (GO) associated with a keyword
  • Looking up housekeeping genes and transcription factors
  • Getting a list of GO terms associated with a given gene
  • Looking up how a gene is expressed across tissues
  • Normalizing a matrix of gene expression data by converting to TPM

Unlearn.AI

When we’re not developing super awesome open source packages like genemunge, we help biopharma partners use unsupervised deep learning to extract insights from their omics data. Learn more at unlearn.health.

Install

This library is accompanied by the following data sources:

Installing this package through pip (pip install genemunge from PyPI, pip install . from GitHub) will use the static data that accompanies this repository.

If you wish to use the latest data from the above sources, you may install in "develop" mode from GitHub with pip -e install .. Notably, this will download and process the recount2 GTEx data, requiring R and the recount package from bioconductor:

source("https://bioconductor.org/biocLite.R")
biocLite("recount")

Citations

Please cite the following papers if you make use of genemunge for a publication.

This package: Fisher C, Smith A, Walsh J, https://www.biorxiv.org/content/early/2018/04/10/299107

Gene Ontology: Ashburner et al. Gene ontology: tool for the unification of biology (2000) Nat Genet 25(1):25-9 GO Consortium, Nucleic Acids Res., 2017

recount2: Collado-Torres L, Nellore A, Kammers K, Ellis SE, Taub MA, Hansen KD, Jaffe AE, Langmead B, Leek JT. Reproducible RNA-seq analysis using recount2. Nature Biotechnology, 2017.

HGNC: Gray KA, Yates B, Seal RL, Wright MW, Bruford EA. genenames.org: the HGNC resources in 2015. Nucleic Acids Res. 2015 Jan;43(Database issue):D1079-85.

Transcription factors: TFcheckpoint: a curated compendium of specific DNA-binding RNA polymerase II transcription factors Konika Chawla; Sushil Tripathi; Liv Thommesen; Astrid Laegreid; Martin Kuiper Bioinformatics 2013.

Housekeeping genes: E. Eisenberg and E.Y. Levanon, Trends in Genetics 29, (2013)

Similar tools

If you know of similar tools that would be helpful references for users, please contribute an attribution to them here.

  1. goatools
  2. goenrich

GO evidence codes

Experiment:
 - Inferred from Experiment (EXP)
 - Inferred from Direct Assay (IDA)
 - Inferred from Physical Interaction (IPI)
 - Inferred from Mutant Phenotype (IMP)
 - Inferred from Genetic Interaction (IGI)
 - Inferred from Expression Pattern (IEP)

Computational:
 - Inferred from Sequence or structural Similarity (ISS)
 - Inferred from Sequence Orthology (ISO)
 - Inferred from Sequence Alignment (ISA)
 - Inferred from Sequence Model (ISM)
 - Inferred from Genomic Context (IGC)
 - Inferred from Biological aspect of Ancestor (IBA)
 - Inferred from Biological aspect of Descendant (IBD)
 - Inferred from Key Residues (IKR)
 - Inferred from Rapid Divergence(IRD)
 - Inferred from Reviewed Computational Analysis (RCA)

Literature:
 - Traceable Author Statement (TAS)
 - Non-traceable Author Statement (NAS)

Other:
 - Inferred by Curator (IC)
 - No biological Data available (ND) evidence code
 - Inferred from Electronic Annotation (IEA)

Common gene id types

['symbol','name','entrez_id','ensembl_gene_id','refseq_accession','uniprot_ids']

genemunge's People

Contributors

d-rams avatar drckf avatar jrwalsh1 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.