bhelmi / metapathways

This project forked from hallamlab/metapathways

A modular pipeline for constructing Pathway/Genome Databases from environmental sequence information

Home Page: http://hallam.microbiology.ubc.ca/MetaPathways

MetaPathways: A modular pipeline for constructing Pathway/Genome Databases from environmental sequence information

Abstract

Background: A central challenge to understanding the ecological and biogeochemical roles of microorganisms in natural and human-engineered ecosystems is the reconstruction of metabolic interaction networks from environmental sequence information. The dominant paradigm in metabolic reconstruction is to assign functional annotations using BLAST. These annotations are then projected onto symbolic representations of metabolism in the form of KEGG pathways or SEED subsystems.

Results: Here we present MetaPathways, an open source pipeline for pathway inference that uses the PathoLogic algorithm to construct environmental Pathway/Genome Databases (ePGDBs) compatible with the editing and navigation features of Pathway Tools. The pipeline accepts assembled or unassembled nucleotide sequences, performs quality assessment and control, predicts and annotates noncoding genes and open reading frames, and produces inputs to PathoLogic. In addition to constructing ePGDBs, MetaPathways uses MLTreeMap to build phylogenetic trees for selected taxonomic anchor and functional gene markers, and converts General Feature Format (GFF) files into concatenated GenBank files for ePGDB construction based on third-party annotations. It also generates useful file formats, including Sequin files for direct GenBank submission and gene feature tables summarizing annotations, MLTreeMaps and ePGDB pathway coverage summaries for statistical comparisons.
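
As an illustration of the GFF handling mentioned above, the following is a minimal sketch of reading GFF feature lines into a simple feature table. It is not MetaPathways code: the `Feature` type, the function names, and the example record are all hypothetical, and it assumes GFF3-style tab-separated lines with nine columns and `key=value` attributes.

```python
from typing import Dict, List, NamedTuple


class Feature(NamedTuple):
    """One row of a hypothetical gene feature table."""
    seqid: str
    kind: str
    start: int
    end: int
    strand: str
    attributes: Dict[str, str]


def parse_gff_line(line: str) -> Feature:
    """Parse one tab-separated, nine-column GFF feature line."""
    cols = line.rstrip("\n").split("\t")
    if len(cols) != 9:
        raise ValueError(f"expected 9 columns, got {len(cols)}")
    seqid, _source, kind, start, end, _score, strand, _phase, attrs = cols
    # Column 9 holds semicolon-separated key=value pairs (GFF3 style).
    attributes: Dict[str, str] = {}
    for pair in attrs.split(";"):
        if "=" in pair:
            key, value = pair.split("=", 1)
            attributes[key.strip()] = value.strip()
    return Feature(seqid, kind, int(start), int(end), strand, attributes)


def feature_table(lines: List[str]) -> List[Feature]:
    """Collect features, skipping blank lines and '#' comments or directives."""
    return [
        parse_gff_line(line)
        for line in lines
        if line.strip() and not line.startswith("#")
    ]


# Hypothetical input: one directive line and one made-up CDS record.
example = [
    "##gff-version 3",
    "contig_1\texample\tCDS\t10\t310\t.\t+\t0\tID=orf_1;product=hypothetical protein",
]
table = feature_table(example)
```

A table like this could then be written out in whatever downstream format is needed; the actual conversion to GenBank files in MetaPathways is not shown here.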

Conclusions: MetaPathways provides users with a modular annotation and analysis pipeline for predicting metabolic interaction networks from environmental sequence information, using an alternative to KEGG pathway and SEED subsystem mapping. It is extensible to genomic and transcriptomic datasets from a wide range of sequencing platforms, and generates useful data products for microbial community structure and function analysis. The MetaPathways software package, installation instructions, and example data can be obtained from http://hallam.microbiology.ubc.ca/MetaPathways

Keywords: Environmental Pathway/Genome Database (ePGDB), metagenome, Pathway Tools, PathoLogic, MetaCyc, microbial community, metabolism, metabolic interaction networks

Please cite: Konwar, Kishori M., et al. "MetaPathways: a modular pipeline for constructing pathway/genome databases from environmental sequence information." BMC Bioinformatics 14.1 (2013): 202.

Contributors: nielshanson, kishori82