mapo9 / big-mex

This project forked from pereiramemo/big-mex

BiG-MEx implementation as Docker images and R packages

License: GNU General Public License v3.0

Shell 8.38% HTML 89.99% R 0.31% Dockerfile 1.31%

big-mex's Introduction

BiG-MEx

BiG-MEx: a tool for the mining of Biosynthetic Gene Cluster (BGC) domains and classes in metagenomic data. It consists of the following modules:

  1. run_bgc_dom_annot: fast identification of BGC protein domains.
  2. run_bgc_dom_div: BGC domain-based diversity analysis.
  3. run_bgc_class_pred: BGC class abundance predictions.

Citation

Pereira-Flores, E., Buttigieg, P. L., Medema, M. H., Meinicke, P., Glöckner, F. O. and Fernandez-Guerra, A. (2018+). Mining metagenomes for natural product biosynthetic gene clusters: unlocking new potential with ultrafast techniques. Under review.

Installation

BiG-MEx consists of five docker images:

  1. epereira/bgc_dom_annot
  2. epereira/bgc_dom_amp_div
  3. epereira/bgc_dom_meta_div
  4. epereira/bgc_dom_merge_div
  5. epereira/bgc_class_pred

Before running BiG-MEx, you need to install Docker.

Then just clone the GitHub repository:

git clone git@github.com:pereiramemo/BiG-MEx.git

All five images are hosted on Docker Hub. They are downloaded automatically the first time you run the scripts.
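If you prefer to fetch the images ahead of the first run (for example, before working offline), the loop below is a minimal sketch of how that could look. It is not part of the BiG-MEx scripts: the `pull_images` function and the `DRY_RUN` switch are hypothetical names introduced here, and the default image tag is assumed. By default it only prints the commands it would run.

```shell
#!/usr/bin/env bash
# Image names are taken from the installation list; the default tag is assumed.
images=(
  epereira/bgc_dom_annot
  epereira/bgc_dom_amp_div
  epereira/bgc_dom_meta_div
  epereira/bgc_dom_merge_div
  epereira/bgc_class_pred
)

# Hypothetical switch: DRY_RUN=1 (default) prints the commands,
# DRY_RUN=0 actually calls docker.
DRY_RUN="${DRY_RUN:-1}"

pull_images() {
  local img
  for img in "${images[@]}"; do
    if [ "${DRY_RUN}" = "1" ]; then
      echo "docker pull ${img}"
    else
      docker pull "${img}"
    fi
  done
}

pull_images
```

Save it under any file name and run it with `DRY_RUN=0` to perform the downloads once the printed commands look right.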

Documentation

The run_bgc_*.bash scripts run the docker images, which include all the code, dependencies and data used in the analysis. Because the scripts call docker, they have to be executed with sudo if your user is not in the docker group (Linux or macOS).
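To find out which case applies to you, a quick check along these lines may help. This is a sketch only: `needs_sudo` is a hypothetical helper, not something shipped with BiG-MEx, and it looks solely at group membership (it does not account for running as root or for rootless Docker setups).

```shell
#!/usr/bin/env bash
# Hypothetical helper: returns 0 (true) when the space-separated
# group list passed as $1 does NOT contain "docker".
needs_sudo() {
  local groups_list="$1"
  local g
  for g in ${groups_list}; do
    if [ "${g}" = "docker" ]; then
      return 1
    fi
  done
  return 0
}

# id -nG prints the names of all groups the current user belongs to.
if needs_sudo "$(id -nG)"; then
  echo "Not in the docker group: run the run_bgc_*.bash scripts with sudo."
else
  echo "In the docker group: sudo is not needed."
fi
```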

1. bgc_dom_annot

This first module runs UProC using a BGC domain profile database. It takes unassembled metagenomic data as input and outputs a BGC domain abundance profile table.

See help

./run_bgc_dom_annot.bash . . --help

2. bgc_dom_div

The bgc_dom_div has three different modes: amplicon (amp), metagenome (meta), and merge. The first two modes have the objective of analyzing the BGC domain diversity in amplicon and metagenomic samples. The diversity analysis consists of estimating the operational domain unit (ODU) diversity, blasting the domain sequences against a reference database, and placing the domain sequences onto reference trees. The merge mode integrates the amplicon or metagenome diversity results of different samples to provide a comparative analysis.

See help

./run_bgc_dom_div.bash amp . . --help

./run_bgc_dom_div.bash meta . . . --help

./run_bgc_dom_div.bash merge . . --help
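Since the first argument selects the mode, a wrapper script could validate it before handing over to run_bgc_dom_div.bash. The following is a minimal sketch under that assumption; `is_valid_mode` is a hypothetical helper and the script's own argument handling may differ.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds only for the three modes named above.
is_valid_mode() {
  case "$1" in
    amp|meta|merge) return 0 ;;
    *) return 1 ;;
  esac
}

# Default to "amp" purely for illustration when no argument is given.
mode="${1:-amp}"

if is_valid_mode "${mode}"; then
  echo "mode '${mode}' is supported"
else
  echo "unknown mode '${mode}': expected amp, meta, or merge" >&2
fi
```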

3. bgc_class_pred

This module is based on the bgcpred R package, which includes a library of BGC class abundance models. Based on the domain profile generated by bgc_dom_annot, this module computes the BGC class abundance profile.

See help

./run_bgc_class_pred.bash . . --help

See the wiki for further documentation.

big-mex's People

Contributors

pereiramemo, genomewalker
