Giter Site home page Giter Site logo

sayalaruano / bactnets-colombrainforestconv Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 7.21 MB

A project to evaluate the differences in bacterial association networks from rainforest and converted pasturelands in the Colombian Amazon region using CCLasso and SPRING inference methods

R 77.77% Jupyter Notebook 22.23%
microbiome-data network-analysis network-biology r rainforest netcomi

bactnets-colombrainforestconv's Introduction

Evaluating differences in bacterial association networks from rainforest and converted pasturelands in the Colombian Amazon region

Table of contents:

About the project

Microbial association networks (MANs) provide insights into potential ecological interactions among microbes, including mutualism, competition, and more. Furthermore, these networks can reveal communities that share ecological functions or keystone taxa playing crucial roles in the system. In MANs, nodes correspond to Operational Taxonomical Units (OTUs) at a given taxonomic rank, and edges between nodes denote significant co-presence (positive relationships) or mutual exclusion (negative relationships) patterns in OTU abundances across samples. Multiple association metrics are available for inferring MANs, mainly founded on correlation, proportionality, and conditional dependence approaches.

In this project, the CCLasso (correlation-based) and SPRING (conditional dependence-based) methods were used to explore differences in microbiomes found in the rainforest and converted pasturelands of the northwest Colombian Amazon region.

The NetCoMi R package v1.1 was the central framework to perform the network inference, analysis, visualization, and comparison. It provides a computational workflow that involves calculating associations among OTUs using a specified metric, applying sparsification if necessary, and converting these associations into dissimilarities and subsequently into similarities, resulting in the adjacency matrix for the inferred networks.

Dataset

The raw sequencing data used for this project is available at the PRJEB44163 project from the European Nucleotide Archive (ENA) database. This research collected 52 soil samples from the Colombian Amazon region: 36 from rainforest areas and 16 from converted pasturelands. The rainforest samples were the reference with minimal intervention, while the pastureland represented the land use systems.

MGnify is a platform that automates the analysis of metagenomics datasets from ENA and other databases. The abundance table and taxonomic profiles to infer the MANs in this project were obtained from the MGYS00005779 study, which applied the MGnify's pipeline v5. The first step was downloading the data and metadata using the MGnifyR package v0.1, enabling the utilization of the MGnify API in R scripts. Then, the data was preprocessed and manipulated using the Phyloseq v1.44 and Microbiome v1.22 R packages. The dataset was filtered to retrieve only bacterial OTUs, excluding taxa from other life kingdoms. In addition, the data was aggregated at the family taxonomic level, obtaining 200 OTUs.

Structure of the repository

The main files and directories of this repository are:

File Description
Fetch_analyses_Mgnify.ipynb Jupyter notebook to retrieve data from MGnify using MGnifyR
Preprocessing_metag_data.R Script to preprocess the metagenomics dataset
Build_bact_assoc_nets.R Script to infer bacterial association networks
Network_analysis_and_cond_compar.R Script to analyze, visualize, and compare networks
Networks_fam_and_phylum.R Script to make visualization of networks colored by Phylum
Compare_pred_edges_inf_methods.R Script to calculate coincidence percentage of associations among networks
Data/ Folder with the metagenomics data stored in a phyloseq object
Results/ Folder with edge lists of inferred networks, figures, and txt with comparison results

Credits

Further details

More details about the biological background of the project, the interpretation of the results, and ideas for further work are available in this pdf report.

Contact

If you have comments or suggestions about this project, you can open an issue in this repository, or email me at [email protected].

bactnets-colombrainforestconv's People

Contributors

sayalaruano avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.