Giter Site home page Giter Site logo

afatsony / monti Goto Github PK

View Code? Open in Web Editor NEW

This project forked from inukj/monti

0.0 0.0 0.0 168.18 MB

MONTI is a tool for analyzing large multi-omics cancer cohort data in association with clinical featuers

License: GNU General Public License v3.0

Python 1.74% Jupyter Notebook 98.26%

monti's Introduction

MONTI: A multi-omics non-negative tensor decomposition framework for gene-level integrative analysis

https://www.frontiersin.org/articles/10.3389/fgene.2021.682841/abstract

Multi-omics data is frequently measured to characterize biological mechanisms underlying phenotypes. Complex relationships in multi-omics data, if mined, can lead to more accurate classification of patient samples according to the phenotypes.

MONTI (Multi-Omics Non-negative Tensor decomposition for Integrative analysis) is a tool that can be used to integrate and analyze large sets of multi-omics data. MONTI identifies gene regulatory multi-omics features specific to a group of samples that share a common biological trait.

Below is an illustration of the analysis workflow of MONTI. workflow

The output of MONTI is a simple gene list with information of their associated subtypes, which can be used for further downstream analysis. For example, the Venn diagram below shows the genes that are found to be associated to colorectal cancer subtypes CMS1, CMS2, CMS3 and CMS4. These genes showed to be informative in separating the four subtypes as shown in the t-SNE plot.

example output


Install MONTI

MONTI is developed in python3 and can be installed as below

python3 -m pip install monti

Documentation

The functions and objects used by MONTI are documented here.

Tutorial using colon cancer data (TCGA-COAD)

A brief tutorial for using MONTI can be found under the 'tutorial' directory. The associated multi-omics data are included.

Before starting the tutorial, the dataset should be downloaded. After download decompress data by

cd <download_path>
tar -xzvf tutorial_data_coad.tar.gz

The <download_path> should also be used as the tutorial directory, or you can simply move the data to another directory to be used for the tutorial.

The data includes three omics data, 1) gene expression (mRNA), 2) methylation level and 3) miRNA expression. They are raw data directly collected from the TCGA portal.

In the jupyter notebook shows an example of how to integrate multi-omics data in a gene-level manner and extract features that can classify the molecular subtypes of COAD.

The tutorial includes the below analysis procedures:

  • gene-level transformation
  • normalization
  • feature selection
  • classification accuracy measurement and
  • plotting of the results

monti's People

Contributors

cobi-git avatar inukj avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.