Giter Site home page Giter Site logo

ibrahimelzahaby / msc_thesis Goto Github PK

View Code? Open in Web Editor NEW
4.0 1.0 0.0 106.07 MB

Computational protemics analysis of cancer cell-lines at the level of single-cells

License: MIT License

R 100.00%
cancer-proteomics correlation-analysis gene-enrichment gsea kegg-pathway network-analysis psea single-cell-proteomics go-enrichment bioinformatics

msc_thesis's Introduction

Comprehensive Single-Cell Proteomics Analysis of Cancer Cell-Line Model Systems

This MSc thesis project is comprehensively investigates the proteomic profile of Melanoma, Hela, Pancreatic Ductal Adenocarcinoma (PDAC), and Monocytes (U937) cancer cell-lines.

Table of Contents

  • Project Overview
  • Analysis Worklow
    • Create Venn diagram
    • Gene Set Enrichment Analysis (GSEA)
    • KEGG pathway analysis
    • Network analysis
    • Cell Clustering:
      • Principal Component Analysis (PCA)
      • Uniform Manifold Approximation and Projection (UMAP)
  • License
  • Acknowledgments

Project Overview

In this project, we gained a comprehensive understanding of Four cancer cell lines based on their proteome profiles at the level of single-cell. By integrating various analytics approaches, we identified Gene Ontology (GO) terms, the pathways associated with each cancer type, hub proteins, and clustering subpopulations. The complete project code can be found here.

Analysis Workflow

The analysis workflow consists of the following pipeline:

Create Venn diagram

A venn diagram is created to visualize the overlap of the identified proteins in four cancer cell-lines using SCoPE approaches. This diagram provides the intersected proteins that will be used for further analysis across different cancer types. You can check the code and data files here.

Gene Set Enrichment Analysis (GSEA)

GSEA is performed to identify the enriched gene sets and biological pathways associated with each cancer cell line. The enrichment analysis helped uncover the biological processes and pathways that plays a vital role in each cancer type. More details can be found in Enrichment_analysis file.

KEGG pathway analysis

KEGG pathway analysis is conducted to identify the key signaling pathways that are dysregulated in the different cancer cell lines. By analyzing the pathway-level protein expression, we can gain insights into the underlying biological mechanisms driving each cancer type.

Network analysis

Protein-Protein network analysis is performed to identify protein-protein correlations and construct functional protein networks for each cancer cell line. This analysis helps uncover the interconnectedness of proteins and potential hubs that play critical roles in Melanoma and Hela cancer types.

Cell Clustering

In this file, Cell clustering techniques are applied to the proteomic data to identify distinct cell populations within each cancer cell line. Two commonly used methods are utilized:

Principal Component Analysis (PCA)

Here we applied PCA to reduce the dimensionality of the data and visualize the variation between different cell populations within each cancer type. This technique helps identify clusters and patterns in the proteomic data.

Uniform Manifold Approximation and Projection (UMAP)

UMAP is a dimensionality reduction technique that preserves the local structure of the data. It is used to visualize and identify distinct cell populations within each cancer cell line based on their proteomic profiles. you can find the code in this R file.

License

This project is released under the MIT License.

Acknowledgments

I would like to acknowledge the Slavov laboratory where the data was obtained for this project. This project was conducted as part of the master's thesis at Systems and Personalized Medicine Division - Laboratory of Systems and Synthetic Biology at Wageningen University and Research. I extend my gratitude to my supervisors, Dr. Cristina Furlan and Dr. Edoardo Saccenti for their guidance and support throughout the process of completing this project.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.