Giter Site home page Giter Site logo

rkelly712 / mouse_organogenesis_10x_multiome_publication Goto Github PK

View Code? Open in Web Editor NEW

This project forked from rargelaguet/mouse_organogenesis_10x_multiome_publication

0.0 0.0 0.0 2.22 MB

Code to reproduce the analysis of "Decoding gene regulation in the mouse embryo using single-cell multi-omics""

Shell 0.04% Python 6.33% R 34.04% Jupyter Notebook 59.60%

mouse_organogenesis_10x_multiome_publication's Introduction

Decoding gene regulation in the mouse embryo using single-cell multi-omics

This repository contains the scripts to reproduce the results of the manuscript Decoding gene regulation in the mouse embryo using single-cell multi-omics.

Abstract

Following gastrulation, the three primary germ layers develop into the major organs in a process known as organogenesis. Single-cell RNA sequencing has enabled the profiling of the gene expression dynamics of these cell fate decisions, yet a comprehensive map of the interplay between transcription factors and cis-regulatory elements is lacking, as are the underlying gene regulatory networks. Here we generate a multi-omics atlas of mouse early organogenesis by simultaneously profiling gene expression and chromatin accessibility from tens of thousands of single cells. We develop a computational method to leverage the multi-modal readouts to predict transcription factor binding events in cis-regulatory elements, which we then use to infer gene regulatory networks that underpin lineage commitment events. Finally we show that these models can be used to generate in silico predictions of the effect of transcription factor perturbations. We validate this experimentally by showing that Brachyury is essential for the differentiation of neuromesodermal progenitors to somitic mesoderm fate by priming cis-regulatory elements.

Content

  • /acc/: analysis of chromatin accessibility data
  • /rna/: analysis of RNA expression data
  • /accrna/: simultaneous analysis of RNA expression and chromatin accessibility data

Snakemake pipeline

We provide snakemake pipelines that can be used to reproduce many results.

  • /rna/snakemake: snakemake pipeline for RNA expression
  • /atac/ArchR/snakemake: snakemake pipeline for chromatin accessibility using ArchR
  • /rna_atac/snakemake: snakemake pipeline to integrate RNA expression and chromatin accessibility results (MOFA, in silico ChIP-seq, etc.)

Please note that the snakemake pipeline is rather complex and needs to be simplified and polished. It is currently useful to get an idea of the pipeline, but bare in mind that it won't work straight away.

IGV Genome browser session

We provide a precomputed IGV Genome Browser Session that can be used to interactively explore the ATAC-seq profiles, as shown in the screenshot below:

It can be downloaded running the following command line:

wget ftp://ftpusr92:[email protected]/igv_session_celltype.tar.gz

Then load the file igv_session.xml using File -> Open Session.

R Shiny app

The R shiny app for interactive data analaysis is available here

Directories

  • Mapping to the reference atlas: /rna/mapping
  • MOFA dimensionality reduction: /rna_atac/mofa
  • Analysis of gene markers: /rna_atac/rna_vs_acc/pseudobulk/gene_markers_rna_vs_acc
  • in silico ChIP-seq: /rna_atac/virtual_chipseq_library
  • Metacell inference: /rna/metacells/run
  • Catalogue of TF activities per cell type (Figure 3): /rna_atac/rna_vs_chromvar_chip/pseudobulk/per_celltype
  • Gene regulatory network of NMP differentiation (Figure 4): /rna_atac/gene_regulatory_networks/metacells/trajectories

Data

The data can be downloaded from the following FTP server:

Hostname 	ftp1.babraham.ac.uk
Username 	ftpusr92
Password 	5FqIACU9
FTP URL 	ftp://ftpusr92:[email protected]

Directory structure:

  • sample_metadata.txt.gz: cell metadata file
  • results: results folder
    • rna: results based on RNA expression alone
    • atac: results based on chromatin accessibility alone
    • rna_atac: results based on both RNA expression and chromatin accessibility
  • data: data folder
    • original: CellRanger output files
    • processed: processed data objects
      • rna: Seurat, anndata and SingleCellExperiment objects.
      • atac/archR: ArchR objects
  • igv_session_celltype.tar.gz: IGV session of celltype-specific ATAC profiles
  • igv_session_brachyury_ko.tar.gz: IGV session of celltype-specific ATAC profiles for the Brachyury KO study

To download a specific file:

wget ftp://ftpusr92:[email protected]/data/processed/rna/SingleCellExperiment.rds .

To download everything (~60GB):

wget -r ftp://ftpusr92:[email protected]/ .

If your download from the FTP server is slow, we also provide a temporary download via Dropbox here

Twitter thread

We all know this is the most important day in the road towards a scientific publication:

Twitter URL

Contact

We have created a Slack group to discuss results, questions, collaborations, etc. related to the study. Feel free to drop by using this link. Alternatively, feel free to reach me via email at [email protected]

mouse_organogenesis_10x_multiome_publication's People

Contributors

rargelaguet avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.