Giter Site home page Giter Site logo

czi-rfa's Introduction

czi-rfa

Application to "Collaborative Computational Tools for the Human Cell Atlas" https://chanzuckerberg.com/initiatives/rfa

I just decided to write one of these. I am interested in methods to define the gene expression latent space for single cells, and to connect that back to large numbers of bulk transcriptomes. Drop a line via issues if you're interested in networking our proposals. Please also note if you'd be interested in reviewing some text via PR. I'd love to operate under a pull request + review model if I can find sufficient reviewers.

Thanks! Casey

Other related applications

czi-rfa's People

Contributors

cgreene avatar ejfertig avatar gwaybio avatar huqiwen0313 avatar jaclyn-taroni avatar lanagarmire avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

czi-rfa's Issues

Aim 3 - Arjun's Data and Experimental Design

As @cgreene noted in #3

We will need more detail from Arjun to provide an example. I think you should make a hypothetical example of the latent space arithmetic experimental design. What about doing it as single cell data + bulk perturbation & comparing to single cell + perturbation? That seems like the cleanest design to test your hypothesis.

We will use this issue to discuss the nature of the data and experimental design to test the hypothesis:

that the latent space will preserve vector arithmetic operations between state transitions within benchmark perturbation expression datasets.

The issue can be closed once the experimental design is updated in aim 3.

RFA Bullet Points

Copied from the RFA

The goals of this RFA include, but are not limited to:

  • Developing standard formats and analysis pipelines for genomic, proteomic, and imaging data, in forms that enable consistent use of these pipelines by numerous experimental labs
  • Identifying and solving common challenges for web-based interactive visualization of cellular and imaging data
  • Developing user tools that allow scientists and physicians to extract and analyze data organized by genes, cells, or tissues of interest
  • Supporting analytical methods and machine learning approaches to solving problems such as multimodal integration, inference of state transitions and developmental trajectories, and representation of spatial relationships at the cellular or molecular level
  • Generating curated benchmark datasets from new or existing data for evaluating computational methods and designing future analysis competitions
  • Developing new computational approaches to comparing and normalizing genomic and imaging data across assays, subjects, and species
  • Generating experimental datasets that directly address computationally-guided questions in quality control, reproducibility, or multimodal integration

Use zero-inflated reconstruction loss

Single-cell RNA-seq data is sparse/zero-inflated. There are several works [1-3] showing advantages of modeling zero-inflated distances between cells over L-1/L-2 distances.

For the VAE in aim 1, I expect that using zero inflated loss rather than euclidean reconstruction loss would work significantly better.

[1] https://genomebiology.biomedcentral.com/articles/10.1186/s13059-015-0805-z
[2] https://arxiv.org/abs/1610.05857
[3] https://genomebiology.biomedcentral.com/articles/10.1186/s13059-017-1188-0

Garmire Application

@lanagarmire mentioned to me that she will be putting in an application for a visualization & workflow system. I will mention this in our collaborative network too.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.