Giter Site home page Giter Site logo

Comments (4)

Linhua-Sun avatar Linhua-Sun commented on July 17, 2024 1

Thank you very much for your detailed reply. Your reply has greatly improved our understanding. We will further test it based on your suggestions. At present, perhaps the best way is to directly calculate the value of each loop, and find a hard suitable threshold.
We are also testing the pareidolia.
Thanks again!

from chromosight.

cmdoret avatar cmdoret commented on July 17, 2024

Hi @Linhua-Sun ,

Glad you're enjoying the software !

That's a good question, I usually go with option 2: Merge all samples -> detect to get a list of positions common to all samples, and then use it to quantify in each sample separately. This will yield a value for each sample on the same set of positions and you can compare between conditions using multiple replicates. Unfortunately, unless you have many replicates, it is hard to define "DEG loops", so you'll most likely have to apply a threshold.

When you do this, you need to be careful that all samples have ~the same coverage (--subsample flag is useful here), otherwise higher coverage samples will yield better loops.

I am working on a tool (koszullab/pareidolia) to detect significantly different patterns between conditions, but this is still very much a work in progress and unstable.

from chromosight.

Linhua-Sun avatar Linhua-Sun commented on July 17, 2024

Hi:
Thank you very much for your quick reply. The information you provided is very important. I am looking forward to your new tools.
By the way, we got two replicates for each conditions, is it OK to perform the "DEG loops" analysis?
Thanks again!

from chromosight.

cmdoret avatar cmdoret commented on July 17, 2024

This is a bit tricky, for RNAseq it was proposed that the minimum should be 6 replicates, and 12 to get a good statistical power (source: https://pubmed.ncbi.nlm.nih.gov/27022035/). In practice, you almost never see that many replicates due to cost limitations.

I think we face the same issue with Hi-C; there are tools for differential Hi-C analysis based on edgeR, such as diffHiC. There is also CHESS that works well. These tools may identify regions with "significantly" different contacts (they are agnostic to patterns and only look at contact increase / decrease).

One thing you could do is to run these tools and then see if you have Chromosight loops that fall in these differential regions. But with 2 replicates, only regions with very strong changes will be "statistically significant". So basically you will detect loops that completely disappear, but not subtle changes which could be important for regulation. Detection of these "subtle change" (dubbed 3D QTL) has been done previously with 11 replicates (source: https://link.springer.com/article/10.1186/s13059-019-1855-4).

I saw this method which claims to detect differential loops even with 2 replicates, but tbh I have never managed to get it to work...

Alternatively, you could try to directly compare loop scores as I mentioned previously using Chromosight (or try using pareidolia, which essentially does the same thing), knowing you will have no statistically significant results due to sample size. If you already have genes or regions you're interested in, you could still check the score difference between conditions, and where it falls in the distribution. For now, what I do in pareidolia is applying a signal-to-noise ratio (replicate variance < condition variance) and a percentile threshold.

from chromosight.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.