Giter Site home page Giter Site logo

Comments (6)

plger avatar plger commented on June 19, 2024

Hi,

  • I assume the sampleIDs are individual 10x captures (i.e. no cell barcoding or such)?
  • What kind of tissue is this? adult or developmental/trajectory-like?
  • Do you know how much cells were put into the machine originally?
  • Could you plot a distribution of the split_D$scDblFinder.score?
    (FYI you should avoid using @; the colData columns can be accessed directly with split_D$whatever)

from scdblfinder.

zqun1 avatar zqun1 commented on June 19, 2024

Thank you for the quick reply!

  1. Yes.
  2. They are sorted immune cells from adult mice.
  3. I aimed for 10k cells for sequencing. For GEM generation, I input 10- 20 k cells per sample (the vert starting step). And in the end, I only captured 1.4-6.5k cells as mentioned above.
  4. See below
p1= hist(split_D$scDblFinder.score,plot = F)
p1$density <- p1$counts/sum(p1$counts) * 100
plot(p1, freq = FALSE) 

image

Hi,

* I assume the sampleIDs are individual 10x captures (i.e. no cell barcoding or such)? 

* What kind of tissue is this? adult or developmental/trajectory-like? 

* Do you know how much cells were put into the machine originally?

* Could you plot a distribution of the `split_D$scDblFinder.score`?
  (FYI you should avoid using `@`; the colData columns can be accessed directly with `split_D$whatever`) **Thanks**

from scdblfinder.

plger avatar plger commented on June 19, 2024

Hi,
ok this is as I thought, I'm afraid you really do have ~10% or so doublets.
The determining factor for the doublet rate is the number of cells loaded, as this influences the density and hence the probability that two are captured in the same droplet. The fact that many of these cells were for instance too damaged (or otherwise...) to pass cellranger's early QC (i.e. calls of what's a cell and what's an empty droplet) doesn't influence the doublet rate. (Note that this isn't the only possible explanation for few cells / few reads in cells)
So sorry if it's a disappointment for you, but I think scDblFinder does a nice job of finding them despite having the wrong expected doublet rate :)

from scdblfinder.

zqun1 avatar zqun1 commented on June 19, 2024

Hi,
I see. So I should not look at the number of cells recovered from sequencing to determine the doublet rate. But for some reason, unfortunately, my recover rate is significantly lower than expected (as listed by 10X), right?

Computationally, scDblFinder only knows the number of cells I recovered from 10X. Therefore, the expected doublet rate (dbr) is probably determined by the recovered cell number, isn't it? How come the threshold for scDblFinder.score was decided so that the actual doublet rate is more than 2x of the expected rate? These questions may sound naive but I am curious 😅

from scdblfinder.

plger avatar plger commented on June 19, 2024

Hi,

Yes, you have a lower recovery rate than expected. I'm really not an expert there, but in my experience this has typically been attributable to low cell viability and/or expired/contaminated reagents (e.g. the buffer), but you'd have better luck trying to understand this with wet lab people.

Yes, scDblFinder estimates the dbr from the recovered cells. However, the thresholding is not only based on this: as described in the paper, it's also based on the ability to correctly classify artificial doublets. This often has a larger influence than the expected doublet rate, and in your case rescued the thresholding.

from scdblfinder.

zqun1 avatar zqun1 commented on June 19, 2024

Thank you very much, plger!
You can close this issue now.

from scdblfinder.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.