Comments (6)
Hi,
- I assume the sampleIDs are individual 10x captures (i.e. no cell barcoding or such)?
- What kind of tissue is this? adult or developmental/trajectory-like?
- Do you know how much cells were put into the machine originally?
- Could you plot a distribution of the
split_D$scDblFinder.score
?
(FYI you should avoid using@
; the colData columns can be accessed directly withsplit_D$whatever
)
from scdblfinder.
Thank you for the quick reply!
- Yes.
- They are sorted immune cells from adult mice.
- I aimed for 10k cells for sequencing. For GEM generation, I input 10- 20 k cells per sample (the vert starting step). And in the end, I only captured 1.4-6.5k cells as mentioned above.
- See below
p1= hist(split_D$scDblFinder.score,plot = F)
p1$density <- p1$counts/sum(p1$counts) * 100
plot(p1, freq = FALSE)
Hi,
* I assume the sampleIDs are individual 10x captures (i.e. no cell barcoding or such)? * What kind of tissue is this? adult or developmental/trajectory-like? * Do you know how much cells were put into the machine originally? * Could you plot a distribution of the `split_D$scDblFinder.score`? (FYI you should avoid using `@`; the colData columns can be accessed directly with `split_D$whatever`) **Thanks**
from scdblfinder.
Hi,
ok this is as I thought, I'm afraid you really do have ~10% or so doublets.
The determining factor for the doublet rate is the number of cells loaded, as this influences the density and hence the probability that two are captured in the same droplet. The fact that many of these cells were for instance too damaged (or otherwise...) to pass cellranger's early QC (i.e. calls of what's a cell and what's an empty droplet) doesn't influence the doublet rate. (Note that this isn't the only possible explanation for few cells / few reads in cells)
So sorry if it's a disappointment for you, but I think scDblFinder does a nice job of finding them despite having the wrong expected doublet rate :)
from scdblfinder.
Hi,
I see. So I should not look at the number of cells recovered from sequencing to determine the doublet rate. But for some reason, unfortunately, my recover rate is significantly lower than expected (as listed by 10X), right?
Computationally, scDblFinder only knows the number of cells I recovered from 10X. Therefore, the expected doublet rate (dbr
) is probably determined by the recovered cell number, isn't it? How come the threshold for scDblFinder.score was decided so that the actual doublet rate is more than 2x of the expected rate? These questions may sound naive but I am curious 😅
from scdblfinder.
Hi,
Yes, you have a lower recovery rate than expected. I'm really not an expert there, but in my experience this has typically been attributable to low cell viability and/or expired/contaminated reagents (e.g. the buffer), but you'd have better luck trying to understand this with wet lab people.
Yes, scDblFinder estimates the dbr
from the recovered cells. However, the thresholding is not only based on this: as described in the paper, it's also based on the ability to correctly classify artificial doublets. This often has a larger influence than the expected doublet rate, and in your case rescued the thresholding.
from scdblfinder.
Thank you very much, plger!
You can close this issue now.
from scdblfinder.
Related Issues (20)
- Error running scDblFinder HOT 4
- how to get layers info after running scDblFinder HOT 2
- Multiple errors: samples argument and Matrix HOT 4
- Removing genes from count matrix before doublet detection HOT 3
- SCTransform example in vignette not working HOT 1
- Unreasonably high doublets rate HOT 4
- Running scDblFinder before or after normalization? HOT 2
- Error running scDblFinder HOT 3
- non-interpretable error message when providing non-integer as nfeatures argument HOT 1
- Clarify recommended nFeatures/artificialDoublets for scATACseq doublet removal in vignette HOT 2
- Doublet filtering in Parse Biosciences data HOT 6
- Unable to run scDblFinder - "as_cholmod_sparse" does not exist HOT 4
- Unable to install scDblFinder HOT 2
- Ambient RNA Removal HOT 4
- Compatibility between Seurat, matrix and scDblFinder versions HOT 4
- error in scATAC HOT 1
- Does ‘Size factors should be positive’ error matter? HOT 4
- "did not converge" Error on cellbender3 HOT 8
- How can I add results to back my original file HOT 2
- Error running scDblFinder HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scdblfinder.