Comments (7)
The plan is to use PSCBS to perform segmentation of both log2 copy ratios and allele frequencies if a VCF file of SNVs is provided. If the VCF contains a paired normal/control sample in addition to the tumor/test sample, take advantage of that.
Emit a .cns file with extra columns indicating allele-specific values.
from cnvkit.
@chapmanb I saw you added a similar feature in bcbio for BubbleTree, but segmenting on allele frequencies instead of rather than in addition to total copy number. It looks like this would be generally useful to have in CNVkit. Am I reading that right? If so, mind if I borrow the code and expose it through CNVkit's segment
command?
from cnvkit.
Eric;
You're absolutely welcome to any of my code that's of use. However the BubbleTree work is not well tested right now and needs work to get up to date with the latest BubbleTree development version so it might not be the best time to grab it. I'm going to try and reboot the heterogeneity analysis work in bcbio and can give you a heads up when this is better tested and integrated.
from cnvkit.
Loosely relevant to this issue, here's a paper where PyClone was used with CNVkit to identify/examine subclones. I'll have to read this paper more carefully and maybe ask the lead author about the experience.
from cnvkit.
Would be very interested if segment could work with allele frequencies solely. Is there a way to do this?
As implemented--including an extra column of the average b-allele frequency of a copy number segment--is worthless because copy neutral loss of heterozygosity can drive many dramatic changes in baf within a single copy number segment.
from cnvkit.
You could probably manage it through the cnvlib
API -- see the implementation of cnvlib.segment.__init__
where the variants array is used, it's just a few lines to run segmentation on the SNP allele frequencies once you have loaded them via cnvlib.cmdutil.load_het_snps
.
Once you've segmented your read depth ratios and/or SNP b-allele frequencies, use the call
command to infer integer total copy number and allele-specific copy number (the cn
, cn1
and cn2
columns, respectively). Uniparental disomy shows up in the latter columns but not the former.
from cnvkit.
@etal OK thanks. I got the variants loaded through the API, but can you give another hint on how to call which method in segments/init to segment on just baf?
from cnvkit.
Related Issues (20)
- baits and target bed file
- How to merge vcfs of different samples?
- Running CNVkit with VCF files
- how to install the cnvkit v0.9.11 by the tar.gz source
- No Results HOT 1
- Installation error HOT 3
- Unable to produce .cnr files HOT 2
- Getting BAF and allelic imbalance in export vcf
- Improper post-processing of segment [bug]
- Unable to use fix without antitarget file for WGS samples
- Merging segments while running pipeline
- `segment` function fails HOT 1
- Somatic CNV calling in case of pre-existing CNVs
- Installation problems HOT 1
- Unable to load BED file with skgenome
- cnvkit.py segment error
- CNVkit segment command RuntimeError: Subprocess command failed
- access command: recommended -s (mingap) value
- Blank plots HOT 1
- Possible bug in bin sizes
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cnvkit.