nf-core / createpanelrefs Goto Github PK
View Code? Open in Web Editor NEWGenerate Panel of Normals, models or other similar references from lots of samples
Home Page: https://nf-co.re/createpanelrefs
License: MIT License
Generate Panel of Normals, models or other similar references from lots of samples
Home Page: https://nf-co.re/createpanelrefs
License: MIT License
With 40 normal samples a panel of normals file should be generated which can be used with Mutect2.
Running GATK's germline calling pipeline requires a ploidy model (created using GATK DetermineGermlineContigPloidy) and a cnv model (created using GATK GermlineCNVCaller). More details on how to create these models can be found here.
The pipeline ends with error with ~120 samples:
A USER ERROR has occurred: Bad input: Filtering removed all intervals. Select less strict filtering criteria.
However it does not generate cnv model or ploidy (a run in cohort mode with the samples provided). Only readcounts are available in folder readcounts (hdf5 format).
nextflow run /playground/nf-core-createpanelrefs_dev/dev \
-profile singularity \
--input /playground/dataset_cnvs/samplesheets_sarek/samplesheet_bams_illumina.csv \
--tools germlinecnvcaller \
--fasta /corebm/tmp/bams/hg38.fa --dict /corebm/tmp/bams/hg38.dict --fai /corebm/tmp/bams/hg38.fa.fai \
--outdir /playground/dataset_cnvs/germlinecnvcaller_illumina_reference_nochrM \
--ploidy_priors /playground/dataset_cnvs/priors.table \
--padding 250 \
--bin_length 0 \
--mappable_regions /playground/dataset_cnvs/Illumina-truseq-rapid-exome_v1.2_hg38_target_orig_nochrM.bed
nextflow version 23.10.1.5891
Workstation
local executor
singularity
ubuntu 22.04
nf-core/createpanelrefs dev (22/01/2024)
Given a number of normal samples a panel of normals file should be generated which can be used in cnvkit.
It might be worthwhile to integrate a flow to generate bin annotations for qDNAseq.
@nvnieuwk has already prepared a standalone workflow for this, but it might be a nice addition here.
https://github.com/CenterForMedicalGeneticsGhent/nf-cmgg-qdnaseq
Why don't you also convert the CRAM files to BAM and add them here?
Originally posted by @nvnieuwk in #5 (comment)
As discussed :) The instructions for it are here: https://www.ensembl.org/info/docs/tools/vep/script/vep_plugins.html#dbnsfp (click "more)
(slightly annoying it is different for 37 and 38)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.