Comments (4)
Cromwell supports "call-caching" and I am sure that is what you are looking for. It skips tasks that are already done based on docker hash, md5sum hash of input files and code lines in
commands { }
block. You may need to install a MySQL server and run a cromwell server with it and submit jobs via REST API.But there are some limitations. Any change in a docker container (that is... any single update/bug-fix on the pipeline) will make you lose resumability. So you will not find it very useful.
We don't have an official tutorial or README for this. But I think the following documnets will be helpful.
I think the documents maybe helpful. However, the web page links you posted are now invalid. Could you update links to the document, please?
from atac-seq-pipeline.
Cromwell supports "call-caching" and I am sure that is what you are looking for. It skips tasks that are already done based on docker hash, md5sum hash of input files and code lines in commands { }
block. You may need to install a MySQL server and run a cromwell server with it and submit jobs via REST API.
But there are some limitations. Any change in a docker container (that is... any single update/bug-fix on the pipeline) will make you lose resumability. So you will not find it very useful.
We don't have an official tutorial or README for this. But I think the following documnets will be helpful.
- https://github.com/ENCODE-DCC/atac-seq-pipeline/blob/master/docs/tutorial_sherlock.md#running-multiple-pipelines-with-cromwell-server-mode
- https://github.com/ENCODE-DCC/atac-seq-pipeline/tree/master/test#how-to-run-a-cromwell-server-on-gc
from atac-seq-pipeline.
Hi Jin,
Thank you very much for the documentation. I will look into it.
from atac-seq-pipeline.
My comment on this issue was written on 2018. It's outdated.
Please take a look at these docs for SLURM and GC:
https://github.com/ENCODE-DCC/caper#running-pipelines-on-slurm-clusters
https://github.com/ENCODE-DCC/caper/tree/master/scripts/gcp_caper_server
MySQL is no longer needed for call-caching.
File-based DB (db=file
in Caper's conf file ~/.caper/default.conf
) should work fine for a small number of workflows.
from atac-seq-pipeline.
Related Issues (20)
- Help with running the pipeline HOT 1
- Unable to run the pipeline. invalid jar file error HOT 3
- 6 days stuck on task=atac.read_genome_tsv:-1, retry=0, status=Running
- More than 10 replicates HOT 3
- Encode-atac-seq-pipeline environment can't be found?
- [Question]: Do reads need to be deduped before FRiP calculation HOT 6
- Invalid MEMLIMIT unit value with LSF jobs on Linux
- two replicates and the combined have different signal
- --read-len selection
- Memory Saving: too many large files?
- The pipeline stalled at "chip.read_genome_tsv" for local backend HOT 2
- Differences in qc when validating installation HOT 3
- Failed on fastqs having identical filename but different path HOT 3
- Confirming that separate conditions/treatments should be analyzed by separate pipelines HOT 2
- Don't need to trim adapters
- Add --ntasks-per-node or --exclusive option for your multi-process jobs.
- bam files with replicates HOT 1
- If the pipeline can be used when data is from other platforms (eg: DNBSEQ-G400). HOT 1
- Discrepancy between signal p-value and fold change bigwig tracks around the open regions
- build_genome_sh did not finish running
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from atac-seq-pipeline.