General Question Hi Jin, I am won

Cromwell supports <a href="https://cromwell.readthedocs.io/en/develop/Con

Cromwell supports <a href="https://cromwell.readthedocs.io/en/develop/Configuring/#cal

Resume function in the pipeline about atac-seq-pipeline HOT 4 CLOSED

encode-dcc commented on June 2, 2024

Resume function in the pipeline

from atac-seq-pipeline.

Comments (4)

Cristinex commented on June 2, 2024 1

Cromwell supports "call-caching" and I am sure that is what you are looking for. It skips tasks that are already done based on docker hash, md5sum hash of input files and code lines in commands { } block. You may need to install a MySQL server and run a cromwell server with it and submit jobs via REST API.

But there are some limitations. Any change in a docker container (that is... any single update/bug-fix on the pipeline) will make you lose resumability. So you will not find it very useful.

We don't have an official tutorial or README for this. But I think the following documnets will be helpful.

https://github.com/ENCODE-DCC/atac-seq-pipeline/blob/master/docs/tutorial_sherlock.md#running-multiple-pipelines-with-cromwell-server-mode

https://github.com/ENCODE-DCC/atac-seq-pipeline/tree/master/test#how-to-run-a-cromwell-server-on-gc

I think the documents maybe helpful. However, the web page links you posted are now invalid. Could you update links to the document, please?

from atac-seq-pipeline.

leepc12 commented on June 2, 2024

Cromwell supports "call-caching" and I am sure that is what you are looking for. It skips tasks that are already done based on docker hash, md5sum hash of input files and code lines in commands { } block. You may need to install a MySQL server and run a cromwell server with it and submit jobs via REST API.

But there are some limitations. Any change in a docker container (that is... any single update/bug-fix on the pipeline) will make you lose resumability. So you will not find it very useful.

We don't have an official tutorial or README for this. But I think the following documnets will be helpful.

from atac-seq-pipeline.

shanmukhasampath commented on June 2, 2024

Hi Jin,

Thank you very much for the documentation. I will look into it.

from atac-seq-pipeline.

leepc12 commented on June 2, 2024

My comment on this issue was written on 2018. It's outdated.

Please take a look at these docs for SLURM and GC:
https://github.com/ENCODE-DCC/caper#running-pipelines-on-slurm-clusters
https://github.com/ENCODE-DCC/caper/tree/master/scripts/gcp_caper_server

MySQL is no longer needed for call-caching.
File-based DB (db=file in Caper's conf file ~/.caper/default.conf) should work fine for a small number of workflows.

from atac-seq-pipeline.

Recommend Projects

Resume function in the pipeline about atac-seq-pipeline HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent