garyluu / skylab

This project forked from humancellatlas/skylab

Secondary analysis pipelines

License: BSD 3-Clause "New" or "Revised" License

R 12.93% WDL 66.30% Python 5.91% Shell 3.92% Jupyter Notebook 10.93%

skylab

Secondary analysis pipelines for the Human Cell Atlas.

Pipelines

Bold indicates that this pipeline is blessed

How to run pipelines from skylab

For now, use git clone git@github.com:HumanCellAtlas/skylab.git and run the pipeline in Cromwell.

  1. WDL and Cromwell Documentation
  2. Running WDLs in Cromwell

[TODO] Update this section with better instructions on how to easily try our pipelines.

[TODO] Blessed HCA Pipelines in Methods Repositories

  • Agora
  • Dockstore

Example of how to add a pipeline (workflow) to Dockstore and run it

After Dockstore registration has been completed, register the pipeline (workflow) on Dockstore with the following fields:

  • "HumanCellAtlas/skylab" as the "Source Code Repository"
  • "/pipelines/smartseq2_single_sample/SmartSeq2SingleSample.wdl" as the "Workflow Path"
  • "/pipelines/smartseq2_single_sample/dockstore_SmartSeq2SingleSampleExample.json" as the "Test Parameter File Path"

After registering, publish the pipeline (workflow). It will then be available through Dockstore's GA4GH TRS endpoints and the Dockstore CLI.

Dockstore CLI

Download the test parameter file (dockstore_SmartSeq2SingleSampleExample.json), which wget saves under the name tests, and run the pipeline (workflow) using:

$ wget https://dockstore.org:8443/api/ga4gh/v2/tools/%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master/PLAIN_WDL/tests
$ dockstore workflow launch --entry github.com/HumanCellAtlas/skylab:master --json tests

TRS and Cromwell

See the list of files available using GET /api/ga4gh/v2/tools/{id}/versions/{version_id}/{type}/files:

$ curl -X GET "https://dockstore.org:8443/api/ga4gh/v2/tools/%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master/WDL/files" -H "accept: application/json"

This will indicate which files are needed to run the pipeline (workflow). It should look like:

[
  {
    "file_type": "PRIMARY_DESCRIPTOR",
    "path": "/pipelines/smartseq2_single_sample/SmartSeq2SingleSample.wdl"
  },
  {
    "file_type": "TEST_FILE",
    "path": "/pipelines/smartseq2_single_sample/dockstore_SmartSeq2SingleSampleExample.json"
  },
  {
    "file_type": "SECONDARY_DESCRIPTOR",
    "path": "HISAT2.wdl"
  },
  {
    "file_type": "SECONDARY_DESCRIPTOR",
    "path": "Picard.wdl"
  },
  {
    "file_type": "SECONDARY_DESCRIPTOR",
    "path": "RSEM.wdl"
  }
]
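
To decide what to download, it can help to group this listing by file_type. The following is a rough sketch that parses a copy of the response shown above; the function name group_by_file_type is hypothetical, and the response is pasted in as a string rather than fetched from the server:

```python
import json
from collections import defaultdict

# Copy of the example response shown above (not fetched live).
response_text = """[
  {"file_type": "PRIMARY_DESCRIPTOR", "path": "/pipelines/smartseq2_single_sample/SmartSeq2SingleSample.wdl"},
  {"file_type": "TEST_FILE", "path": "/pipelines/smartseq2_single_sample/dockstore_SmartSeq2SingleSampleExample.json"},
  {"file_type": "SECONDARY_DESCRIPTOR", "path": "HISAT2.wdl"},
  {"file_type": "SECONDARY_DESCRIPTOR", "path": "Picard.wdl"},
  {"file_type": "SECONDARY_DESCRIPTOR", "path": "RSEM.wdl"}
]"""


def group_by_file_type(files):
    """Map each file_type to the list of paths reported for it, in order."""
    grouped = defaultdict(list)
    for entry in files:
        grouped[entry["file_type"]].append(entry["path"])
    return dict(grouped)


grouped = group_by_file_type(json.loads(response_text))

# Primary descriptor first, then the WDL files it imports.
descriptors = grouped.get("PRIMARY_DESCRIPTOR", []) + grouped.get("SECONDARY_DESCRIPTOR", [])
print(descriptors)
print(grouped.get("TEST_FILE", []))
```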

Download each descriptor using GET /api/ga4gh/v2/tools/{id}/versions/{version_id}/{type}/descriptor/{relative_path} for both primary and secondary descriptors:

$ wget https://dockstore.org:8443/api/ga4gh/v2/tools/%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master/PLAIN_WDL/descriptor//pipelines/smartseq2_single_sample/SmartSeq2SingleSample.wdl
$ wget https://dockstore.org:8443/api/ga4gh/v2/tools/%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master/PLAIN_WDL/descriptor/HISAT2.wdl
$ wget https://dockstore.org:8443/api/ga4gh/v2/tools/%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master/PLAIN_WDL/descriptor/Picard.wdl
$ wget https://dockstore.org:8443/api/ga4gh/v2/tools/%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master/PLAIN_WDL/descriptor/RSEM.wdl
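
The four wget URLs above follow one pattern: the descriptor endpoint followed by the path exactly as it appears in the file listing. A small sketch of that pattern, assuming the URL prefix is the one used in the commands above (descriptor_url and local_name are illustrative names, not part of any Dockstore tooling):

```python
# Assumption: prefix copied verbatim from the wget commands above.
VERSION_URL = (
    "https://dockstore.org:8443/api/ga4gh/v2/tools/"
    "%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master"
)


def descriptor_url(relative_path: str, descriptor_type: str = "PLAIN_WDL") -> str:
    """Return the download URL for one primary or secondary descriptor."""
    # The path is appended unchanged, so a path with a leading "/" yields
    # the "descriptor//pipelines/..." form seen in the first command above.
    return f"{VERSION_URL}/{descriptor_type}/descriptor/{relative_path}"


def local_name(relative_path: str) -> str:
    """File name to save under: the last component of the path."""
    return relative_path.rsplit("/", 1)[-1]


paths = [
    "/pipelines/smartseq2_single_sample/SmartSeq2SingleSample.wdl",
    "HISAT2.wdl",
    "Picard.wdl",
    "RSEM.wdl",
]
for path in paths:
    print(local_name(path), descriptor_url(path))
```

Saving every descriptor under its bare file name places the secondary WDL files next to the primary one, which is the layout the wget commands above produce.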

Download the test parameter file using GET /api/ga4gh/v2/tools/{id}/versions/{version_id}/{type}/tests:

$ wget https://dockstore.org:8443/api/ga4gh/v2/tools/%23workflow%2Fgithub.com%2FHumanCellAtlas%2Fskylab/versions/master/PLAIN_WDL/tests

To run the workflow locally using Cromwell, you may first need to download all the files referenced in the input parameter file and change every https path in that file to the corresponding local path. Then run the workflow using:

$ java -jar cromwell-*.jar run SmartSeq2SingleSample.wdl -i tests
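
The path rewriting mentioned above can be sketched as follows. This is only an illustration: it rewrites URLs to local paths and does not download anything, the input keys and example.org URLs are invented, and the real parameter file may be structured differently.

```python
import json
import os
from urllib.parse import urlparse


def localize(value, download_dir):
    """Recursively replace http(s) URLs with paths under download_dir."""
    if isinstance(value, str) and value.startswith(("http://", "https://")):
        # Keep only the file name from the URL path.
        return os.path.join(download_dir, os.path.basename(urlparse(value).path))
    if isinstance(value, list):
        return [localize(item, download_dir) for item in value]
    if isinstance(value, dict):
        return {key: localize(item, download_dir) for key, item in value.items()}
    return value  # numbers, booleans, and non-URL strings stay as they are


# Hypothetical inputs; these keys and URLs are NOT taken from the real test file.
example_inputs = {
    "ExampleWorkflow.reference": "https://example.org/data/reference.tar.gz",
    "ExampleWorkflow.sample_name": "sample_1",
    "ExampleWorkflow.fastqs": ["https://example.org/data/r1.fastq.gz"],
}

local_inputs = localize(example_inputs, "inputs")
print(json.dumps(local_inputs, indent=2))
```

In practice the dictionary would be loaded from the downloaded tests file with json.load, and the result written back out before being passed to Cromwell with -i.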

[TODO] Tagged releases

Pipelines are published as tagged releases, both to track changes between pipeline versions and to record when a pipeline is blessed.

[TODO] Automated testing options

[TODO] Onboarding pipelines

Contributors

ambrosejcarr, jishuxu, mckinsel, rexwangcc, samanehsan, garyluu, jsotobroad, kbergin
