broadinstitute / warp-tools Goto Github PK
View Code? Open in Web Editor NEWThis repository contains all containers that WARP uses.
License: BSD 3-Clause "New" or "Revised" License
This repository contains all containers that WARP uses.
License: BSD 3-Clause "New" or "Revised" License
As of #6 , TagSort is now completely decoupled from the fastqpreprocessing code - the .cc/.h files and Makefile rules they're compiled from are entirely disjoint. So it should be seamless to move TagSort out of warp-tools/fastqpreprocessing, into a new warp-tools/TagSort directory.
An alternative (if wanting to keep tools of the same pipeline together) would be to rename the fastqpreprocessing directory to indicate there are two tools in it. However, having the directory structure make clear that there are no dependencies between the two to worry about is a valuable complexity reduction, one that is reliable and lasting compared to a handful of people temporarily "knowing" that it is the case. So keeping them together should look like warp-tools/some_new_name/TagSort and warp-tools/some_new_name/fastqpreprocessing.
I cloned the repo just now, and was confused to find the changes from #6 - which github showed as merged - not existing. It turned out that there is a custom "develop" branch that is set to be the default. This is a problem, because "develop" is currently behind master.
AFAICT the "develop" branch is intended as a staging area, I think for full-on end-to-end tests involving building a Docker image and everything. If possible to do that sort of thing directly on the branches that PRs are created off of, then that would be cleaner: the feature branch is going to exist anyways, so the "develop" setup is just adding the complexity of an extra step in the process of submitting+deploying code.
If there is something more essential about the "develop" setup that I am missing, then automation is required to make this behind-master situation impossible.
The usage of TagSort that actually produces a sorted version of the input file (as opposed to just being used to compute metrics) writes a tab-separated ASCII SAM file. TagSort's input is a binary BAM. Since the point of BAM is to save space, and the point of TagSort being C++ is to handle huge data efficiently, it seems likely that someone using sorting feature might prefer BAM output.
It's also possible just converting from SAM to BAM after TagSort is done would work well enough to not justify the additional work this would take.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.