Dannon Baker, Marius Van Den Beek, John Chilton, Nate Coraor, Delphine LaRiviere, Nicholas Keener, Sergei Kosakovsky, Anton Nekrutenko, James Taylor, Steven Weaver
This is a companion to the manuscript describing the analysis of early COVID-19 data. It contains descriptions of the workflows and the exact versions of all software used.
The analysis consists of the following steps:
- Pre-processing of raw read data
- Assembly of the SARS-CoV-2 genome
- Estimation of timing for the most recent common ancestor (MRCA)
- Analysis of variation within individual isolates
- Analysis of Spike protein substitutions
- Analysis of recombination
The analyses were performed on the Galaxy platform using open-source tools from Bioconda. All tools were run on XSEDE resources maintained by the Texas Advanced Computing Center (TACC), the Pittsburgh Supercomputing Center (PSC), and Indiana University.