Rapid whole genome SNP alignments from multiple draft genomes
Core genome SNP alignments are useful, but spatial information is lost because converved columns are removed, and non-core regions are excluded. Scapper is a fast way to take a bunch of genomes (closed or draft contigs) and produce a whole genome alignment.
- MUMmer >= 3.23
- BioPerl >= 1.60
- TrimAL >= 1.4
% cd $HOME
% git clone https://github.com/tseemann/scapper.git
% $HOME/scapper/bin/scapper -h
% ls
reference.fa draft1.fa draft2.fa draft3.fa
% scapper --output outname reference.fa draft1.fa draft2.fa draft3.fa
% ls outname.*
outname.aln outname.nogaps.aln outname.core.aln outname.nogaps.aln
output.aln
has the global alignment with conserved columns and gapsoutput.nogaps.aln
has the global alignment with all gap columns removedoutput.core.aln
has the global alignment with conserved columns removedoutput.nogaps.core.aln
has the global alignment with gaps and conserved columns removed (aka "core SNPs")
- Everything is relative to the first genome supplied