:seedling: :clipboard: :arrow_right: :8ball: :arrow_right: :palm_tree: :evergreen_tree: :deciduous_tree: - A quick phylogenetic tree building pipeline!
Using trimal 1.4 has given this error:
Error: the symbol 'K' accesing the matrix is not defined in this object
but I do not know why! The data looked fine.
Trimal v1.2 works fine...
zero sized files make the process fail downstream - you can't trim no data - but it throws an exception and exits rather than continuing to the next. zero sized files come up every now and again due to errors previously... so it's best to error report them and move on rather than stopping...
Files are output in the directory "run_id" from the YAML file, perhaps this should be called "output_dir" or something to be more clear....
Moreover, we could keep a run ID and have an output dir as well... that way you can run multiple runs of the same data in one project directory: e.g.
your_project/
your_project/1e-20
your_project/1e-30
I have a sed command that is supposed to check for selenocysteine (U) and other junk in the sequence files of each run - as MAFFT has a hissy fit when it sees them and the "--anysymbol" seems to produce shit alignments. The consequence is zero-sized files and then downstream programs quit/crash and then the pipeline fails. needs to be sorted! 1) make sure the U and non-IUPAC protein codes are repalced by X and 2) fail gracefully!!!!!