Comments (5)
If all possibilities would be implemented, I'd think the most requested cases would be --lineage_dataset bacteria_odb10
>> --auto-lineage
> --lineage_dataset archea_odb10
> anything else. This is because usually the majority of genomes are bacteria and BUSCO evaluation results are best comparable if all bins are evaluated with the same reference data. Therefore, only making a single db available for now would be sufficient, I think. But the --auto-lineage
option looks so tempting ;)
from mag.
UPDATE:
With BUSCO version 4.0.6 there are frequent, non-reproducible errors occurring, caused by a replace("faa", "fna") function corrupting nextflow filenames that contain the substring "faa" in their hash id. I prepared a fix for BUSCO (https://gitlab.com/ezlab/busco/-/issues/305). Waiting currently.
We need a new BUSCO release before preparing pipeline release.
from mag.
Hi @skrakau . Are you referring to this type of error? I am getting a lot of these with BUSCO version 3.0.2when running mag with revision: 8586c49 [dev]
Aug-26 20:41:52.612 [Task monitor] ERROR nextflow.processor.TaskProcessor - Error executing process > 'busco (MEGAHIT-SRR9030455.61.fa)'
Caused by:
Process `busco (MEGAHIT-SRR9030455.61.fa)` terminated with an error exit status (1)
Command executed:
run_BUSCO.py --in MEGAHIT-SRR9030455.61.fa --lineage_path bacteria_odb9 --cpu "4" --blast_single_core --mode genome --out MEGAHIT-SRR9030455.61.fa >MEGAHIT-SRR9030455.61.fa_busco_log.txt
cp run_MEGAHIT-SRR9030455.61.fa/short_summary_MEGAHIT-SRR9030455.61.fa.txt short_summary_MEGAHIT-SRR9030455.61.fa.txt
for f in run_MEGAHIT-SRR9030455.61.fa/single_copy_busco_sequences/*faa; do
[ -e "$f" ] && cat run_MEGAHIT-SRR9030455.61.fa/single_copy_busco_sequences/*faa >MEGAHIT-SRR9030455.61.fa_buscos.faa || touch MEGAHIT-SRR9030455.61.fa_buscos.faa
break
done
for f in run_MEGAHIT-SRR9030455.61.fa/single_copy_busco_sequences/*fna; do
[ -e "$f" ] && cat run_MEGAHIT-SRR9030455.61.fa/single_copy_busco_sequences/*fna >MEGAHIT-SRR9030455.61.fa_buscos.fna || touch MEGAHIT-SRR9030455.61.fa_buscos.fna
break
done
Command exit status:
1
Command output:
(empty)
Command error:
cp: cannot stat ‘run_MEGAHIT-SRR9030455.61.fa/short_summary_MEGAHIT-SRR9030455.61.fa.txt’: No such file or directory
from mag.
Hi @ropolomx, to be precise, the problem I described was for BUSCO 4 versions, which prevented us from updating BUSCO to solve some issues. But in BUSCO 3.0.2 there was a related problem, which can cause such errors as you described above (BUSCO itself did not return an error, but since an output file is missing, the downstream cp
command failed).
from mag.
Solved in #103 :)
from mag.
Related Issues (20)
- Supplying a directory to --gtdbk-db breaks GTDB-Tk HOT 1
- Add coverm/contig and coverm/genome as options for calculating read alignment metrics
- Expanded documentation for nf-core/mag use cases & downstream analysis
- Add documentation detailing phageome/virome analysis
- Pipeline fails if only CONCOCT is turned on and other binners off
- CONCOCT (slow) HOT 1
- Single end data input
- Problem with DUMPSOFTWAREVERSIONS HOT 2
- Centrifuge error - (ERR): mkfifo(/tmp/46.inpipe1) failed. HOT 3
- NFCORE_MAG:MAG:GTDBTK:GTDBTK_CLASSIFYWF fails when all genomes classified by ANI screening step HOT 4
- CHECKM_LINEAGEWF failing with exit status 1 HOT 7
- Error in executing PROKKA HOT 4
- Bins in CheckM summary do not match bins in bin depths summary HOT 7
- add diamond support to mag
- Improve documentation on the effects of completeness/contamination filters
- Replace CheckM with CheckM2 HOT 1
- Genome taxnomic classification with Sourmash HOT 3
- Pipeline completed sucessfully, but with errored process(es) HOT 1
- CAT_DB process has harcoded CAT subdirectory names HOT 1
- List of tool versions HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mag.