Comments (19)
I would use as the input format what is spit out by https://github.com/nf-core/fetchngs so it generally has the same columns but different headers for it. I would drop the format column. If needed, that can be figured out from the filenames.
from taxprofiler.
Sorry yup - eager is TSV 😅
from taxprofiler.
Sarek was TSV too, we're now csv
from taxprofiler.
Don't abanon me!
Back on topic:
accept: fastq, fq, fasta, fna. fa + all with .gz
from taxprofiler.
from taxprofiler.
OK actually I agree, that's what I actually based this off of. Do you have an example of a fetchngs sheet?
from taxprofiler.
I ran it once and the only samplesheet I got was filled with millions of columns which I didn't like
from taxprofiler.
Nevermind, I saw this:
--nf_core_pipeline [string] Name of supported nf-core pipeline e.g. 'rnaseq'. A samplesheet for direct use with the pipeline will be created with
the appropriate columns.
so we can customise it I guess
from taxprofiler.
Yeah, it adds a lot of columns but we can pick the ones we need. I do think it's nice, though, if the pipeline keeps all input columns. This makes it easier for users to add any kind of meta information that they would like. The minimal information, in my opinion, is:
sample,fastq_1,fastq_2
from taxprofiler.
@jfy133 I think this is csv and not tsv
from taxprofiler.
CSV seems to be the standard in nf-core pipelines. In Python it's quite easy to allow both but that's harder in nextflow I think.
from taxprofiler.
not at all, you have the splitCsv
operator: https://www.nextflow.io/docs/latest/operator.html#splitcsv
from taxprofiler.
Yes, but it cannot "sniff" if it's CSV or TSV by itself, so you either need to hard code it, look at the file extension, or let the user determine it.
from taxprofiler.
Oh I see what you mean, then yes you're right.
And as you said, csv is the standard in DSL2 nf-core pipelines.
from taxprofiler.
@maxibor and I decided to go for an explicit .fasta
column as this means fastq_1
and fastq_2
can be taken directly from fetchNGS
from taxprofiler.
should change platform to specific machine, as we need 2/4 colour chemsity info
from taxprofiler.
should change platform to specific machine, as we need 2/4 colour chemsity info
Can you provide some more context, please, why this is needed?
from taxprofiler.
@maxibor did you add a check that if you can't supply FASTA and FASTQ in the same line?
from taxprofiler.
I think this is set for now, can reopen if more issues crop up
from taxprofiler.
Related Issues (20)
- Add small files to tower.yml
- Simplify test profiles in CI test HOT 1
- Update KrakenUniq to 1.0.4 due to bug causing misreading if input files
- Update to nf-core tempate 2.10
- Add support for sequencing coverage esimates with nonpareil HOT 4
- process `MEGAN_RMA2INFO` JavaFX detected no fonts error HOT 1
- Taxpasta parameters produce an error exit status of 2 which asks for a taxonomy even though kraken2 and centrifuge have worked. HOT 1
- Taxpasta parameters produce an error exit status of 2 which asks for a taxonomy even though kraken2 and centrifuge have worked. HOT 1
- process `VISUALIZATION_KRONA` mis-combined channel by wrong index HOT 3
- `KRONA_KTIMPORTTEXT` input file name collision if same 'db_name' is assigned to kraken2 and bracken HOT 5
- MetaPhlAn4 full index provide duplicated 'NCBI_tax_id' to taxpasta as input HOT 1
- Allow hibf in ganon? HOT 8
- General version update of all tools
- megan version 6.21.7 throw `java.lang.NullPointerException` error HOT 5
- Allow multiple bracken profiles with different `-l` levels from the same kraken2 report HOT 1
- Adding GetOrganelle HOT 10
- Add Nanoq for Nanopore reads
- Documentation: Broken Links in "Full database sheet" Section HOT 4
- Krakenuniq save_reads does not give the fastq files when using PE data HOT 18
- Centrifuge error : (ERR): mkfifo(/tmp/72.inpipe1) failed. HOT 13
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from taxprofiler.