Input CSV structure (taxprofiler, closed)

nf-core commented on May 27, 2024
Input CSV structure

from taxprofiler.

Comments (19)

Midnighter commented on May 27, 2024

I would use whatever https://github.com/nf-core/fetchngs produces as the input format, so that it generally has the same columns, though with different headers. I would drop the format column; if needed, the format can be inferred from the filenames.

jfy133 commented on May 27, 2024

Sorry yup - eager is TSV 😅

maxulysse commented on May 27, 2024

Sarek was TSV too; we're now CSV.

jfy133 commented on May 27, 2024

Don't abandon me!

Back on topic:

accept: fastq, fq, fasta, fna, fa + all with .gz
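
As a rough illustration of how the accepted extensions above could be mapped to a format without a dedicated format column, here is a minimal Python sketch. The function name `infer_format` and the two extension sets are assumptions made for this example; this is not the check that taxprofiler itself uses.

```python
from pathlib import Path

# Extensions taken from the list above; a trailing .gz is allowed on all of them.
FASTQ_EXTENSIONS = {".fastq", ".fq"}
FASTA_EXTENSIONS = {".fasta", ".fna", ".fa"}


def infer_format(filename: str) -> str:
    """Return 'fastq' or 'fasta' based on the file extension (hypothetical helper)."""
    suffixes = [suffix.lower() for suffix in Path(filename).suffixes]
    # Ignore a trailing .gz so that reads.fastq.gz is treated like reads.fastq.
    if suffixes and suffixes[-1] == ".gz":
        suffixes = suffixes[:-1]
    if not suffixes:
        raise ValueError(f"Cannot infer format, no extension found: {filename}")
    extension = suffixes[-1]
    if extension in FASTQ_EXTENSIONS:
        return "fastq"
    if extension in FASTA_EXTENSIONS:
        return "fasta"
    raise ValueError(f"Unsupported extension '{extension}' in: {filename}")


for name in ("reads_R1.fastq.gz", "contigs.fa", "assembly.fna.gz"):
    print(name, infer_format(name))
```

Matching is case-insensitive here, which is a design choice of the sketch rather than something stated in the thread.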

jfy133 commented on May 27, 2024

https://support.illumina.com/content/dam/illumina-support/help/Illumina_DRAGEN_Bio_IT_Platform_v3_7_1000000141465/Content/SW/Informatics/Dragen/PolyG_Trimming_fDG.htm

jfy133 commented on May 27, 2024

OK, actually I agree; that's what I based this on. Do you have an example of a fetchngs sheet?

jfy133 commented on May 27, 2024

I ran it once and the only samplesheet I got was filled with millions of columns which I didn't like

jfy133 commented on May 27, 2024

Never mind, I saw this:

  --nf_core_pipeline  [string]  Name of supported nf-core pipeline e.g. 'rnaseq'. A samplesheet for direct use with the pipeline will be created with the appropriate columns.

so we can customise it I guess

Midnighter commented on May 27, 2024

Yeah, it adds a lot of columns but we can pick the ones we need. I do think it's nice, though, if the pipeline keeps all input columns. This makes it easier for users to add any kind of meta information that they would like. The minimal information, in my opinion, is:

sample,fastq_1,fastq_2
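
To make the "keep all input columns" idea concrete, here is a minimal Python sketch that requires only the three columns above and carries every other column along as free-form metadata. The function name `read_samplesheet`, the `meta` key, and the example column `collection_site` are hypothetical and only serve the illustration.

```python
import csv
import io

# The minimal columns proposed above; anything else is kept as metadata.
REQUIRED_COLUMNS = ("sample", "fastq_1", "fastq_2")


def read_samplesheet(handle):
    """Parse a CSV samplesheet, keeping extra columns under a 'meta' key."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [column for column in REQUIRED_COLUMNS if column not in header]
    if missing:
        raise ValueError(f"Missing required column(s): {', '.join(missing)}")
    rows = []
    for row in reader:
        required = {column: row[column] for column in REQUIRED_COLUMNS}
        # Every user-supplied column that is not required is passed through untouched.
        meta = {key: value for key, value in row.items() if key not in REQUIRED_COLUMNS}
        rows.append({**required, "meta": meta})
    return rows


example = io.StringIO(
    "sample,fastq_1,fastq_2,collection_site\n"
    "s1,s1_R1.fastq.gz,s1_R2.fastq.gz,lake\n"
    "s2,s2_R1.fastq.gz,,river\n"
)
for entry in read_samplesheet(example):
    print(entry)
```

An empty `fastq_2` value is allowed in this sketch so that single-end samples can share the same header.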

maxulysse commented on May 27, 2024

@jfy133 I think this is csv and not tsv

Midnighter commented on May 27, 2024

CSV seems to be the standard in nf-core pipelines. In Python it's quite easy to allow both, but I think that's harder in Nextflow.

maxulysse commented on May 27, 2024

Not at all, you have the splitCsv operator: https://www.nextflow.io/docs/latest/operator.html#splitcsv

Midnighter commented on May 27, 2024

Yes, but it cannot "sniff" if it's CSV or TSV by itself, so you either need to hard code it, look at the file extension, or let the user determine it.
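
For comparison, the three options named above (user-determined, sniffing, file extension) could be combined in Python roughly as follows. This is only a sketch: the function `detect_delimiter` and its parameters are made up for this example, and it relies on the standard library's `csv.Sniffer`, which is a heuristic and can fail on very small or single-column inputs.

```python
import csv
from typing import Optional


def detect_delimiter(sample_text: str, filename: str = "", user_choice: Optional[str] = None) -> str:
    """Pick a delimiter: explicit user choice first, then sniffing, then file extension."""
    # 1. Let the user determine it.
    if user_choice is not None:
        return user_choice
    # 2. Try to sniff it, restricted to comma and tab.
    try:
        return csv.Sniffer().sniff(sample_text, delimiters=",\t").delimiter
    except csv.Error:
        pass
    # 3. Fall back to the file extension, defaulting to CSV.
    return "\t" if filename.lower().endswith(".tsv") else ","


csv_text = "sample,fastq_1,fastq_2\ns1,a.fq.gz,b.fq.gz\ns2,c.fq.gz,d.fq.gz\n"
tsv_text = csv_text.replace(",", "\t")
print(repr(detect_delimiter(csv_text)))
print(repr(detect_delimiter(tsv_text)))
```

The order of the three steps is a design choice of the sketch; a pipeline could just as well trust the extension first and only sniff when it is ambiguous.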

maxulysse commented on May 27, 2024

Oh I see what you mean, then yes you're right.
And as you said, csv is the standard in DSL2 nf-core pipelines.

jfy133 commented on May 27, 2024

@maxibor and I decided to go for an explicit .fasta column as this means fastq_1 and fastq_2 can be taken directly from fetchNGS

jfy133 commented on May 27, 2024

Should change platform to the specific machine, as we need 2/4 colour chemistry info

Midnighter commented on May 27, 2024

"should change platform to specific machine, as we need 2/4 colour chemistry info"

Can you provide some more context, please, why this is needed?

jfy133 commented on May 27, 2024

@maxibor did you add a check that you can't supply FASTA and FASTQ on the same line?
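
A check of this kind might look like the following Python sketch. It assumes the column names `fastq_1`, `fastq_2`, and `fasta` discussed earlier in the thread; the function `check_row` is hypothetical and is not the validation code that ended up in the pipeline.

```python
def check_row(row: dict) -> None:
    """Raise ValueError if a samplesheet row mixes FASTQ and FASTA input (hypothetical helper)."""
    sample = row.get("sample", "<unknown>")
    has_fastq = bool(row.get("fastq_1") or row.get("fastq_2"))
    has_fasta = bool(row.get("fasta"))
    if has_fastq and has_fasta:
        raise ValueError(f"Sample {sample}: supply either FASTQ or FASTA on one line, not both")
    if not has_fastq and not has_fasta:
        raise ValueError(f"Sample {sample}: no FASTQ or FASTA file given")
    # A second read file without a first one is treated as an error in this sketch.
    if row.get("fastq_2") and not row.get("fastq_1"):
        raise ValueError(f"Sample {sample}: fastq_2 given without fastq_1")


check_row({"sample": "s1", "fastq_1": "s1_R1.fastq.gz", "fastq_2": "", "fasta": ""})
check_row({"sample": "s2", "fastq_1": "", "fastq_2": "", "fasta": "s2.fasta.gz"})
```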

jfy133 commented on May 27, 2024

I think this is set for now, can reopen if more issues crop up
