Giter Site home page Giter Site logo

Adding GetOrganelle about taxprofiler HOT 10 CLOSED

erinyoung avatar erinyoung commented on July 22, 2024
Adding GetOrganelle

from taxprofiler.

Comments (10)

Midnighter avatar Midnighter commented on July 22, 2024 1

Given that this is the first time we see this request, maybe it'd make sense for @erinyoung to adapt the taxprofiler pipeline for their purposes as a proof-of-concept, and then we decide if/how to adopt it?

from taxprofiler.

jfy133 avatar jfy133 commented on July 22, 2024

Hmmm, that's an interesting one. It half fits the scope, but I'm a little wary of the assembly bit. @nf-core/taxprofiler what do you think? (and @maxibor ?)

from taxprofiler.

sofstam avatar sofstam commented on July 22, 2024

I am not sure into what category this tool would fall into. It seems a bit specific to me and agree regarding the assembly part.

from taxprofiler.

jfy133 avatar jfy133 commented on July 22, 2024

So to me basically it:

  • Assembles as many organelle genomes as possible to get long contiguous sequences
  • Tries to identify which species the organelle comes from by comparing to a set of mt database

I think conceptually this would actually fit. Just rather than short-read alignment or kmer-comparison, it does 'long-read' comparison to a database (the main difference is that it generates the 'long reads' itself).

from taxprofiler.

jfy133 avatar jfy133 commented on July 22, 2024

Given that this is the first time we see this request, maybe it'd make sense for @erinyoung to adapt the taxprofiler pipeline for their purposes as a proof-of-concept, and then we decide if/how to adopt it?

What do you mean by PoC - as in make a fork, add it, and see if it makes sense?

I think conceptually it does what we want (I just need to check the output), it's just outside our typical direct kmer/alignment of reads concept

from taxprofiler.

jfy133 avatar jfy133 commented on July 22, 2024

I just had a quick look: @erinyoung does the tool at all produce a OTU/taxon like table as output at all? I tried to look through and couldn't find anything like that. The closest thing to a table was listing gene loci rather than species

from taxprofiler.

Midnighter avatar Midnighter commented on July 22, 2024

What do you mean by PoC - as in make a fork, add it, and see if it makes sense?

As PoC, I meant to add the modules and make data flow adjustments needed to get the pipeline to work as needed for the purpose, yes.

from taxprofiler.

erinyoung avatar erinyoung commented on July 22, 2024

I created a nf-core module for getOrganelle (nf-core/modules#4484). The output is a fasta file with either complete or partial organelle/plasmidome sequences.

from taxprofiler.

jfy133 avatar jfy133 commented on July 22, 2024

Thanks @erinyoung !

So if the output of the module is simply fasta files, I don't consider that in scope for taxprofiler - as that means it is simply just an assembler.

However I saw there is this utility function: https://github.com/Kinggerm/GetOrganelle/wiki/Usage#summary_get_organelle_outputpy

Depending on what the output of that looks like, this may sort of make it fit.

from taxprofiler.

erinyoung avatar erinyoung commented on July 22, 2024

My apologies, but I've encountered other priorities. I may get back into the issue at a later, but am closing this for now.

from taxprofiler.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.