Giter Site home page Giter Site logo

Comments (7)

dimus avatar dimus commented on June 2, 2024 1

Hello @cpauvert, thank you for nice words about gnverifier. Yes, sure, I can add the List of Prokaryotic Names to data sources. I am already registered, so it should not be a problem to download it. I will add the list this or next week.

from gnverifier.

cpauvert avatar cpauvert commented on June 2, 2024 1

Awesome, thanks @dimus for your quick reply. Do not hesitate to ping once you do, so that I can try it out and point my microbiologists colleague to your resource as well!

from gnverifier.

dimus avatar dimus commented on June 2, 2024 1

@cpauvert sure, you welcime!

You can also try to limit searches:

image

It would add a filter ds=208 (to only show data from this data-source) to the query:

https://verifier.globalnames.org/?capitalize=on&ds=208&format=html&names=Bacteroides+vulgatus+

The same can be done for OpenRefine https://github.com/gnames/gnverifier/wiki/OpenRefine-readme#filters-to-remove-false-positive-matches

from gnverifier.

craynaud007 avatar craynaud007 commented on June 2, 2024 1

Hello @dimus,

I work with @cpauvert and I just tested gnverfier with OpenRefine, thanks for this initiative.
While using it I encounter some issues. First, I tried to select the LPSN data source as explain in “Reconciling taxonomic names in OpenRefine via Global Names” and it work but partially. In fact, some species that were not in the LPSN, had a match according to another data source even if I specified to use only LPSN. So, the question is how can I restrict to one data source on OpenRefine?

The second problem I encountered was due to Current Name. As an example, I reconciled Bacteroides vulgatus and it gave me Bacteroides vulgatus Eggerth and Gagnon 1933. And if I look deeper in it, it is said that the current Name is Phocaeicola vulgatus (cf. screenshot). So, the question is how can I say I only want the current name and not the older one?

image

Best Regards

from gnverifier.

dimus avatar dimus commented on June 2, 2024

@cpauvert I added LPSN, if you find any problems, please reopen the ticket and let me know

from gnverifier.

cpauvert avatar cpauvert commented on June 2, 2024

Hi @dimus
Thanks for the quick implementation!

I tried out with Bacteroides vulgatus expecting to be properly corrected to Phocaeicola vulgatus as per the LPSN indication, but I was not...

https://verifier.globalnames.org/?capitalize=on&format=tsv&names=Bacteroides+vulgatus

Until I realized that the match to the LPSN did appear if I ticked the "Show all matches"

https://verifier.globalnames.org/?all_matches=on&capitalize=on&format=tsv&names=Bacteroides+vulgatus

It's just that the LPSN score was slightly lower 9.41496 vs 9.41391. Any reason for a difference in score when the match were essentially the same?

BEst,

PS: can't wait to try this out with openrefine!!

from gnverifier.

cpauvert avatar cpauvert commented on June 2, 2024

Hi,

@cpauvert I added LPSN, if you find any problems, please reopen the ticket and let me know

I cannot reopen the issue as I'm not a collaborator, let us know if you'd rather have a separate issue to discuss.

I can expand @craynaud007 comments here (and tagging @magelm here as well), it seems the LPSN API actually does not indicate the current valid name directly. For instance, see the API output with Bacteroides vulgatus (https://api.lpsn.dsmz.de/fetch/773979) where only Bacteroides vulgatus is mentioned, but it does indicate the record with the current name:

lpsn_correct_name_id:	7841

This record then actually points (as expected) to Phocaeicola vulgatus (https://api.lpsn.dsmz.de/fetch/7841)

@dimus are you using the API or the data dumps?
BEst,

from gnverifier.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.