Comments (14)
Hi @Adafede, thanks for submitting this.
Yes, you are right. I just released a new version of gnames in which I tried to fix a few inconsistencies in the API, and as a result v0.8.0 has quite a few breaking changes. Sorry about the extra work for you, but I think that in the long run these changes will be justified. Please read https://github.com/gnames/gnames/releases/tag/v0.8.0
The purpose of most of these changes was to make search and verification behave almost identically. Before, the "data-sources" option and the "show all matches" option behaved differently, which was very confusing. In short:
-s 0 -- this option no longer exists; its behavior is now the default.
-s N,N2,N3 -- now limits the search to the provided data-sources.
-M -- now shows all results.
BestMatch -- in csv/tsv format, marks the best result for a name.
SortedMatch -- marks all other results; they are sorted, and a new field, SortScore, shows the numbers that were used for sorting.
Again, apologies for the breaking changes. There are quite a few of them because I tried to put them all into one release, so that I do not break other people's scripts too often. I hope the API is now close to stabilizing, but until /api/v0 becomes /api/v2 I will listen to suggestions for improving it.
from gnverifier.
At the start of next week I will publish a blog post explaining the changes I made.
from gnverifier.
No problem, I know it is for the best! 😉
My question is about -s 1,2,3.
To me, it should work without -M, but it does not. Am I right?
➜ lotus-processor git:(main) ✗ bin/gnverifier "Iris pallida" -s 1,2,3,4,179
00:40:14 INF Using config file: /Users/rutza/Library/Application Support/gnverifier.yaml.
Kind,SortScore,MatchType,EditDistance,ScientificName,MatchedName,MatchedCanonical,TaxonId,CurrentName,Synonym,DataSourceId,DataSourceTitle,ClassificationPath,Error
BestMatch,9.01315,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,3PZY6,Iris pallida Lam.,false,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris pallida,
from gnverifier.
Can you show the version of gnverifier?
With v0.8.2 I get:
✦ ❯ gnverifier "Iris pallida" -s 1,2,3,4,179
18:09:18 INF Using config file: /home/dimus/.config/gnverifier.yaml.
Kind,SortScore,MatchType,EditDistance,ScientificName,MatchedName,MatchedCanonical,TaxonId,CurrentName,Synonym,DataSourceId,DataSourceTitle,ClassificationPath,Error
BestMatch,9.01315,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,3PZY6,Iris pallida Lam.,false,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris pallida,
It shows the best result only, and with -M I get all of them:
✦ ❯ gnverifier "Iris pallida" -s 1,2,3,4,179 -M -q
Kind,SortScore,MatchType,EditDistance,ScientificName,MatchedName,MatchedCanonical,TaxonId,CurrentName,Synonym,DataSourceId,DataSourceTitle,ClassificationPath,Error
BestMatch,9.01315,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,3PZY6,Iris pallida Lam.,false,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris pallida,
SortedMatch,9.01005,Exact,0,Iris pallida,"Iris pallida Salisb., nom. illeg.",Iris pallida,3PZY5,Iris halophila Pall.,true,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris halophila,
SortedMatch,9.01005,Exact,0,Iris pallida,"Iris pallida Ten., nom. illeg.",Iris pallida,3PZY4,Iris germanica L.,true,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris germanica,
SortedMatch,8.98392,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,43223,Iris pallida Lam.,false,3,ITIS,Plantae|Viridiplantae|Streptophyta|Embryophyta|Tracheophyta|Spermatophytina|Magnoliopsida|Lilianae|Asparagales|Iridaceae|Iris|Iris pallida,
SortedMatch,8.98012,Exact,0,Iris pallida,Iris pallida,Iris pallida,49342,Iris pallida,false,2,Wikispecies,,
SortedMatch,8.94848,Exact,0,Iris pallida,Iris pallida,Iris pallida,259588,Iris pallida,false,179,Open Tree of Life,||Eukaryota|Archaeplastida|Chloroplastida|Streptophyta|Embryophyta|Tracheophyta|Euphyllophyta|Spermatophyta|Magnoliopsida|Mesangiospermae|Liliopsida|Petrosaviidae|Asparagales|Iridaceae|Iridoideae|Irideae|Iris|Iris pallida,
SortedMatch,8.91436,Exact,0,Iris pallida,Iris pallida,Iris pallida,29817,Iris pallida,false,4,NCBI,|Eukaryota|Viridiplantae|Streptophyta|Streptophytina|Embryophyta|Tracheophyta|Euphyllophyta|Spermatophyta|Magnoliopsida|Mesangiospermae|Liliopsida|Petrosaviidae|Asparagales|Iridaceae|Iridoideae|Irideae|Iris|Iris pallida,
from gnverifier.
Here:
➜ lotus-processor git:(main) ✗ bin/gnverifier -V
version: v0.8.2
build: 2022-02-25_22:48:34UTC
➜ lotus-processor git:(main) ✗ bin/gnverifier "Iris pallida" -s 1,2,3,4,179
01:11:35 INF Using config file: /Users/rutza/Library/Application Support/gnverifier.yaml.
Kind,SortScore,MatchType,EditDistance,ScientificName,MatchedName,MatchedCanonical,TaxonId,CurrentName,Synonym,DataSourceId,DataSourceTitle,ClassificationPath,Error
BestMatch,9.01315,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,3PZY6,Iris pallida Lam.,false,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris pallida,
➜ lotus-processor git:(main) ✗ bin/gnverifier "Iris pallida" -s 1,2,3,4,179 -M
01:11:45 INF Using config file: /Users/rutza/Library/Application Support/gnverifier.yaml.
Kind,SortScore,MatchType,EditDistance,ScientificName,MatchedName,MatchedCanonical,TaxonId,CurrentName,Synonym,DataSourceId,DataSourceTitle,ClassificationPath,Error
BestMatch,9.01315,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,3PZY6,Iris pallida Lam.,false,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris pallida,
SortedMatch,9.01005,Exact,0,Iris pallida,"Iris pallida Salisb., nom. illeg.",Iris pallida,3PZY5,Iris halophila Pall.,true,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris halophila,
SortedMatch,9.01005,Exact,0,Iris pallida,"Iris pallida Ten., nom. illeg.",Iris pallida,3PZY4,Iris germanica L.,true,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris germanica,
SortedMatch,8.98392,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,43223,Iris pallida Lam.,false,3,ITIS,Plantae|Viridiplantae|Streptophyta|Embryophyta|Tracheophyta|Spermatophytina|Magnoliopsida|Lilianae|Asparagales|Iridaceae|Iris|Iris pallida,
SortedMatch,8.98012,Exact,0,Iris pallida,Iris pallida,Iris pallida,49342,Iris pallida,false,2,Wikispecies,,
SortedMatch,8.94848,Exact,0,Iris pallida,Iris pallida,Iris pallida,259588,Iris pallida,false,179,Open Tree of Life,||Eukaryota|Archaeplastida|Chloroplastida|Streptophyta|Embryophyta|Tracheophyta|Euphyllophyta|Spermatophyta|Magnoliopsida|Mesangiospermae|Liliopsida|Petrosaviidae|Asparagales|Iridaceae|Iridoideae|Irideae|Iris|Iris pallida,
SortedMatch,8.91436,Exact,0,Iris pallida,Iris pallida,Iris pallida,29817,Iris pallida,false,4,NCBI,|Eukaryota|Viridiplantae|Streptophyta|Streptophytina|Embryophyta|Tracheophyta|Euphyllophyta|Spermatophyta|Magnoliopsida|Mesangiospermae|Liliopsida|Petrosaviidae|Asparagales|Iridaceae|Iridoideae|Irideae|Iris|Iris pallida,
I would expect the -M result without -M, and with -M I would expect to actually obtain the possible multiple matches per source.
from gnverifier.
Ah, yes, I understand now. This is a change in the verification procedure. It now works like this:
no flags -- searches everything and returns the best result
-s flag -- searches only the data-sources you specify and returns only the best result, limited to those data-sources
-M -- shows everything found
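For anyone calling gnverifier from a script, the three modes above could be captured in a small helper that builds the argument list. This is only a sketch: the function name gnverifier_args and its parameters are hypothetical, and it uses only the -s, -M and -q flags that appear in this thread (-q seems to suppress the log line in the examples above, but that is an inference from the output shown here).

```python
import shlex


def gnverifier_args(name, sources=None, all_matches=False, quiet=False):
    """Build an argv list for gnverifier (hypothetical helper).

    sources     -- iterable of data-source IDs, passed as -s 1,2,3
    all_matches -- add -M to show everything found, not just the best match
    quiet       -- add -q, as used in the examples in this thread
    """
    args = ["gnverifier", name]
    if sources:
        args += ["-s", ",".join(str(s) for s in sources)]
    if all_matches:
        args.append("-M")
    if quiet:
        args.append("-q")
    return args


if __name__ == "__main__":
    # Print the command line that would be run; pass the list to
    # subprocess.run(...) to actually execute it.
    argv = gnverifier_args("Iris pallida", [1, 2, 3, 4, 179], all_matches=True, quiet=True)
    print(shlex.join(argv))
```

Returning a list rather than a string avoids quoting problems with names that contain spaces when the list is handed to subprocess.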
from gnverifier.
Never mind, I read too quickly!
So what I wanted was -s 1,2,3 -M. Thanks a lot!
from gnverifier.
An option to pick only the best result per data-source:
I can add it if I get a request. Did I get a request from you? ;)
I did not add it because it is easy to pick the first result for each source using a script. So I am waiting for people to request this option again.
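A minimal sketch of such a script in Python, assuming the CSV layout shown in the -M output earlier in this thread. The function name best_per_source is hypothetical, and the sample consists of three rows copied from that output. It keeps the row with the highest SortScore for each DataSourceId, which for already sorted output is the same as taking the first row per source.

```python
import csv
import io

# Header plus three rows copied from the -M output earlier in this thread.
SAMPLE = """\
Kind,SortScore,MatchType,EditDistance,ScientificName,MatchedName,MatchedCanonical,TaxonId,CurrentName,Synonym,DataSourceId,DataSourceTitle,ClassificationPath,Error
BestMatch,9.01315,Exact,0,Iris pallida,Iris pallida Lam.,Iris pallida,3PZY6,Iris pallida Lam.,false,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris pallida,
SortedMatch,9.01005,Exact,0,Iris pallida,"Iris pallida Salisb., nom. illeg.",Iris pallida,3PZY5,Iris halophila Pall.,true,1,Catalogue of Life,Biota|Plantae|Tracheophyta|Liliopsida|Asparagales|Iridaceae|Iris|Iris halophila,
SortedMatch,8.98012,Exact,0,Iris pallida,Iris pallida,Iris pallida,49342,Iris pallida,false,2,Wikispecies,,
"""


def best_per_source(csv_text):
    """Return {DataSourceId: row}, keeping the highest SortScore per source."""
    best = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        source = row["DataSourceId"]
        score = float(row["SortScore"])
        if source not in best or score > float(best[source]["SortScore"]):
            best[source] = row
    return best


if __name__ == "__main__":
    for source, row in best_per_source(SAMPLE).items():
        print(source, row["DataSourceTitle"], row["MatchedName"], row["SortScore"])
```

When several names are verified in one run, the dictionary key would presumably need to include ScientificName as well, so that each name keeps its own best row per source.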
from gnverifier.
Hmmm... isn't that what -s 1,2,3 -M is doing?
from gnverifier.
Only if there is just one match per source. For example:
✦ ❯ gnverifier "Jsoetes longissimum" -s 158 -M -q
Kind,SortScore,MatchType,EditDistance,ScientificName,MatchedName,MatchedCanonical,TaxonId,CurrentName,Synonym,DataSourceId,DataSourceTitle,ClassificationPath,Error
BestMatch,8.79803,Fuzzy,1,Jsoetes longissimum,Isoetes longissimum Bory,Isoetes longissimum,144750512,Isoetes longissimum Bory,false,158,EUNIS,,
SortedMatch,7.96012,Fuzzy,3,Jsoetes longissimum,Isoetes longissima Bory,Isoetes longissima,144848706,Isoetes longissima Bory,false,158,EUNIS,,
returns several matches from the same source.
from gnverifier.
I see. It is no problem to filter by best score in a small script, as you mentioned, so no request from me; your workload is substantial enough! 😄
from gnverifier.
Hmmm... one last annoying question: how can I obtain the equivalent of
gnverifier "Iris pallida" -s 1,2,3,4,179 -M -q
using gnfinder? I think there is no -M in gnfinder.
from gnverifier.
Fine, no worries. In the meantime I'll do it the "old school" way, running gnverifier on top of gnfinder as I did a long time ago :)
from gnverifier.
Added gnames/gnfinder#116
from gnverifier.