I'm impressed by the accuracy of Lingua as compared to even fasttext, but it would be

Please provide performance metrics in the benchmarks (lingua-py issue, closed)

pemistahl commented on July 30, 2024

Please provide performance metrics in the benchmarks

from lingua-py.

Comments (5)

nickchomey commented on July 30, 2024

Thanks, I'll have to give that a try and share some rough results here. I do think it would be nice/useful to present such stats in the official benchmark comparisons as there's no way to know what "noticeably slower" means. I know that Fasttext and cld2 tend to be exceptionally fast, so perhaps noticeably slower is still quite acceptable. But if it's a difference of 0.001s vs 1s, then obviously that's a problem.

pemistahl commented on July 30, 2024

In chapter 9.5 of the README it says:
Lingua's high detection accuracy comes at the cost of being noticeably slower than other language detectors.

The statistical models in Lingua are larger than those of similar libraries. So querying them takes more time.

There is a benchmark script in this repo which gives you a clue how performant the library is. You can run it locally with poetry:

poetry run python3 scripts/benchmark.py
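
For a rough number to compare against other detectors without the full benchmark script, a small timing harness can be enough. The sketch below is an illustration only and is not the repo's scripts/benchmark.py: the helper name mean_seconds_per_call, the stand-in dummy_detect function, and the sample texts are all hypothetical. It times any callable that takes a string.

```python
import time
from typing import Callable, Iterable


def mean_seconds_per_call(detect: Callable[[str], object],
                          texts: Iterable[str],
                          repeats: int = 3) -> float:
    """Return the mean wall-clock time of one detect(text) call, in seconds."""
    texts = list(texts)
    if not texts or repeats < 1:
        raise ValueError("need at least one text and one repeat")
    start = time.perf_counter()
    for _ in range(repeats):
        for text in texts:
            detect(text)
    elapsed = time.perf_counter() - start
    return elapsed / (repeats * len(texts))


# Hypothetical stand-in detector so the sketch runs without any library installed.
def dummy_detect(text: str) -> str:
    return "en" if " the " in f" {text.lower()} " else "unknown"


samples = ["The quick brown fox", "Der schnelle braune Fuchs"]
per_call = mean_seconds_per_call(dummy_detect, samples, repeats=100)
print(f"{per_call * 1e6:.2f} microseconds per call")
```

With lingua installed, the same helper could presumably be given detector.detect_language_of, where detector comes from LanguageDetectorBuilder.from_all_languages().build(). Calling the detector once before timing keeps any one-time model loading out of the measurement.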

datatalking commented on July 30, 2024

@nickchomey I'm relatively new to this repo, but it has more languages than the translation repo I have been using. I could help test and show an "output chart", or help craft and submit a PR for this, so I'm willing to collaborate with you on a few options for generating the stats.

nickchomey commented on July 30, 2024

@datatalking this isn't a focus for me at the moment and probably won't be for at least a few months, so I'm not able to collaborate on anything. But if you have the time and desire to do so, that would be great!

pemistahl commented on July 30, 2024

Performance metrics are now provided in the README.
