Comments (1)
The library was initially optimized for speed. That's why we only use 320 n-grams for comparison. If you want to improve the results you have to train the library with around 9000 n-grams first.
After you trained the library with your specified number of n-grams, you can use the library this way.
<?php
$detector = new LanguageDetection\Language();
$detector->setMaxNgrams(9000);
var_dump($detector->detect("Please delete my account.")->close());
My results:
["en"]=>
float(0.88903898635478)
["ca"]=>
float(0.81949122807018)
["ia"]=>
float(0.81797076023392)
["ro"]=>
float(0.81117348927875)
["fy"]=>
float(0.81021247563353)
Also you need to consider that's it's not easy to determine the language on such small sentences. Many languages have the same alphabet and you will not find much differences between these languages.
from language-detection.
Related Issues (20)
- Support for Kazakh language
- the word "LOL" is not an english word ? HOT 1
- Compatible for PHP 8 HOT 2
- Language detection with php 5.6 HOT 4
- The detected languages seem wrong very often HOT 2
- where is project amdvbflash? HOT 2
- English text recognition HOT 2
- Feature Request - Min language's values
- Detection of english string does not work correctly HOT 8
- What's the right way of checking whether or not the text is in a specific language? HOT 2
- Deprecation notice with PHP 8.1 HOT 4
- What dataset? HOT 2
- Unable to detect Chinese if there is only 1 character HOT 1
- Incorrect language is being returned for specific words
- Testing
- How can I add a new language?
- Can you recommend any article data to train better? The default data is too small
- Is there any way to get the full name of the language along with the language code?
- "ia" language?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from language-detection.