Comments (11)
Greetings, this is a great initiative! I'll try to get involved in refining the dictionary.
One question: the dictionary (ka_GE.dic) contains entries such as "ჰექტარი/NSNN". What does NSNN mean here? Is it needed?
Yes, it is needed! "/NSNN" is the dictionary entry's "affix" part (see ka.aff).
from ka_ge.spell.
`man 5 hunspell` describes how affix compression works.
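For anyone who wants to look at these entries programmatically, here is a minimal Python sketch that separates the word from the part after the slash. The function name `split_dic_entry` is made up for illustration and is not part of the project. It assumes one entry per line, no escaped slashes, and no trailing morphological fields; how the flag string itself is read (single letters or pairs) depends on the settings in ka.aff, so the sketch does not interpret it.

```python
def split_dic_entry(line: str) -> tuple[str, str]:
    """Split a .dic line such as 'ჰექტარი/NSNN' into (word, flags).

    Assumption: the first '/' separates the word from its affix flags.
    Entries without a slash get an empty flag string.
    """
    word, _, flags = line.strip().partition("/")
    return word, flags


if __name__ == "__main__":
    word, flags = split_dic_entry("ჰექტარი/NSNN")
    print(word, flags)  # prints: ჰექტარი NSNN
```

Removing the flags would leave only the base form in the dictionary, so the derived forms generated through ka.aff would no longer be accepted; that is why the suffix is needed.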
Would you like me to fork your project and contribute to it?
Use Sublime Text with a regex on Sandro's dictionary to select the JSON brackets and all commas, remove them, and put each word on its own line. For the XML list, use a regex to select and remove all string tags. Note that Atom does not cope well with selecting and removing text in big files.
Afterwards, use the compare plugins in Atom, Sublime Text, and Visual Studio Code to compare the dictionaries:
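The same clean-up can also be scripted instead of done in an editor. The sketch below is only an illustration under assumptions: it supposes the JSON file is a flat array of strings and that the XML file wraps each word in a `<string>` element. Neither layout is confirmed in this thread, and the function names are hypothetical. For the JSON case it uses Python's `json` module rather than a regex.

```python
import json
import re


def words_from_json(text: str) -> list[str]:
    """Return the words of a JSON array of strings (assumed layout)."""
    return [w.strip() for w in json.loads(text)
            if isinstance(w, str) and w.strip()]


def words_from_xml(text: str) -> list[str]:
    """Return the text inside every <string>...</string> element (assumed layout)."""
    pattern = r"<string(?:\s[^>]*)?>(.*?)</string>"
    return [w.strip() for w in re.findall(pattern, text, flags=re.DOTALL)
            if w.strip()]


if __name__ == "__main__":
    # One word per line, ready for comparison with another list.
    print("\n".join(words_from_json('["ა", "ბ"]')))
```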
Atom
https://atom.io/packages/compare-files
https://atom.io/packages/split-diff
Sublime Text
https://forum.sublimetext.com/t/how-to-diff-in-sublime-3/25335/9
Visual Studio Code
http://dailydotnettips.com/2015/06/04/how-to-compare-files-in-visual-studio-code/
Regarding 0xh3x (Giorgi Jvaridze) / scraped-words:
It includes words scraped from these sources:
- Georgian wiki taken from http://dumps.wikimedia.org/
- http://buki.ge/library-list.html
- http://lib.ge/
- http://www.nplg.gov.ge/dlibrary/
- http://forum.ge/
One thing to keep in mind is that the scraping was done 6 years ago. So even if other word lists draw on the same sources, they may be more recent and contain more words.
blogpost with more info
Would you like me to fork your project and contribute to it?
If you'd like to help, I'd be happy! However, comparing the word list with my dictionary doesn't make much sense. If you really want to go further than just removing the JSON formatting, you'll need to be able to run the build scripts (see README.md for a short description of the requirements). The comparison then has to be made against Bumbeishvili's words from the database and Scannell's in words/
(Read the readme files wherever you find them. There is not much documentation, but it might help a little.) This comparison can be done automatically.
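To give an idea of what an automatic comparison could look like, here is a rough Python sketch. It is not the project's build script. `compare_word_lists` is a hypothetical helper that takes two lists of words, drops anything after a "/" (assumed to be affix flags), and reports which words appear in only one list or in both.

```python
def compare_word_lists(ours, theirs):
    """Compare two word lists, ignoring any '/FLAGS' suffix on an entry."""
    def normalise(words):
        # Keep only the part before the first '/', skip empty lines.
        return {w.strip().partition("/")[0] for w in words if w.strip()}

    ours_set, theirs_set = normalise(ours), normalise(theirs)
    return {
        "only_ours": sorted(ours_set - theirs_set),
        "only_theirs": sorted(theirs_set - ours_set),
        "shared": sorted(ours_set & theirs_set),
    }


if __name__ == "__main__":
    result = compare_word_lists(["ა/NS", "ბ"], ["ბ", "გ"])
    print(result["only_theirs"])  # candidate words missing from our list
```

Plain set operations ignore the fact that one flagged entry stands for many inflected forms, so the result is only a first approximation.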
Before you put any effort into including a word list, make sure it is published under a license that allows us to integrate it into the dictionary and distribute it under the MIT license (contact the author if needed, as is the case with sandrinio).
Sorry for the long text. I just want to make sure your work isn't in vain in the end...
I checked GBoard on my phone. Its "About" screen says it is under open-source licences: some components under the OFL, others under the BSD licence, and others under the Apache licence. I extracted the GBoard package and found the licence file, which says GBoard is under the OFL. I have attached GBoard's licence file for you.
OFL = Open Font License - so probably GBoard contains some fonts under this license. The software itself seems to be proprietary, using libraries under BSD and Apache.
Ah, I was wrong. So what I mean is that all the libraries are under BSD and Apache licences. I'll contact the Google team and pass their answer on to you, OK?
I'll need to get to grips with those affix flags.
spell.on.ge: have you been in touch with them? Perhaps you could join forces.