hindi-hunspell's Issues

Add license statement

Hi guys,

this is an awesome piece of work!
I think the LibreOffice project would be very interested in adding it to its spellcheckers.
Their vocabulary is no match for this one.
I think the only thing missing is a license statement (maybe the MIT license?).

Best,
Barney

Path of the saved dictionary files does not match the one given in the config file

Correct your dictionary as shown below:

## Download and unzip your file
# mkdir hindi
# cd hindi
# wget https://github.com/Shreeshrii/hindi-hunspell/raw/master/hi_spellchecker_OOo3.oxt
# unzip hi_spellchecker_OOo3.oxt 

## Note the path mentioned in config file
# grep origin dictionaries.xcu
<value>%origin%/dictionaries/hi_IN.aff %origin%/dictionaries/hi_IN.dic</value>

## affix and dict files should be in a folder and not at root
## Create a folder and move dict files
# mkdir dictionaries
# mv hi_IN.* dictionaries/
# cd dictionaries

## Add the number of lines (from wc -l) as the first line of the dict file
# wc -l hi_IN.dic
207694 hi_IN.dic
# vi hi_IN.dic

## Add SET UTF-8 as the first line of the affix file
# vi hi_IN.aff

## zip the file and test with libreoffice
# cd ..
# zip -r hindi.oxt *
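
The two manual `vi` edits and the path check above can also be scripted. The following is a minimal sketch, not part of the original package: the function names `prepend_count`, `prepend_set_utf8` and `check_xcu_paths` are hypothetical helpers, and it assumes the `.dic` file has one entry per line with a trailing newline, so that `wc -l` gives the entry count.

```shell
#!/bin/sh
# Hypothetical helpers (names are illustrative, not from hindi-hunspell).

# Put the line count on the first line of a .dic file.
# Skips the file if its first line is already a bare number.
prepend_count() {
    dic="$1"
    if head -n 1 "$dic" | grep -Eq '^[0-9]+$'; then
        return 0
    fi
    count=$(wc -l < "$dic" | tr -d ' ')
    tmp=$(mktemp)
    { printf '%s\n' "$count"; cat "$dic"; } > "$tmp"
    cat "$tmp" > "$dic"   # overwrite in place, keeping the file's permissions
    rm -f "$tmp"
}

# Put "SET UTF-8" on the first line of an .aff file.
# Skips the file if it already declares an encoding with SET.
prepend_set_utf8() {
    aff="$1"
    if grep -q '^SET ' "$aff"; then
        return 0
    fi
    tmp=$(mktemp)
    { printf 'SET UTF-8\n'; cat "$aff"; } > "$tmp"
    cat "$tmp" > "$aff"
    rm -f "$tmp"
}

# Check that every %origin%/... path named in dictionaries.xcu exists
# relative to the directory holding the .xcu file.
# Assumes the paths contain no spaces.
check_xcu_paths() {
    xcu="$1"
    root=$(dirname "$xcu")
    status=0
    for rel in $(grep -o '%origin%/[^ <]*' "$xcu" | sed 's|^%origin%/||'); do
        if [ ! -f "$root/$rel" ]; then
            printf 'missing: %s\n' "$rel"
            status=1
        fi
    done
    return $status
}
```

Run from the unzipped extension root, after moving the files into `dictionaries/`, this would be `prepend_count dictionaries/hi_IN.dic`, `prepend_set_utf8 dictionaries/hi_IN.aff` and `check_xcu_paths dictionaries.xcu`, followed by the `zip` step.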

Unexpected behaviour of hi_IN dictionary

I tried the hi_IN dictionary with Hunspell (v1.7.0) on macOS 10.13. It produces incorrect results for words both inside and outside the dictionary.

Hunspell 1.7.0
शब्द
# शब्द 0

द्वारा
# द्व 0
# र 4

ढूंढेगा
# ढूंढेग 0

आवश्यकता
# आवश्यकत 0

कोशिश
# क 0
# श 2
# श 4

बिंदु
# ब 0
# ंदु 2

दुनिया
# ब 0
# ंदुन 2
# य 7

बनाना
# बन 0
# न 3

अपरिमाप्य
# अपर 0
# म 4
# प्य 6

I have also noticed that the words are incorrectly split wherever a mAtra (such as ि) occurs. This may contribute to the incorrect results.
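
One thing worth ruling out when words are split at combining signs is the affix file header. Hunspell reads the dictionary encoding from a `SET` line in the `.aff` file, and its `WORDCHARS` option extends the command-line tokenizer with extra word characters. The sketch below only reports whether those two lines are present. The function name `check_aff` is hypothetical, and a missing line is a guess at the cause, not a confirmed diagnosis of this issue.

```shell
#!/bin/sh
# Hypothetical diagnostic: report whether an .aff file declares
# SET UTF-8 and a WORDCHARS line.
check_aff() {
    aff="$1"
    if grep -q '^SET UTF-8' "$aff"; then
        echo "SET UTF-8: present"
    else
        echo "SET UTF-8: missing"
    fi
    if grep -q '^WORDCHARS ' "$aff"; then
        echo "WORDCHARS: present"
    else
        echo "WORDCHARS: missing"
    fi
}
```

For example, `check_aff hi_IN.aff` prints one line for each of the two options.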

How to make a hunspell file for sanskrit?

Suppose that I have a Sanskrit pada (word) list in devanAgarI. How do I make a hunspell file from it? Detailed step-by-step instructions would be very helpful. नमः पार्वतीपतये 🙏
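
As a starting point, a Hunspell dictionary is a pair of files: a `.dic` file whose first line is the (approximate) entry count followed by one word per line, and an `.aff` file that at minimum declares the encoding. The sketch below builds such a pair from a plain UTF-8 word list. The function name `make_hunspell_dict` and the output name `sa_IN` are hypothetical, and it produces no affix rules, so every inflected form has to be present in the list itself.

```shell
#!/bin/sh
# Hypothetical helper: make_hunspell_dict WORDLIST NAME
# Writes NAME.dic (count line + unique words) and a minimal NAME.aff.
# Assumes WORDLIST is UTF-8 with one word per line and no "/" in any word
# (Hunspell reads a slash as the start of affix flags).
make_hunspell_dict() {
    wordlist="$1"
    name="$2"
    tmp=$(mktemp)
    # Drop blank lines and duplicates; LC_ALL=C gives a stable byte-wise order.
    grep -v '^[[:space:]]*$' "$wordlist" | LC_ALL=C sort -u > "$tmp"
    count=$(wc -l < "$tmp" | tr -d ' ')
    { printf '%s\n' "$count"; cat "$tmp"; } > "$name.dic"
    # Minimal affix file: only the encoding declaration.
    printf 'SET UTF-8\n' > "$name.aff"
    rm -f "$tmp"
}
```

Calling `make_hunspell_dict padas.txt sa_IN` (with `padas.txt` standing in for the pada list) writes `sa_IN.dic` and `sa_IN.aff` to the current directory. Affix rules for Sanskrit morphology could be added to the `.aff` file later.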
