Ralf’s Nepali dictionary

Here is how I create Ralf's Nepali dictionary:

1. Get spelling dictionary, license is GPLv3 compatible.

2. Generating word list via Ubuntu terminal:

am3msi@am3msi-desktop:~/Documents/201004/nepali-dictionary$ unmunch ne_NP.dic ne_NP.aff > nepali-wordlist

The object file is too big (523 MB). I will only use ne_NP.dic for the PLS dictionary creation.

3. Add <lexicon> tag at the beginning (<lexicon>) and at the end (</lexicon>) of ne_NP.dic with gedit.

4. Creating .xml file with Nepali <lexeme> elements:

am3msi@am3msi-desktop:~/Documents/201004/nepali-dictionary$ saxonb-xslt -ext:on -s:ne_NP.dic -xsl:'http://spirit.blau.in/simon/files/2010/04/create-xml-file.xsl' -o:nepali.xml

5. Obviously, I Nepali is written in the Devanagari script. This means that I can use this table for vowel conversion, and this table for consonant conversion.

6. Generating Nepali <phoneme> elements:

~/Documents/201004/nepali-dictionary$ saxonb-xslt -ext:on -s:nepali.xml -xsl:'improve-nepali-dictionary.xsl' -o:nepali-0.1.xml

7. Download Ralf's Nepali dictionary, and import it into simon. Take a look at the imported dictionary:

nepali

Left column: Nepali words
Right column: Nepali SAMPA phonemes

Tags: ,

Comments are closed.