Take a look at Ralf's Latin IPA FLAC files (section: xaa). I want to build a speech model that recognizes these words.
1. I have to prepare a prompts file:
$ cat /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/lexicon-xaa.xml | saxonb-xslt -ext:on -s:- -xsl:/home/ubuntu/Documents/201008/latin-0.1.2/lexiconxaa2prompts.xsl
2. Remove old files:
$ rm -i -r /home/ubuntu/.kde/share/apps/simon
rm: descend into directory `/home/ubuntu/.kde/share/apps/simon'? y
rm: remove directory `/home/ubuntu/.kde/share/apps/simon/model'? y
rm: remove regular file `/home/ubuntu/.kde/share/apps/simon/shadowvocabulary.xml'? y
rm: remove regular file `/home/ubuntu/.kde/share/apps/simon/protocol.log'? y
rm: remove directory `/home/ubuntu/.kde/share/apps/simon'? y
3. Start simon 0.3.0. Manage scenarios > New. Name of the new scenario: latin-xaa
4. Vocabulary > Import dictionary > Target: Active dictionary > PLS lexicon > File: /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/lexicon-xaa.xml
5. Grammar > Add sentence > Add structure: Unknown
6. Training > Import training data > Import Prompts > Prompts: /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/prompts-xaa
It is necessary to convert the FLAC files into WAV format via Ubuntu terminal:
mkdir /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/wav-xaa
cd /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/flac-xaa && \
for f in *.flac; do sox "$f" "/home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/wav-xaa/${f%.flac}.wav"; done
Import Prompts > Base directory: /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/wav-xaa > Importing 899 files
7. Add dictation plug-in.
8. Start ksimond. Connect with ksimond. Press Connect. Press Synchronize. Press Activate. The following error message appears:
Could not start recognition because the system reports that the recognition is not ready.
Please check if you have defined a vocabulary, an appropriate grammar and recorded a few trainings samples.
The system will then, upon synchronization, generate the model which will be used for the recognition.
What did I forget? I have an active vocabulary (each word has a recognition rate of 1). I have defined a grammar (Unknown). I have imported training data. And I have the dictation plugin. I check Settings > Configure simon > Model settings > User generated model. So everything should be clear.
9. I don’t know what went wrong. Any hint?
[see comments]
12. Download Latin speech model 'xaa'.