Ralf’s German speech model 0.1.5

How I create Ralf's German speech model version 0.1.5 (download):

1. Delete old directories (cp-xmb):

rm -r /home/ubuntu/.kde/share/apps/simon
rm -r /home/ubuntu/.kde/share/apps/simond
rm /home/ubuntu/.kde/share/config/sscrc
rm /home/ubuntu/.kde/share/config/speechmodelmanagementrc
rm /home/ubuntu/.kde/share/config/simonsoundrc
rm /home/ubuntu/.kde/share/config/simonscenariosrc
rm /home/ubuntu/.kde/share/config/simonrecognitionrc
rm /home/ubuntu/.kde/share/config/simonrc
rm /home/ubuntu/.kde/share/config/simoncommandrc
rm /home/ubuntu/.kde/share/config/samrc
rm /home/ubuntu/.kde/share/config/ksimondrc

2. Ubuntu terminal:

ubuntu@ubuntu-desktop:~$ saxonb-xslt -ext:on -s:/home/ubuntu/Documents/201006/audacity/xmb-folder/lexicon-xmb.xml -xsl:/home/ubuntu/Documents/201006/audacity/copy-flac.xsl -o:/home/ubuntu/Documents/201006/audacity/xfa-folder/cp-dummy

This command produced the file:///home/ubuntu/Documents/201008/lexicon-all.xml with 24000 German words.

3. Concatenate all prompts in one single file (cp-xmb):

cat /home/ubuntu/Documents/201006/audacity/xaa-folder/prompts-xaa \
/home/ubuntu/Documents/201006/audacity/xab-folder/prompts-xab \
/home/ubuntu/Documents/201006/audacity/xac-folder/prompts-xac \
[...]
/home/ubuntu/Documents/201006/audacity/xbm-folder/prompts-xbm \
/home/ubuntu/Documents/201006/audacity/xmb-folder/prompts-xmb \
> /home/ubuntu/Documents/201008/prompts-all

4. Convert FLAC files into wav format (cp-xmb):

cd /home/ubuntu/Documents/201006/audacity/xaa-folder/flac-xaa && \
for f in *.flac; do sox "$f" /home/ubuntu/Documents/201008/wav-all/${f%.flac}.wav; done && \
cd /home/ubuntu/Documents/201006/audacity/xab-folder/flac-xab && \ [...]
for f in *.flac; do sox "$f" /home/ubuntu/Documents/201008/wav-all/${f%.flac}.wav; done

5. Start simon. Manage scenarios > New.
6. Simon > Vocabulary: Import lexicon-all.xml (this dictionary file contains 24000 German words). Import as active dictionary.
7. Grammar > Add sentence: Unknown
8. Training > Import training data:
/home/ubuntu/Documents/201008/prompts-all
/home/ubuntu/Documents/201008/wav-all
Importing 24,696 Files...
9. An error message appears:

Could not copy /home/ubuntu/Documents/201008/wav-all/mɪttʀaːgəndɐ.wav to /home/ubuntu/.kde/share/apps/simon/model/training.data//mɪttʀaːgəndɐ_9445_2010-08-25_19-12-34.wav. Please check if you have all the needed permissions.

What is the reason for this error message? I have to find out.

10. After solving some issues, the speech model is now available for download. It contains 24000 German words.

Tags:

Comments are closed.