I tested with sam a German sentence that is part of the Voxforge collection:
1. I took a look into http://script.blau.in/german/27/prompts.xml. Then I downloaded ralfherzog-20071213-de27.tgz (6.2 MB). It contains FLAC files in 16 kHz / 16bit. I don’t know how I can convert FLAC files (48 kHz) into wav files (16 kHz). So I take the Voxforge version with 16 kHz.
2. After unpacking, I converted file:///home/liberty/200908/ralfherzog-20071213-de27/flac/de27-60.flac with SoundConverter into
file:///home/liberty/.kde/share/apps/simon/model/samtestwav.data/de27-60.wav.
3. I changed the test prompts base path to /home/liberty/.kde/share/apps/simon/model/samtestwav.data.
4. I pressed the Test model button. This is the test log (emphasis by me):
/usr/bin/sox -2 -s -L /home/liberty/.kde/share/apps/simon/model/samtestwav.data/de27-60.wav /home/liberty/.kde/tmp-liberty-desktop/sam/internalsamuser/test/samples/de27-60.wav
Preperation
Recoding audio...
Generating MLF...
Recognizing...
Prompts entry: JEDER HAT EIN STÜCK BEKOMMEN
Received recognition result for: /home/liberty/.kde/tmp-liberty-desktop/sam/internalsamuser/test/samples/de27-60.wav: Therapien
Analyzing recognition results...
Finished
5. All words (jeder, hat, ein, Stück, bekommen) are part of my simon shadow dictionary. But they aren’t part of the active lexicon yet.
6. The recognition rate of the individual words and of the whole sentence was 0. Maybe I should train these words with simon first, and then try again.








