Posts Tagged ‘revision 897’

“JEDER HAT EIN STÜCK BEKOMMEN”

Tuesday, August 11th, 2009

I tested with sam a German sentence that is part of the Voxforge collection:

1. I took a look into http://script.blau.in/german/27/prompts.xml. Then I downloaded ralfherzog-20071213-de27.tgz (6.2 MB). It contains FLAC files in 16 kHz / 16bit. I don’t know how I can convert FLAC files (48 kHz) into wav files (16 kHz). So I take the Voxforge version with 16 kHz.

2. After unpacking, I converted file:///home/liberty/200908/ralfherzog-20071213-de27/flac/de27-60.flac with SoundConverter into
file:///home/liberty/.kde/share/apps/simon/model/samtestwav.data/de27-60.wav.

3. I changed the test prompts base path to /home/liberty/.kde/share/apps/simon/model/samtestwav.data.

4. I pressed the Test model button. This is the test log (emphasis by me):

/usr/bin/sox -2 -s -L /home/liberty/.kde/share/apps/simon/model/samtestwav.data/de27-60.wav /home/liberty/.kde/tmp-liberty-desktop/sam/internalsamuser/test/samples/de27-60.wav
Preperation
Recoding audio...
Generating MLF...
Recognizing...
Prompts entry: JEDER HAT EIN STÜCK BEKOMMEN
Received recognition result for: /home/liberty/.kde/tmp-liberty-desktop/sam/internalsamuser/test/samples/de27-60.wav: Therapien
Analyzing recognition results...
Finished

5. All words (jeder, hat, ein, Stück, bekommen) are part of my simon shadow dictionary. But they aren’t part of the active lexicon yet.

6. The recognition rate of the individual words and of the whole sentence was 0. Maybe I should train these words with simon first, and then try again.

I changed the field Test-Prompts

Tuesday, August 11th, 2009

After reading these instructions, I created the file /home/liberty/.kde/share/apps/simon/model/prompts_test. It has the following content:

Hamburgern_S2_2009-07-19_22-44-21 HAMBURGERN
aufgenommen_S2_2009-07-25_11-36-59 AUFGENOMMEN
Fraktionen_S1_2009-07-19_19-33-01 FRAKTIONEN
breiteren_S2_2009-07-24_15-23-54 BREITEREN
Expansionen_S2_2009-07-19_18-05-20 EXPANSIONEN
Hannovers_S2_2009-07-19_23-07-09 HANNOVERS
Hannovers_S1_2009-07-19_23-07-02 HANNOVERS

Then I changed the input file for the field Test-Prompts:

test-prompts

The path for the wav prompts and the wav test prompts is the same.

Here is a small excerpt from the log (emphasis by me):

/usr/bin/sox -2 -s -L /home/liberty/.kde/share/apps/simon/model/training.data//Hamburgern_S2_2009-07-19_22-44-21.wav /home/liberty/.kde/tmp-liberty-desktop/sam/internalsamuser/test/samples/Hamburgern_S2_2009-07-19_22-44-21.wav

The wav files that are mentioned by prompts_test are being copied to a sub-folder of internalsamuser. This is a temporary folder.

And this is the test result:

sentences

sam couldn’t open julius jconf file

Tuesday, August 11th, 2009

I want to figure out how sam works.

jconf

1. I pressed Test model.
2. About 30 seconds later the error message Couldn't open julius jconf file: "". appeared.
3. I guess that I have to define a path for the JConf file. I think that I will give the following file a try: /home/liberty/200908/speech2text/trunk/simond/default.jconf.

By the way, forget about the red question mark in the screen shot of the previous post. The paths are as follows:
/usr/share/kde4/apps/simon/model/tree.hed
/usr/share/kde4/apps/simon/model/wav_config

I am getting the following error message:

recognizing

I get this error message even if I activate/connect simon/ksimond. I guess I have to use a different default.jconf file (located at a different location) because “relative paths must be relative to THIS FILE“.

I am giving the following location a try: /home/liberty/.kde/share/apps/simond/models/a/active/julius.jconf. It seems to work:

test-log

And this could be the overall recognition rate. But it isn’t. Just about 49 % were recognized correctly. I think that I will listen to some of the wav files. Maybe I will have to throw away some of the training samples?

sam: path of the input files

Monday, August 10th, 2009

Let’s take a look into the Voxforge forum (emphasis by me):

“replacing the model files in ~/.kde/share/apps/simond/models//active with the voxforge model files will work (I actually already tested that a while ago).”

Let’s compare the path with the paths that are proposed by sam:

sam

Most of the paths are clear (marked in green, blue, brown). But where are the input files located that I marked with red color (tree.hed, wav_config, JConf)?

I hope that there will be a sam handbook, too.

Let’s compare the path proposed by sam home/liberty/.kde/share/apps/simon/model with the path mentioned in the Voxforge post quoted in the beginning of this article: simond is different from simon.

And what is with “the option of automatically downloading a recent voxforge snapshot“? Obviously, at the moment it isn’t possible to download directly a speech model from Voxforge.