Posts Tagged ‘WordListDate’

Fresh start with the German speech model

Monday, August 24th, 2009

I am making a fresh start with the German speech model. I renamed the folder file:///home/liberty/.kde/share/apps/simon/model into file:///home/liberty/.kde/share/apps/simon/model-backup-20090824. So I can access my already recorded wav files (and the other files). The result of this operation is that the dictionary is now empty.

I am now importing the PLS dictionary /home/liberty/200908/voxDE20090209-modified.xml to the active lexicon. During the import process, the following three files were created:

1. file:///home/liberty/.kde/share/apps/simon/model/modelsrcrc

WordListDate=2009,8,24,3,59,11

2. file:///home/liberty/.kde/share/apps/simon/model/model.voca

% NS_B
<s>    sil
% NS_E
</s>    sil
% Unknown
übten    y: p t @ n
übte    y: p t @
übt    y: p t
übst    y: p s t
üblichsten    y: p l I C s t n=
[...]

3. file:///home/liberty/.kde/share/apps/simon/model/lexicon

A [A] a:
AACHEN [Aachen] a: x @ n
AACHEN [Aachen] a: x N
AACHEN [Aachen] a: x n=
AACHENS [Aachens] a: x @ n s
AACHENS [Aachens] a: x n= s
AB [ab] a p
ABBAU [Abbau] a p b aU
[...]

I can say that sam helped me to understand what is happening internally in simon. So it seems that I am on the right path. What is to do next?

My goal is to import wav files with the corresponding prompts.

I just defined a grammar. The sentence structure is just “Unknown”. I have only one kind of word: Unknown. Very simple. All words are of the category Unknown. I want to keep it as simple as possible.

OK, there seems to be something wrong. I just synchronized the speech model. And the wav files reappeared. This wasn’t intended. I think that maybe I should rename the folder /usr/share/kde4/apps/simon/model. But the owner is root. Maybe I should use the command gksudo nautilus. I renamed the folder into /usr/share/kde4/apps/simon/model-backup-20090824.

I don’t know how to reset the German speech model. simon just recognized correctly the words “aufnehmen”, and “sehr”. That is not too bad. But other words weren’t recognized.

How can I delete the German speech model? I just want to keep a backup of the wav files, and of the prompts.