Let’s take a look at this file (ISO-8859-1):
<word> <name>Abbrechen</name> <pronunciation>gls a p b R E C @ n</pronunciation> <terminal>SimpelKommando</terminal> </word> <word> <name>Acht</name> <pronunciation>a x t</pronunciation> <terminal>Nummer</terminal> </word>
Obviously, simon does now import the glottal stop. The word Abbrechen is with glottal stop. The word Acht is without glottal stop. More info:
Introduced the “gls” phoneme for a glotal stop (“?” in SAMPA).
Great. This should improve the recognition rate when you use Ralf's German dictionary for training. As far as I know, the German BOMP dictionary does have glottal stops, too.
The glottal stop is a difficult phoneme. I think that it is a good decision if simon supports the glottal stop.
The abbreviation gls is better than kn.
Additional thought: Maybe it is possible to develop a simon scenario (XML file) with an .xsl style-sheet? You all know that I am primarily interested in dictation, so a simon scenario with let’s say 1.000 German words would be an interesting start. This should be possible, but I don’t want to invest too much time. So I am looking for a way to get quick results.
My dream is an XML-based framework:
- SSML prompts that link to FLAC audio files;
- Ralf's German dictionary (PLS);
- an .xsl style-sheet that accesses e.g. 99 SSML prompts for training (source file), and that chooses the corresponding words from the PLS dictionary (source file);
- the object file could be a simon scenario.
The command in the Ubuntu terminal would look as follows:
$ saxonb-xslt -ext:on -s:'http://script.blau.in/german/35/prompts.xml' -xsl:'create-simon-scenario.xsl' -o:simon-scenario.xml
What do you think? It is possible to access files that are located in the Internet directly from the Ubuntu terminal, and process them.
> Additional thought: Maybe it is possible to develop a simon scenario (XML file) with an .xsl style-sheet? You all know that I am primarily interested in dictation, so a simon scenario with let’s say 1.000 German words would be an interesting start.
I can do you one better: You can import dictionaries to your active vocabulary. Which means that you could import your PLS as the active dictionary of a scenario. Beware, tough that you probably won’t get the recognition to run on this without an insane amount of training.
The German Voxforge Repo is not enough – I tried
Greetings,
Peter