German speech models with 500+ words

My goal is to offer several German speech models derived from Ralf's German IPA FLAC files. The speech models will be named xaa, xab, xac, and so on. Each model will contain 500+ German words.

This will be a test series. I can’t guarantee that it will work out, but I think my concept should work. At least, I hope so.

One note to the simon developers: Could you please modify the dictation plug-in so that it inserts a space after each recognized word? That would help, because I would like to produce one or more videos, and I don’t want to hit the space bar after every word that has been recognized.

I have prepared a special style-sheet, compare.xsl, for this test series. It compares Ralf's German dictionary (source file #1) with Ralf's German IPA FLAC files, section xaa (source file #2), and outputs two result documents: a small PLS dictionary that I will import into simon as the active dictionary (result file #1), and a prompts file in HTK-compatible format (result file #2), so that I can use simon's import training data function.
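To make the idea of the comparison more concrete, here is a rough Python sketch of the same logic. It is not compare.xsl itself, only an illustration under assumptions: the dictionary is taken to be a simple mapping from word to IPA pronunciation, the recordings are taken to be (file id, word) pairs, and the prompts lines are assumed to have the form "file id, space, word". The function names compare and to_pls and the sample data are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace of the W3C Pronunciation Lexicon Specification (PLS) 1.0.
PLS_NS = "http://www.w3.org/2005/01/pronunciation-lexicon"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def compare(dictionary, recordings):
    """Keep only the recorded words that also occur in the dictionary.

    dictionary: dict mapping word -> IPA string (assumed structure).
    recordings: list of (file_id, word) pairs (assumed structure).
    Returns (small_dict, prompts): the reduced dictionary and the
    prompts lines, one line per usable recording.
    """
    small_dict = {}
    prompts = []
    for file_id, word in recordings:
        ipa = dictionary.get(word)
        if ipa is None:
            # No pronunciation available, so the recording is skipped.
            continue
        small_dict[word] = ipa
        # Assumed prompts layout: "<file id> <transcription>".
        prompts.append(f"{file_id} {word}")
    return small_dict, prompts


def to_pls(small_dict, lang="de"):
    """Serialize the reduced dictionary as a minimal PLS lexicon."""
    ET.register_namespace("", PLS_NS)
    lexicon = ET.Element(
        f"{{{PLS_NS}}}lexicon",
        {"version": "1.0", "alphabet": "ipa", f"{{{XML_NS}}}lang": lang},
    )
    for word, ipa in small_dict.items():
        lexeme = ET.SubElement(lexicon, f"{{{PLS_NS}}}lexeme")
        ET.SubElement(lexeme, f"{{{PLS_NS}}}grapheme").text = word
        ET.SubElement(lexeme, f"{{{PLS_NS}}}phoneme").text = ipa
    return ET.tostring(lexicon, encoding="unicode")


if __name__ == "__main__":
    # Made-up sample data, only for illustration.
    dictionary = {"Haus": "haʊs", "Baum": "baʊm", "Zug": "tsuːk"}
    recordings = [("xaa/0001", "Haus"), ("xaa/0002", "Zug"), ("xaa/0003", "Fisch")]
    small_dict, prompts = compare(dictionary, recordings)
    print(to_pls(small_dict))
    print("\n".join(prompts))
```

In this sketch, a word that has a recording but no dictionary entry is dropped from both outputs, so the small dictionary and the prompts file always cover the same set of words.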

The style-sheet is invoked in the Ubuntu terminal:

$ saxonb-xslt -ext:on -s:'/home/ubuntu/Documents/201006/german-0.1.9.5/german-0.1.9.8.xml' -xsl:'/home/ubuntu/Documents/201007/ipa-prompts/compare.xsl' -o:'/home/ubuntu/Documents/201006/audacity/xaa-folder/dummy.xml'

I hope that it will work as expected.
