To get an impression of how I create the German PLS dictionary, watch the video (19.2 MB, WMV):
[20100101: video removed]
Currently, I am preparing a new version of Ralf’s German dictionary. The dictionary should be 100% simon-compatible (version 0.1 contains some minor mistakes).
This is what I did yesterday:
1. I created more than 80,000 pronunciations with eSpeak from a set of 300,000 words. Not all words were transcribed; I don’t know what went wrong.
2. Then I created an XSLT stylesheet to transform the eSpeak phoneset into IPA with saxonb-xslt.
3. The result was a list of phonemes, but the graphemes were missing. What could I do? I decided to start dictating the missing graphemes with DNS 9.5. You can see the dictation process in the video.
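The missing-grapheme problem in step 3 can be avoided if each word stays paired with its pronunciation from the start. Here is a minimal Python sketch of that idea, not the workflow I actually used: it assumes two parallel lists (the words and their IPA strings, with an empty string where transcription failed) and writes the successful pairs as PLS lexemes. The function names `pair_up` and `build_pls` and the sample entry are hypothetical and only serve as illustration; the element names and namespace are those of the W3C PLS 1.0 format.

```python
import xml.etree.ElementTree as ET

# Namespace of the W3C Pronunciation Lexicon Specification (PLS) 1.0.
PLS_NS = "http://www.w3.org/2005/01/pronunciation-lexicon"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def pair_up(words, pronunciations):
    """Keep each word with its pronunciation; collect the words that got none.

    Hypothetical helper: assumes both lists are in the same order and that
    a failed transcription is represented by an empty string.
    """
    paired, missing = [], []
    for word, pron in zip(words, pronunciations):
        if pron:
            paired.append((word, pron))
        else:
            missing.append(word)
    return paired, missing


def build_pls(entries, lang="de"):
    """Build a PLS <lexicon> element from (grapheme, IPA phoneme) pairs."""
    ET.register_namespace("", PLS_NS)
    lexicon = ET.Element(
        f"{{{PLS_NS}}}lexicon",
        {"version": "1.0", "alphabet": "ipa", f"{{{XML_NS}}}lang": lang},
    )
    for grapheme, ipa in entries:
        lexeme = ET.SubElement(lexicon, f"{{{PLS_NS}}}lexeme")
        ET.SubElement(lexeme, f"{{{PLS_NS}}}grapheme").text = grapheme
        ET.SubElement(lexeme, f"{{{PLS_NS}}}phoneme").text = ipa
    return lexicon


if __name__ == "__main__":
    # Illustrative sample data only, not taken from the real word list.
    words = ["Haus", "Xyz"]
    prons = ["haʊs", ""]
    paired, missing = pair_up(words, prons)
    print(ET.tostring(build_pls(paired), encoding="unicode"))
    print("Not transcribed:", missing)
```

With the pairs kept together like this, the words that were not transcribed end up in a separate list that can be checked later, and nothing has to be dictated back in.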
Tags: node16, saxonb-xslt, video