“The server application is called simond and uses tcp/ip. Audio streaming over the network is supported. As only the server uses the HTK you could do a huge setup of simon with one main server that compiles the model thus further limiting the need for the HTK (it only has to be installed once, on the server side).”
Well, one major problem is that simon makes use of HTK which isn’t freely distributable. But a future workaround could be to use the internet for training (simond as server which internally makes use of HTK). Using TCP/IP seems to be a good idea.
—
Another idea would be to replace HTK by Sphinx. Is there anyone out there who is capable to do that?
—
This is what we need: Make simon work out of the box. What is necessary? A speech model that works out of the box. I will try to create one with sam that works with my own voice.
Tags: free software
Interesting experience: “testing out the server-side speech recognition”