September 2nd, 2010 by producer
This happens in my Ubuntu terminal:
ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 66, done.
remote: Compressing objects: 100% (41/41), done.
remote: Total 41 (delta 30), reused 0 (delta 0)
Unpacking objects: 100% (41/41), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating cbc744d..4ae4550
Fast-forward
CMakeLists.txt | 6 +-
cmake/FindPortaudio.cmake | 1 +
package/KDE/CMakeLists.txt | 92 +-
package/NSIS/anpassungen.nsi | 57 +-
simon/doc/de/index.docbook | 4183 +++++++++++++-------
simon/doc/en/index.docbook | 2 +-
simon/doc/po/simonmanual.pot | 1510 ++++----
simon/doc/po/simonmanual_de.po | 1979 +++++-----
.../src/simonmodelmanagementui/ImportDict/dict.cpp | 2 +-
.../ImportTrainingData/importtrainingdata.cpp | 31 +-
.../ImportTrainingData/importtrainingdata.h | 2 +
version.h | 2 +-
12 files changed, 4580 insertions(+), 3287 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$
Great:
“+ allowedFileTypes << "wav";
+ allowedFileTypes << "mp3";
+ allowedFileTypes << "ogg";
+ allowedFileTypes << "flac";"
I am planning to release ogg files (because I don’t have unlimited webspace, and because I think that ogg quality is OK for training).
Posted in Ubuntu | 1 Comment »
August 30th, 2010 by producer
Tags: de
Posted in Ubuntu, dictionary | No Comments »
August 30th, 2010 by producer
Here is the result after importing a dictionary that uses the IPA symbol [χ] instead of the ASCII symbol [x]:
The SAMPA symbol is upper-case [X], and not lower-case [x]. This is OK, I think.
The next version of Ralf's German dictionary will employ the IPA symbol [χ] instead of [x].
Posted in Ubuntu | No Comments »
August 30th, 2010 by producer
This is what I am doing in the Ubuntu terminal:
ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 15, done.
remote: Compressing objects: 100% (8/8), done.
remote: Total 8 (delta 7), reused 0 (delta 0)
Unpacking objects: 100% (8/8), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating 5853434..4d72b2a
Fast-forward
sam/src/samview.cpp | 2 +-
simonlib/simonmodeltest/fileresultmodel.cpp | 4 ++--
2 files changed, 3 insertions(+), 3 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh
I will see later whether the sam Edit sample button has been fixed.
Posted in Ubuntu | 2 Comments »
August 29th, 2010 by producer
A few seconds ago:
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ git pull origin masterremote: Counting objects: 17, done.
remote: Compressing objects: 100% (9/9), done.
remote: Total 9 (delta 8), reused 0 (delta 0)
Unpacking objects: 100% (9/9), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating afe2203..5853434
Fast-forward
simon/model/phonemes | 1 +
.../src/simonmodelmanagementui/ImportDict/dict.cpp | 3 ---
2 files changed, 1 insertions(+), 3 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh
Let’s take a look into sam:
1. Sorting is now working. Thanks for fixing that issue.
2. But the Edit sample button seems to be broken now.
I imported the words Hochkulturen (with IPA-[χ]) and Hochkultur (with ASCII-[x]), but only the word Hochkultur is being displayed:
After importing the words Hochkulturen (with UTF-8-[χ]) and Hochkultur (with ASCII-[x]), only one word remains. Obviously, the other word Hochkulturen has been omitted by simon. Maybe this is a simon security precaution.
Posted in Ubuntu | 1 Comment »
August 29th, 2010 by producer
This is what I did a few minutes ago because sorting has been fixed and because of the symbol [χ]:
ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 35, done.
remote: Compressing objects: 100% (18/18), done.
remote: Total 18 (delta 15), reused 0 (delta 0)
Unpacking objects: 100% (18/18), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating 97be4d2..afe2203
Fast-forward
sam/src/samview.cpp | 1 +
.../src/simonmodelmanagementui/ImportDict/dict.cpp | 3 +++
simonlib/simonmodeltest/fileresultmodel.cpp | 7 +++++--
3 files changed, 9 insertions(+), 2 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh
I want to see whether the Ach-Laut is imported correctly by importing these entries (first entry with ASCII-[x]; second entry with IPA-[χ]:
<lexeme role="Substantiv">
<grapheme>Hochkultur</grapheme>
<phoneme>hoːxkʊltuːʀ</phoneme>
<phoneme>hoːxkʊltuːɐ̯</phoneme>
</lexeme>
<lexeme role="Substantiv">
<grapheme>Hochkulturen</grapheme>
<phoneme>hoːχkʊltuːʀən</phoneme>
<phoneme>hoːχkʊltuːɐ̯n</phoneme>
</lexeme>
No, it didn’t work out:
The word Hochkultur is displayed with the correct SAMPA. But the word Hochkulturen doesn’t have the correct SAMPA.
Posted in Ubuntu | No Comments »
August 29th, 2010 by producer
A few minutes ago, I tested section ‘xea’ against all. The overall recognition rate was about 95%.
Take a closer look at the right column: The word DIVERGIERST has a recognition rate of 100%. The word DIN has a recognition rate of 11.9569%. You can see that the sorting is not correct.
Btw, it would be interesting to export the recognition results into a text file. What about adding an Export test results button?
Tags: sam
Posted in Ubuntu | 2 Comments »
August 29th, 2010 by producer
At the moment, I am testing with sam the speech model xea (contains about 800 German words) against Ralf’s German speech model 0.1.6 (contains about 25000 German words). The file:///home/ubuntu/Documents/201008/sam-0.1.6/test-xea-against-all.sam has the following content:
/home/ubuntu/.kde/share/apps/simon/model/hmmdefs
/home/ubuntu/.kde/share/apps/simon/model/tiedlist
/home/ubuntu/.kde/share/apps/simon/model/model.dict
/home/ubuntu/.kde/share/apps/simon/model/model.dfa
/home/ubuntu/Documents/201008/wav-all
/home/ubuntu/Documents/201006/audacity/xea-folder/wav-xea
/home/ubuntu/Documents/201008/sam-0.1.6/lexicon
/home/ubuntu/Documents/201008/sam-0.1.6/model.grammar
/home/ubuntu/Documents/201008/sam-0.1.6/simple.voca
/home/ubuntu/Documents/201008/prompts-wav.txt
/home/ubuntu/Documents/201006/audacity/xea-folder/prompts-xea
/usr/share/kde4/apps/simon/model/tree1.hed
/usr/share/kde4/apps/simon/model/wav_config
16000
/usr/share/kde4/apps/simond/default.jconf
2
Question: what does the 2 in the last line mean?
By the way, I used the Serialize scenarios button (input file:///home/ubuntu/Documents/201007/german-speech-model-0.1.6/scenario-0.1.6.xml) to generate the following files:
file:///home/ubuntu/Documents/201008/sam-0.1.6/lexicon
file:///home/ubuntu/Documents/201008/sam-0.1.6/model.grammar
file:///home/ubuntu/Documents/201008/sam-0.1.6/simple.voca
I think that it is a good idea to test a subset (e.g. xea) against the superset (xaa+xab+xac+…+xpw).
Tags: sam
Posted in Ubuntu | No Comments »
August 28th, 2010 by producer
How I create Ralf’s German speech model 0.1.6:
1. Concatenate the different prompts file into one single prompts file:
cat /home/ubuntu/Documents/201006/audacity/xaa-folder/prompts-xaa \
/home/ubuntu/Documents/201006/audacity/xab-folder/prompts-xab \ Read the rest of this entry »
Tags: german-speech-model
Posted in Ubuntu | No Comments »
August 26th, 2010 by producer
How I create Ralf's German speech model version 0.1.5 (download):
1. Delete old directories (cp-xmb):
rm -r /home/ubuntu/.kde/share/apps/simon
rm -r /home/ubuntu/.kde/share/apps/simond Read the rest of this entry »
Tags: german-speech-model
Posted in Ubuntu | No Comments »
August 23rd, 2010 by producer
Ralf's German dictionary uses the normal [x] as sign for the Ach-Laut. According to the Wiktionary, the Ach-Laut is represented by [χ] (U+03C7):
lachen /[ˈlaχn̩]/, Dach /[daχ]/, Bucht /[bʊχt]/, doch /[dɔχ]/, auch /[aʊ̯χ]/
The current development version of simon doesn’t accept the [χ]. I would like to transform the [x] into [χ] in a future version of Ralf's German dictionary.
Tags: de, [χ]
Posted in dictionary | No Comments »
August 22nd, 2010 by producer
A few minutes ago, I deleted several files from the /home/ubuntu/.kde/share/config/ folder. Then I repeated steps 1 – 4. Then I pressed the following buttons: Connect – Synchronize – Activate. And then, this error message appeared:
Obviously, a lot of words beginning with the letter b are affected. I think that maybe something went wrong with section xdb.
I found the mistake: The file:///home/ubuntu/Documents/201006/audacity/xdb-folder/prompts-xdb contains wrong content. I will have to fix that.
It is working now.
Posted in Ubuntu | 3 Comments »
August 21st, 2010 by producer
I want to use Ralf’s German speech model version 0.1.4 as static model. Because my last try a few minutes ago was not successful, I am writing this detailed report:
1. Quit simon and ksimond.
2. Delete the folders file:///home/ubuntu/.kde/share/apps/simon and
file:///home/ubuntu/.kde/share/apps/simond with the following commands:
$ rm -i -r /home/ubuntu/.kde/share/apps/simon
$ rm -r /home/ubuntu/.kde/share/apps/simond
3. Start simon. Manage scenarios > Import > Import from file. I choose file:///home/ubuntu/Downloads/german-speech-model-0.1.4/scenario-0.1.4.xml.
4. Simon > Settings > Configure simon > Model settings > Static model. Load the following paths:
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/hmmdefs-0.1.4
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/tiedlist-0.1.4
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/macros-0.1.4
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/stats-0.1.4
Press Apply. Press OK.
5. Start ksimond.
simon > Connect button.
simon > Synchronize button.
simon > Activate button. Then the following error message appears:
Could not start recognition because the system reports that the recognition is not ready.
Please check if you have defined a vocabulary, an appropriate grammar and recorded a few trainings samples.
The system will then, upon synchronization, generate the model which will be used for the recognition
What went wrong? I had imported the scenario scenario-0.1.4.xml which includes a grammar (only the terminal Unknown), and the dictation plug-in. That should be sufficient. A vocabulary is available, too, of course.
Question: What went wrong?
Posted in Ubuntu | 2 Comments »
August 21st, 2010 by producer
Take a look at Ralf's Latin dictionary - German pronunciation (version 0.1.1; 2010-08-21), and download Ralf's Latin speech model 0.1.1. Of course, this is an early approach. The speech model contains about 800 different Latin words.
Tags: latin, node19
Posted in Ubuntu, dictionary | No Comments »
August 21st, 2010 by producer
Some information about Ralf’s German speech model version 0.1.4 (download):
1. Paths to the files on my computer:
file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/hmm24/hmmdefs
file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/tiedlist
file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/hmm24/macros
file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/stats
2. The hmmdefs file of Ralf’s German speech model 0.1.3 has a size of 28 MB. At the moment, I am preparing Ralf's German speech model 0.1.4. But the size of hmmdefs is just 3.7 MB. I don’t know the reason for this.
3. I didn’t test whether Ralf’s German speech model 0.1.4 works as static model.
4. Ralf’s German speech model 0.1.4 contains a vocabulary with 21000 words.
Feedback is as always welcome.
Tags: german-speech-model
Posted in Ubuntu, dictionary | No Comments »
August 21st, 2010 by producer
How I create Ralf's German dictionary version 0.2:
1. Add this rule to improve-german.xsl:
<xsl:when test="contains(lower-case(../grapheme), 'prüf')"><xsl:value-of select="replace($sierra, 'pʀyf','pʀyːf')"/></xsl:when>
2. Add rule: Read the rest of this entry »
Tags: de
Posted in Ubuntu, dictionary | No Comments »
August 20th, 2010 by producer
At the moment, I am importing 21007 wav files into simon. I want to increase the amount of training data.
It takes a few minutes until all files are imported.
I hope that it will work as expected.
Edit: No, it just worked partially. Just about 862 words are contained in the file file:///home/ubuntu/.kde/share/apps/simond/models/default/active/model.dict. Why does simon use just this small subset?
Edit #2: I selected the “static model” option instead of “user generated model”. I will try again.
Edit #3: Obviously, I imported only a small dictionary. This means that the acoustic model has been trained with 21007 words, but the language model contains just 862 words.
Edit #4: I think that I found the mistake. The file file:///home/ubuntu/Documents/201008/lexicon-all.xml doesn’t contain valid XML code. I have to fix that, then I will try again.
Posted in Ubuntu | 1 Comment »
August 17th, 2010 by producer
Watch the video about the German speech model 'xea'. These are the 700 words that are being displayed during recognition:
Dienst Diner Dealerin Dealerinnen Dealern Dialogablaufs Dialoge Dialogeingriff Dialogen Dialogist Dialogs Dialysator Dialyse Dialysen Diamantarmband Diamantbohrer Diamanten Diamantenhalskette Read the rest of this entry »
Tags: german-speech-model, xea
Posted in Ubuntu, video | No Comments »
August 16th, 2010 by producer
How I install the German speech model 'xea':
1. Delete file:///home/ubuntu/.kde/share/apps/simon and
file:///home/ubuntu/.kde/share/apps/simond
2. Applications > Universal Access > simon > Settings > Configure simon > Model settings > Static model
3. Set the paths of the base model. The paths on my computer are as follows:
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/hmmdefs-xea
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/tiedlist-xea
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/macros-xea
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/stats-xea
4. Press Apply button. Press OK button.
5. Manage scenarios > Import > Import from file > file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/scenario-xea.xml.
6. Applications > Universal Access > ksimond
7. simon > Press the Connect button. Then press Synchronize. Then press Activate. simon is now activated.
That is all for the moment. Is the following terminology correct?
1. AM = hmmdefs + tiedlist + macros + stats
2. LM = scenario-xea.xml
Is it correct to say: "speech model = AM + LM"?
Tags: xea
Posted in Ubuntu, video | No Comments »
August 15th, 2010 by producer
Watch the video with the title German speech model 'xdb'. These 650 words appear in the video:
BTM BTSV BTX Beete beten Beetes Beethoven Beethovens Beethovensaal Beethovensaals Bertas Bertelsmann Bertas Berti Bestand Bestandes Bestands Bestandsführung Bestandsgröße Bestandskonto Bestandsliste Bestandslisten Bestandsnachweis Read the rest of this entry »
Tags: xdb
Posted in Ubuntu, video | No Comments »