Archive for the ‘Ubuntu’ Category

git pull origin master

Thursday, September 2nd, 2010

This happens in my Ubuntu terminal:

ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 66, done.
remote: Compressing objects: 100% (41/41), done.
remote: Total 41 (delta 30), reused 0 (delta 0)
Unpacking objects: 100% (41/41), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating cbc744d..4ae4550
Fast-forward
CMakeLists.txt | 6 +-
cmake/FindPortaudio.cmake | 1 +
package/KDE/CMakeLists.txt | 92 +-
package/NSIS/anpassungen.nsi | 57 +-
simon/doc/de/index.docbook | 4183 +++++++++++++-------
simon/doc/en/index.docbook | 2 +-
simon/doc/po/simonmanual.pot | 1510 ++++----
simon/doc/po/simonmanual_de.po | 1979 +++++-----
.../src/simonmodelmanagementui/ImportDict/dict.cpp | 2 +-
.../ImportTrainingData/importtrainingdata.cpp | 31 +-
.../ImportTrainingData/importtrainingdata.h | 2 +
version.h | 2 +-
12 files changed, 4580 insertions(+), 3287 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$

Great:

“+ allowedFileTypes << "wav";

+ allowedFileTypes << "mp3";

+ allowedFileTypes << "ogg";

+ allowedFileTypes << "flac";"

I am planning to release ogg files (because I don’t have unlimited webspace, and because I think that ogg quality is OK for training).

Ralf’s German dictionary 0.2.1

Monday, August 30th, 2010

How I build Ralf's German dictionary version 0.2.1: (more…)

SAMPA of Ach-Laut: [X] instead of [x]

Monday, August 30th, 2010

Here is the result after importing a dictionary that uses the IPA symbol [χ] instead of the ASCII symbol [x]:

ach-lautThe SAMPA symbol is upper-case [X], and not lower-case [x]. This is OK, I think.

The next version of Ralf's German dictionary will employ the IPA symbol [χ] instead of [x].

samview.cpp

Monday, August 30th, 2010

This is what I am doing in the Ubuntu terminal:

ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 15, done.
remote: Compressing objects: 100% (8/8), done.
remote: Total 8 (delta 7), reused 0 (delta 0)
Unpacking objects: 100% (8/8), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating 5853434..4d72b2a
Fast-forward
sam/src/samview.cpp | 2 +-
simonlib/simonmodeltest/fileresultmodel.cpp | 4 ++--
2 files changed, 3 insertions(+), 3 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh

I will see later whether the sam Edit sample button has been fixed.

./build_ubuntu.sh

Sunday, August 29th, 2010

A few seconds ago:

ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ git pull origin masterremote: Counting objects: 17, done.
remote: Compressing objects: 100% (9/9), done.
remote: Total 9 (delta 8), reused 0 (delta 0)
Unpacking objects: 100% (9/9), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating afe2203..5853434
Fast-forward
simon/model/phonemes | 1 +
.../src/simonmodelmanagementui/ImportDict/dict.cpp | 3 ---
2 files changed, 1 insertions(+), 3 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh

Let’s take a look into sam:

sam-edit-sample1. Sorting is now working. Thanks for fixing that issue.
2. But the Edit sample button seems to be broken now.

I imported the words Hochkulturen (with IPA-[χ]) and Hochkultur (with ASCII-[x]), but only the word Hochkultur is being displayed:

hochkulturAfter importing the words Hochkulturen (with UTF-8-[χ]) and Hochkultur (with ASCII-[x]), only one word remains. Obviously, the other word Hochkulturen has been omitted by simon. Maybe this is a simon security precaution.

git pull origin master

Sunday, August 29th, 2010

This is what I did a few minutes ago because sorting has been fixed and because of the symbol [χ]:

ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 35, done.
remote: Compressing objects: 100% (18/18), done.
remote: Total 18 (delta 15), reused 0 (delta 0)
Unpacking objects: 100% (18/18), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating 97be4d2..afe2203
Fast-forward
sam/src/samview.cpp | 1 +
.../src/simonmodelmanagementui/ImportDict/dict.cpp | 3 +++
simonlib/simonmodeltest/fileresultmodel.cpp | 7 +++++--
3 files changed, 9 insertions(+), 2 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh

I want to see whether the Ach-Laut is imported correctly by importing these entries (first entry with ASCII-[x]; second entry with IPA-[χ]:

<lexeme role="Substantiv">
<grapheme>Hochkultur</grapheme>
<phoneme>hoːxkʊltuːʀ</phoneme>
<phoneme>hoːxkʊltuːɐ̯</phoneme>
</lexeme>
<lexeme role="Substantiv">
<grapheme>Hochkulturen</grapheme>
<phoneme>hoːχkʊltuːʀən</phoneme>
<phoneme>hoːχkʊltuːɐ̯n</phoneme>
</lexeme>

No, it didn’t work out:

ach-laut The word Hochkultur is displayed with the correct SAMPA. But the word Hochkulturen doesn’t have the correct SAMPA.

sam: sorting should be improved

Sunday, August 29th, 2010

A few minutes ago, I tested section ‘xea’ against all. The overall recognition rate was about 95%.

recognition-rateTake a closer look at the right column: The word DIVERGIERST has a recognition rate of 100%. The word DIN has a recognition rate of 11.9569%. You can see that the sorting is not correct.

Btw, it would be interesting to export the recognition results into a text file. What about adding an Export test results button?

sam: test section ‘xea’ against all

Sunday, August 29th, 2010

At the moment, I am testing with sam the speech model xea (contains about 800 German words) against Ralf’s German speech model 0.1.6 (contains about 25000 German words). The file:///home/ubuntu/Documents/201008/sam-0.1.6/test-xea-against-all.sam has the following content:

/home/ubuntu/.kde/share/apps/simon/model/hmmdefs
/home/ubuntu/.kde/share/apps/simon/model/tiedlist
/home/ubuntu/.kde/share/apps/simon/model/model.dict
/home/ubuntu/.kde/share/apps/simon/model/model.dfa
/home/ubuntu/Documents/201008/wav-all
/home/ubuntu/Documents/201006/audacity/xea-folder/wav-xea
/home/ubuntu/Documents/201008/sam-0.1.6/lexicon
/home/ubuntu/Documents/201008/sam-0.1.6/model.grammar
/home/ubuntu/Documents/201008/sam-0.1.6/simple.voca
/home/ubuntu/Documents/201008/prompts-wav.txt
/home/ubuntu/Documents/201006/audacity/xea-folder/prompts-xea
/usr/share/kde4/apps/simon/model/tree1.hed
/usr/share/kde4/apps/simon/model/wav_config
16000
/usr/share/kde4/apps/simond/default.jconf
2

Question: what does the 2 in the last line mean?

By the way, I used the Serialize scenarios button (input file:///home/ubuntu/Documents/201007/german-speech-model-0.1.6/scenario-0.1.6.xml) to generate the following files:
file:///home/ubuntu/Documents/201008/sam-0.1.6/lexicon
file:///home/ubuntu/Documents/201008/sam-0.1.6/model.grammar
file:///home/ubuntu/Documents/201008/sam-0.1.6/simple.voca

I think that it is a good idea to test a subset (e.g. xea) against the superset (xaa+xab+xac+…+xpw).

Ralf’s German speech model 0.1.6

Saturday, August 28th, 2010

How I create Ralf’s German speech model 0.1.6:

1. Concatenate the different prompts file into one single prompts file:

cat /home/ubuntu/Documents/201006/audacity/xaa-folder/prompts-xaa \
/home/ubuntu/Documents/201006/audacity/xab-folder/prompts-xab \ (more...)

Ralf’s German speech model 0.1.5

Thursday, August 26th, 2010

How I create Ralf's German speech model version 0.1.5 (download):

1. Delete old directories (cp-xmb):

rm -r /home/ubuntu/.kde/share/apps/simon
rm -r /home/ubuntu/.kde/share/apps/simond (more...)

What went wrong?

Sunday, August 22nd, 2010

A few minutes ago, I deleted several files from the /home/ubuntu/.kde/share/config/ folder. Then I repeated steps 1 – 4. Then I pressed the following buttons: Connect – Synchronize – Activate. And then, this error message appeared:

phonemes-affectedObviously, a lot of words beginning with the letter b are affected. I think that maybe something went wrong with section xdb.

I found the mistake: The file:///home/ubuntu/Documents/201006/audacity/xdb-folder/prompts-xdb contains wrong content. I will have to fix that.

It is working now.

Trying to use as static model

Saturday, August 21st, 2010

I want to use Ralf’s German speech model version 0.1.4 as static model. Because my last try a few minutes ago was not successful, I am writing this detailed report:

1. Quit simon and ksimond.
2. Delete the folders file:///home/ubuntu/.kde/share/apps/simon and
file:///home/ubuntu/.kde/share/apps/simond with the following commands:

$ rm -i -r /home/ubuntu/.kde/share/apps/simon
$ rm -r /home/ubuntu/.kde/share/apps/simond

3. Start simon. Manage scenarios > Import > Import from file. I choose file:///home/ubuntu/Downloads/german-speech-model-0.1.4/scenario-0.1.4.xml.

4. Simon > Settings > Configure simon > Model settings > Static model. Load the following paths:
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/hmmdefs-0.1.4
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/tiedlist-0.1.4
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/macros-0.1.4
file:///home/ubuntu/Downloads/german-speech-model-0.1.4/stats-0.1.4

Press Apply. Press OK.

5. Start ksimond.
simon > Connect button.
simon > Synchronize button.
simon > Activate button. Then the following error message appears:

Could not start recognition because the system reports that the recognition is not ready.

Please check if you have defined a vocabulary, an appropriate grammar and recorded a few trainings samples.

The system will then, upon synchronization, generate the model which will be used for the recognition

What went wrong? I had imported the scenario scenario-0.1.4.xml which includes a grammar (only the terminal Unknown), and the dictation plug-in. That should be sufficient. A vocabulary is available, too, of course.

Question: What went wrong?

Ralf’s Latin speech model 0.1.1

Saturday, August 21st, 2010

Take a look at Ralf's Latin dictionary - German pronunciation (version 0.1.1; 2010-08-21), and download Ralf's Latin speech model 0.1.1. Of course, this is an early approach. The speech model contains about 800 different Latin words.

Ralf’s German speech model 0.1.4

Saturday, August 21st, 2010

Some information about Ralf’s German speech model version 0.1.4 (download):

1. Paths to the files on my computer:

file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/hmm24/hmmdefs
file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/tiedlist
file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/hmm24/macros
file:///tmp/kde-ubuntuJPP9Sy/simond/default/compile/stats

2. The hmmdefs file of Ralf’s German speech model 0.1.3 has a size of 28 MB. At the moment, I am preparing Ralf's German speech model 0.1.4. But the size of hmmdefs is just 3.7 MB. I don’t know the reason for this.

3. I didn’t test whether Ralf’s German speech model 0.1.4 works as static model.

4. Ralf’s German speech model 0.1.4 contains a vocabulary with 21000 words.

Feedback is as always welcome.

Ralf’s German dictionary 0.2

Saturday, August 21st, 2010

How I create Ralf's German dictionary version 0.2:

1. Add this rule to improve-german.xsl:

<xsl:when test="contains(lower-case(../grapheme), 'prüf')"><xsl:value-of select="replace($sierra, 'pʀyf','pʀyːf')"/></xsl:when>

2. Add rule: (more…)

Importing 21007 wav files

Friday, August 20th, 2010

At the moment, I am importing 21007 wav files into simon. I want to increase the amount of training data.

import-wav-filesIt takes a few minutes until all files are imported.

I hope that it will work as expected.

Edit: No, it just worked partially. Just about 862 words are contained in the file file:///home/ubuntu/.kde/share/apps/simond/models/default/active/model.dict. Why does simon use just this small subset?

Edit #2: I selected the “static model” option instead of “user generated model”. I will try again.

Edit #3: Obviously, I imported only a small dictionary. This means that the acoustic model has been trained with 21007 words, but the language model contains just 862 words.

Edit #4: I think that I found the mistake. The file file:///home/ubuntu/Documents/201008/lexicon-all.xml doesn’t contain valid XML code. I have to fix that, then I will try again.

Video: German speech model ‘xea’

Tuesday, August 17th, 2010

Watch the video about the German speech model 'xea'. These are the 700 words that are being displayed during recognition:

Dienst Diner Dealerin Dealerinnen Dealern Dialogablaufs Dialoge Dialogeingriff Dialogen Dialogist Dialogs Dialysator Dialyse Dialysen Diamantarmband Diamantbohrer Diamanten Diamantenhalskette (more…)

Installing German speech model ‘xea’

Monday, August 16th, 2010

How I install the German speech model 'xea':

1. Delete file:///home/ubuntu/.kde/share/apps/simon and
file:///home/ubuntu/.kde/share/apps/simond
2. Applications > Universal Access > simon > Settings > Configure simon > Model settings > Static model
3. Set the paths of the base model. The paths on my computer are as follows:
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/hmmdefs-xea
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/tiedlist-xea
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/macros-xea
file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/stats-xea

4. Press Apply button. Press OK button.
5. Manage scenarios > Import > Import from file > file:///home/ubuntu/Documents/201006/audacity/xea-folder/german-speech-model-xea/scenario-xea.xml.
6. Applications > Universal Access > ksimond
7. simon > Press the Connect button. Then press Synchronize. Then press Activate. simon is now activated.

That is all for the moment. Is the following terminology correct?
1. AM = hmmdefs + tiedlist + macros + stats
2. LM = scenario-xea.xml

Is it correct to say: "speech model = AM + LM"?

Video: German speech model ‘xdb’

Sunday, August 15th, 2010

Watch the video with the title German speech model 'xdb'. These 650 words appear in the video:

BTM BTSV BTX Beete beten Beetes Beethoven Beethovens Beethovensaal Beethovensaals Bertas Bertelsmann Bertas Berti Bestand Bestandes Bestands Bestandsführung Bestandsgröße Bestandskonto Bestandsliste Bestandslisten Bestandsnachweis (more…)

Video: German speech model ‘xab’

Friday, August 13th, 2010

I updated the video recognize-german.wmv (size: about 30 MB). In the video, you can see how I dictate more than 500 different German words with the German speech model “xab”. The space bars are now inserted automatically thanks to the dictation plug-in. The speech model has been trained with Ralf's German IPA FLAC files (section: xab). Each word has been trained exactly one time.