Archive for the ‘Ubuntu’ Category

simon_0.3.0-1ubuntu8_amd64.deb

Friday, November 26th, 2010

A few minutes ago, I downloaded and installed simon_0.3.0-1ubuntu8_amd64.deb.

I want to develop Ralf's German speech model 0.1.9.2 with 36000 German words. Here is what happens:

Click on Manage plug-ins.

Press the Add button.

The KDE Crash Handler appears. Read the generated crash info.

My goal is to activate the dictation plug-in. But unfortunately, simon crashes before I can do that.

simon_0.3.0-1ubuntu3_amd64.deb

Tuesday, November 23rd, 2010

A few minutes ago, I downloaded simon_0.3.0-1ubuntu3_amd64.deb. But when I try to install simon, the following error message appears:

I don’t know how to solve this dependency issue.

./build_ubuntu.sh: Permission denied

Sunday, November 7th, 2010

This happened in my Ubuntu terminal a few minutes ago:

ubuntu@ubuntu-desktop:~/Documents/201011$ git clone git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
Initialized empty Git repository in /home/ubuntu/Documents/201011/speech2text/.git/
remote: Counting objects: 28969, done.
remote: Compressing objects: 100% (12227/12227), done.
remote: Total 28969 (delta 22628), reused 21585 (delta 16575)
Receiving objects: 100% (28969/28969), 67.51 MiB | 488 KiB/s, done.
Resolving deltas: 100% (22628/22628), done.
ubuntu@ubuntu-desktop:~/Documents/201011$ cd /home/ubuntu/Documents/201011/speech2text
ubuntu@ubuntu-desktop:~/Documents/201011/speech2text$ ./build_ubuntu.sh
bash: ./build_ubuntu.sh: Permission denied
ubuntu@ubuntu-desktop:~/Documents/201011/speech2text$

Apparently I am not allowed to execute ./build_ubuntu.sh. Why?
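A common cause of this message is that the script file lacks the execute bit; a file system mounted with noexec produces the same error. The following is a minimal sketch, assuming the missing execute bit is the cause here, which the terminal output above does not confirm. It uses a throwaway stand-in script in a temporary directory, not the real build_ubuntu.sh.

```shell
# Create a stand-in script without the execute bit (hypothetical example file).
workdir=$(mktemp -d)
script="$workdir/build_ubuntu.sh"
printf '#!/bin/sh\necho "build started"\n' > "$script"
chmod 644 "$script"

# Inspect the permissions: no "x" in the mode string means the file
# cannot be started directly.
ls -l "$script"

# Option 1: hand the script to the interpreter, which only needs read permission.
via_interpreter=$(sh "$script")

# Option 2: add the execute bit, then run the script directly.
chmod +x "$script"
direct=$("$script")

echo "$via_interpreter / $direct"
```

In the speech2text checkout, the corresponding commands would be chmod +x ./build_ubuntu.sh followed by ./build_ubuntu.sh, or bash ./build_ubuntu.sh without changing the permissions.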

Unable to locate libqt4-phonon-dev

Sunday, November 7th, 2010

simon is not running on my computer any more. I don’t know the reason for this. It is impossible to start simon or sam. I am able to start ksimond. I tried the following:

sudo apt-get install git-core build-essential cmake bison flex gettext gettext-kde kdeartwork \
kdelibs5-dev libxtst-dev libqt4-sql-sqlite qtmobility-dev libqt4-phonon-dev libattica-dev libattica0 zlib1g-dev \
portaudio19-dev

Then this message appeared:

Reading package lists... Done
Building dependency tree
Reading state information... Done
E: Unable to locate package libqt4-phonon-dev

How can I install the package libqt4-phonon-dev? I am using Ubuntu 10.10. My guess is that there were some changes during the upgrade from Ubuntu 10.04 to Ubuntu 10.10.
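To narrow this down, it may help to check which of the requested packages the configured repositories actually offer. This is a minimal sketch, assuming that apt-cache show exits with a non-zero status for a package name it does not know; pkg_available and missing_packages are helper names made up for this example.

```shell
# pkg_available NAME: succeed if apt knows a package with this name.
# Assumption: "apt-cache show" returns non-zero for unknown packages.
pkg_available() {
    apt-cache show "$1" >/dev/null 2>&1
}

# missing_packages NAME...: print every name that apt cannot resolve,
# one per line, so the install command can be adjusted accordingly.
missing_packages() {
    for pkg in "$@"; do
        pkg_available "$pkg" || printf '%s\n' "$pkg"
    done
}

# Example call (left commented out because it needs a system with apt):
# missing_packages cmake bison libqt4-phonon-dev libattica-dev
```

For each name that is printed, apt-cache search phonon (or another keyword) lists packages whose name or description mentions that keyword, which may reveal a renamed package.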

qwt_series_data.h

Saturday, November 6th, 2010

A few minutes ago in my Ubuntu terminal:

[ 91%] Built target translations
[ 91%] Built target translations
[ 91%] Built target de-handbook
[ 91%] Built target en-handbook
[ 91%] Built target sam_automoc
Scanning dependencies of target sam
[ 91%] Building CXX object sam/src/CMakeFiles/sam.dir/samview.o
In file included from /home/ubuntu/Documents/201005/speech2text/sam/src/samview.cpp:27:
/home/ubuntu/Documents/201005/speech2text/sam/src/carraydata.h:23: fatal error: qwt_series_data.h: No such file or directory
compilation terminated.
make[2]: *** [sam/src/CMakeFiles/sam.dir/samview.o] Error 1
make[1]: *** [sam/src/CMakeFiles/sam.dir/all] Error 2
make: *** [all] Error 2
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$
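The compiler cannot find qwt_series_data.h in any of its include directories. A quick way to check whether the header is installed at all is to search the usual include locations. This is a minimal sketch; find_header is a made-up helper, and the directories named in the note below are assumptions about where headers typically live on Ubuntu.

```shell
# find_header NAME DIR...: print every path below the given directories
# whose file name is NAME. Errors from unreadable directories are hidden.
find_header() {
    header=$1
    shift
    find "$@" -name "$header" -print 2>/dev/null
}

# Self-contained demonstration with a temporary include tree.
demo=$(mktemp -d)
mkdir -p "$demo/include/qwt"
touch "$demo/include/qwt/qwt_series_data.h"
found=$(find_header qwt_series_data.h "$demo/include")
echo "$found"
```

On the real system, the call would be find_header qwt_series_data.h /usr/include /usr/local/include. An empty result suggests that the installed Qwt development files, if any, do not provide this header.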

./build_ubuntu.sh: Permission denied

Wednesday, November 3rd, 2010

Let’s take a look into my Ubuntu terminal:

ubuntu@ubuntu-desktop:~$ cd /home/ubuntu/Documents/201005/speech2text && git pull origin master
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Already up-to-date.
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh
bash: ./build_ubuntu.sh: Permission denied
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$

Why is permission denied?

Could not find Qwt 6.x

Thursday, October 21st, 2010

This happened an hour ago in my Ubuntu terminal:

-- Found Qt-Version 4.6.2 (using /usr/bin/qmake)
-- Found X11: /usr/lib/libX11.so
CMake Error at cmake/FindQwt6.cmake:101 (MESSAGE):
Could not find Qwt 6.x
Call Stack (most recent call first):
sam/src/CMakeLists.txt:1 (find_package)

-- Configuring incomplete, errors occurred!
make: *** [cmake_check_build_system] Error 1
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$

There seems to be a problem with Qwt 6.x.

simond crashed. (Status: 1)

Saturday, October 2nd, 2010

The situation is as follows: I have Ralf’s German speech model 0.1.9.1. Here is what I do:

1. Delete old simon configuration files:

rm -r /home/ubuntu/.kde/share/apps/simon
rm -r /home/ubuntu/.kde/share/apps/simond
rm /home/ubuntu/.kde/share/config/sscrc
rm /home/ubuntu/.kde/share/config/speechmodelmanagementrc
rm /home/ubuntu/.kde/share/config/simonsoundrc
rm /home/ubuntu/.kde/share/config/simonscenariosrc
rm /home/ubuntu/.kde/share/config/simonrecognitionrc
rm /home/ubuntu/.kde/share/config/simonrc
rm /home/ubuntu/.kde/share/config/simoncommandrc
rm /home/ubuntu/.kde/share/config/samrc
rm /home/ubuntu/.kde/share/config/ksimondrc

2. Start simon and ksimond. simon > Manage scenarios > Import from file: file:///home/ubuntu/Documents/201009/combine-0.2.1/ralfs-german-speech-model-0.1.9.1/scenario-de-0.1.9.1.xml

3. simon > Settings > Configure simon > Model settings > Static model:
file:///home/ubuntu/Documents/201009/combine-0.2.1/ralfs-german-speech-model-0.1.9.1/hmmdefs-de-0.1.9.1
file:///home/ubuntu/Documents/201009/combine-0.2.1/ralfs-german-speech-model-0.1.9.1/tiedlist-de-0.1.9.1
file:///home/ubuntu/Documents/201009/combine-0.2.1/ralfs-german-speech-model-0.1.9.1/macros-de-0.1.9.1
file:///home/ubuntu/Documents/201009/combine-0.2.1/ralfs-german-speech-model-0.1.9.1/stats-de-0.1.9.1
Press Apply. Press OK.

4. Press Connect. simon starts to synchronize with ksimond automatically. simon is now Activated. It is possible to dictate a few words:

eingepferchter eingepflanztem Gerundivum gehocktem Gericht Ätzbad Gerätedichtung gegärtes eingeteilter eingeteiltem eingetragene eingepflegtem Gesetzwidrigkeiten

Most words were recognized correctly. Ralf’s German speech model 0.1.9.1 consists of 31000 German words. I want to try the following:

5. Vocabulary > Active vocabulary > Clear > Do you really want to clear the whole vocabulary? > Yes > Are you absolutely sure you want to continue? > Continue

6. Import dictionary > Target: Active dictionary > Next > PLS lexicon > Next > /home/ubuntu/Documents/201008/german-0.2.1/german-0.2.3.xml > Next > Processing lexicon > Dictionary import was successful > Finish

7. simon is now automatically compiling the model. simon needs a lot of time for the creation of the dictionary file: file:///home/ubuntu/.kde/share/apps/simond/models/default/active/lexicon
The compilation is now finished. Press Activate. Something went wrong:

a. I pressed the Activate button.
b. A few moments later, the message Connection error: The remote host closed the connection appears.
c. There is something wrong with ksimond: simond crashed. (Status: 1)

8. I think I will post this in the bug tracker. The bug has appeared twice on my computer, which means that it is reproducible.
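The cleanup from step 1 can be wrapped in a small function so that it is easy to repeat when reproducing the crash. This is a minimal sketch using the file names listed in step 1; reset_simon_config is a made-up name, and it takes the KDE directory as an argument so that it can be tried on a scratch directory first.

```shell
# reset_simon_config KDE_DIR: delete the simon/simond data directories and
# the configuration files named in step 1 below KDE_DIR (e.g. "$HOME/.kde").
# Unlike plain rm, the -f flag stays silent when a file is already gone.
reset_simon_config() {
    kde_dir=$1
    rm -rf "$kde_dir/share/apps/simon" "$kde_dir/share/apps/simond"
    for rc in sscrc speechmodelmanagementrc simonsoundrc simonscenariosrc \
              simonrecognitionrc simonrc simoncommandrc samrc ksimondrc; do
        rm -f "$kde_dir/share/config/$rc"
    done
}

# Demonstration on a scratch directory instead of the real ~/.kde.
scratch=$(mktemp -d)
mkdir -p "$scratch/share/apps/simon" "$scratch/share/config"
touch "$scratch/share/config/simonrc" "$scratch/share/config/kdeglobals"
reset_simon_config "$scratch"
ls "$scratch/share/config"
```

Only the files named in the list are removed; unrelated configuration files in the same directory are left alone.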

Ralf’s German speech model 0.1.8

Sunday, September 26th, 2010

You can download Ralf’s German speech model version 0.1.8. It contains 28000 German words (from sections: alpha bravo charlie diego echo).

git pull origin master

Friday, September 24th, 2010

A few minutes ago in my Ubuntu terminal:

$ cd /home/ubuntu/Documents/201005/speech2text && git pull origin master [...]
$ ./build_ubuntu.sh [...] (more...)

Ralf’s German dictionary 0.2.2

Sunday, September 19th, 2010

How I build Ralf's German dictionary version 0.2.2:

1. Add rule

<xsl:when test="contains(lower-case(../grapheme), 'system')"><xsl:value-of select="replace($sierra, 'systəm','systeːm')"/></xsl:when>

2. Improve about 35 elements:

<xsl:when test="contains(lower-case(../grapheme), 'angestellt')"><xsl:value-of select="replace($sierra, 'aŋɛstɛlt','aŋgɛstɛlt')"/></xsl:when>

3. Improve 131 elements:

<xsl:when test="contains(lower-case(../grapheme), 'brüch')"><xsl:value-of select="replace($sierra, 'bʀyç','bʀʏç')"/></xsl:when>

4. Add rule: (more…)
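Each rule above has the same shape: if the lower-cased grapheme contains a given string, a substring of the phoneme transcription is replaced. As an illustration outside XSLT, the first rule can be mimicked in the shell. This is a stand-in sketch, not the stylesheet used for the dictionary; fix_system and the sample transcription are made up.

```shell
# fix_system GRAPHEME PHONEME: print PHONEME, with every 'systəm' replaced
# by 'systeːm' if the lower-cased GRAPHEME contains 'system'.
# Note: tr only lower-cases what the current locale supports; plain ASCII
# letters are enough for this particular rule.
fix_system() {
    grapheme=$1
    phoneme=$2
    lower=$(printf '%s' "$grapheme" | tr '[:upper:]' '[:lower:]')
    case $lower in
        *system*) printf '%s\n' "$phoneme" | sed 's/systəm/systeːm/g' ;;
        *)        printf '%s\n' "$phoneme" ;;
    esac
}

# Made-up sample entry: the grapheme matches, so the transcription changes.
fix_system 'Betriebssystem' 'bətʀiːpssystəm'
```

Entries whose grapheme does not contain the string pass through unchanged, which corresponds to a rule that simply does not fire.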

Ralf’s German speech model 0.1.7

Tuesday, September 14th, 2010

How I (try to) create Ralf’s German speech model version 0.1.7:

1. Get a list of file names:

ls /home/ubuntu/Documents/201009/combine-0.2.1/wav-all > /home/ubuntu/Documents/201009/combine-0.2.1/dir-wav-all

2. Compare the list of file names with Ralf’s German dictionary:

cat /home/ubuntu/Documents/201008/german-0.2.1/object/german-0.2.2.xml | \
saxonb-xslt -ext:on -s:- -xsl:/home/ubuntu/Documents/201009/combine-0.2.1/compare.xsl

3. Delete old simon configuration files:

rm -r /home/ubuntu/.kde/share/apps/simon
rm -r /home/ubuntu/.kde/share/apps/simond
rm /home/ubuntu/.kde/share/config/sscrc
rm /home/ubuntu/.kde/share/config/speechmodelmanagementrc
rm /home/ubuntu/.kde/share/config/simonsoundrc
rm /home/ubuntu/.kde/share/config/simonscenariosrc
rm /home/ubuntu/.kde/share/config/simonrecognitionrc
rm /home/ubuntu/.kde/share/config/simonrc
rm /home/ubuntu/.kde/share/config/simoncommandrc
rm /home/ubuntu/.kde/share/config/samrc
rm /home/ubuntu/.kde/share/config/ksimondrc

4. Import Ralf's German dictionary into simon. Before I do that, I go to simon > Manage scenarios > New, and create the scenario “de-20100914”. I will export this scenario, and then use it with the “serialize scenarios” function in sam.

5. simon > Add grammar: Zahlwort Adjektiv Substantiv Verb
Add command: dictation plug-in

6. I want to use sam for the creation of the speech model. More than 300000 words should be in the speech model that I want to create.

7. Manage scenarios: Export scenario “de-20100914”

8. A few steps later, the following error message (sam) appears:

Generation of the transcription failed. Please check if you have correctly specified the paths to mkphones0.led and mkphons1.led. (/usr/share/kde4/apps/simon/scripts/mkphones0.led, /usr/share/kde4/apps/simon/scripts/mkphones1.led)

And this is the corresponding build log:

Preperation
Generating wordlist…
Creating monophones…
"/usr/local/bin/HDMan" -A -D -T 1 -m -w "/tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//wlist" -g "/usr/share/kde4/apps/simon/scripts/global.ded" -n "/tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//monophones1" -i "/tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//dict" "/tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//lexicon"
/usr/local/bin/HDMan -A -D -T 1 -m -w /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//wlist -g /usr/share/kde4/apps/simon/scripts/global.ded -n /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//monophones1 -i /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//dict /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//lexicon

No HTK Configuration Parameters Set

Output dictionary /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//dict opened
Source dictionary /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//lexicon opened
Dictionary /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//dict created – 26168 words processed, 2 missing

No HTK Configuration Parameters Set

Generating Master Label File…
"/usr/local/bin/HLEd" -A -D -T 1 -l "*" -d "/tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//dict" -i "/tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//phones0.mlf" "/usr/share/kde4/apps/simon/scripts/mkphones0.led" "/tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//words.mlf"
/usr/local/bin/HLEd -A -D -T 1 -l * -d /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//dict -i /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//phones0.mlf /usr/share/kde4/apps/simon/scripts/mkphones0.led /tmp/kde-ubuntuJPP9Sy/sam/internalsamuser/compile//words.mlf

No HTK Configuration Parameters Set

Editing file: fiːɐ̯.lab

ERROR [+6550] LoadHTKList: Label Name Expected
FATAL ERROR – Terminating program /usr/local/bin/HLEd

I don’t know what went wrong.

9. I will release Ralf’s German speech model 0.1.7 later, not now.

Edit September 17, 2010: 10. Download Ralf's German speech model 0.1.7.

German speech model ‘bravo’

Sunday, September 12th, 2010

How I create the German speech model 'bravo':

1. Delete old files:

rm -r /home/ubuntu/.kde/share/apps/simon
rm -r /home/ubuntu/.kde/share/apps/simond

2. Start simon. Manage scenarios > New scenario.

3. Import PLS dictionary as active dictionary: /home/ubuntu/Documents/201009/combine-0.2.1/lexicon-bravo.xml

4. Define grammar: Unknown

5. Training > Import training data > Import prompts > /home/ubuntu/Documents/201009/combine-0.2.1/prompts-bravo and STOP. I have to convert the FLAC files into WAV files before I can import them into simon (I haven’t figured out how post-processing commands work).

6. Ubuntu terminal:

mkdir /home/ubuntu/Documents/201009/combine-0.2.1/wav-bravo
cd /home/ubuntu/Documents/201009/combine-0.2.1/bravo/ && \
for f in *.flac; do sox "$f" "/home/ubuntu/Documents/201009/combine-0.2.1/wav-bravo/${f%.flac}.wav"; done

7. Prompts base directory: /home/ubuntu/Documents/201009/combine-0.2.1/wav-bravo

8. Add dictation plug-in. Start ksimond. Press Connect button. Press Synchronize. Press Activate, the following message appears:

The recognition reported the following error:
The recognition could not be started because your model contains words that consists of sounds that are not covered by your acoustic model.

You need to either remove those words, transcribe them differently or train them.

Warning: The latter will not work if you are using static base models!

This could also be a sign of a base model that uses a different phoneme set than your scenario vocabulary.

The following words are affected (list may not be complete):
Betätigen, diejenigen

The following phonemes are affected (list may not be complete):
I-g+N=,

OK, I will move the two words Betätigen, diejenigen from the Unknown to the Unused terminal, and then compile again.

9. Press Activate. Dictate a few words into Firefox:

ausbildungsadäquate ausbildungsadäquatem ausbildungsfreundliche ausbildungsfreundlichem ausblendete ausblieb ausboote ausborgst ausbringendes ausbürstende ausdehnendem ausdehnenden ausdehnender ausdiskutiere ausdorrender ausdorrten ausdrucksfähigen

All words above were recognized correctly.

10. Download the German speech model 'bravo'.

Ralf’s Latin speech model 0.1.2

Sunday, September 12th, 2010

Download Ralf's Latin speech model version 0.1.2. It contains more than 2000 Latin words (from sections xaa and xaf) in German pronunciation.

Latin speech model ‘xaa’

Sunday, September 12th, 2010

Take a look at Ralf's Latin IPA FLAC files (section: xaa). I want to build a speech model that recognizes these words.

1. I have to prepare a prompts file:

$ cat /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/lexicon-xaa.xml | saxonb-xslt -ext:on -s:- -xsl:/home/ubuntu/Documents/201008/latin-0.1.2/lexiconxaa2prompts.xsl

2. Remove old files:

$ rm -i -r /home/ubuntu/.kde/share/apps/simon
rm: descend into directory `/home/ubuntu/.kde/share/apps/simon'? y
rm: remove directory `/home/ubuntu/.kde/share/apps/simon/model'? y
rm: remove regular file `/home/ubuntu/.kde/share/apps/simon/shadowvocabulary.xml'? y
rm: remove regular file `/home/ubuntu/.kde/share/apps/simon/protocol.log'? y
rm: remove directory `/home/ubuntu/.kde/share/apps/simon'? y

3. Start simon 0.3.0. Manage scenarios > New. Name of the new scenario: latin-xaa

4. Vocabulary > Import dictionary > Target: Active dictionary > PLS lexicon > File: /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/lexicon-xaa.xml

5. Grammar > Add sentence > Add structure: Unknown

6. Training > Import training data > Import Prompts > Prompts: /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/prompts-xaa

It is necessary to convert the FLAC files into WAV format via Ubuntu terminal:

mkdir /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/wav-xaa
cd /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/flac-xaa && \
for f in *.flac; do sox "$f" "/home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/wav-xaa/${f%.flac}.wav"; done

Import Prompts > Base directory: /home/ubuntu/Documents/201008/latin-0.1.2/split/xaa-folder/wav-xaa > Importing 899 files

7. Add dictation plug-in.

8. Start ksimond. Connect with ksimond. Press Connect. Press Synchronize. Press Activate. The following error message appears:

Could not start recognition because the system reports that the recognition is not ready.

Please check if you have defined a vocabulary, an appropriate grammar and recorded a few trainings samples.

The system will then, upon synchronization, generate the model which will be used for the recognition.

What did I forget? I have an active vocabulary (each word has a recognition rate of 1). I have defined a grammar (Unknown). I have imported training data. And I have the dictation plugin. I check Settings > Configure simon > Model settings > User generated model. So everything should be clear.

9. I don’t know what went wrong. Any hint?

[see comments]

12. Download Latin speech model 'xaa'.

Import FLAC files

Friday, September 10th, 2010

A few minutes ago, I did git pull origin master, then I imported a folder with FLAC files into simon (simon > Training > Import training data > Import prompts). The wizard didn’t finish:

Apparently (almost) all files were imported, but the wizard doesn’t finish, so I press Cancel.

I didn’t set up post-processing commands. The folder /home/ubuntu/.kde/share/apps/simon/model/training.data contains 3522 WAV audio files (from section alpha). Why not 3523 (see screenshot)? What went wrong? Why can’t I finish this wizard? I want to import FLAC files directly, without having to convert them from FLAC to WAV format first.
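One way to find out which recording did not make it into training.data is to compare the base names of the source files with those of the imported WAV files. This is a minimal sketch, assuming the imported files keep the base names of the FLAC files, which I have not verified for simon; missing_wavs is a made-up helper.

```shell
# missing_wavs FLAC_DIR WAV_DIR: print the base name of every *.flac file in
# FLAC_DIR that has no *.wav file with the same base name in WAV_DIR.
missing_wavs() {
    for f in "$1"/*.flac; do
        [ -e "$f" ] || continue          # the glob matched nothing
        base=$(basename "$f" .flac)
        [ -e "$2/$base.wav" ] || printf '%s\n' "$base"
    done
}

# Demonstration with scratch directories and made-up file names.
src=$(mktemp -d)
dst=$(mktemp -d)
touch "$src/a.flac" "$src/b.flac" "$src/c.flac"
touch "$dst/a.wav" "$dst/c.wav"
missing=$(missing_wavs "$src" "$dst")
echo "$missing"
```

With the real data, the first argument would be the folder with the FLAC files and the second the training.data folder named above.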

git pull origin master

Thursday, September 2nd, 2010

This happens in my Ubuntu terminal:

ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 66, done.
remote: Compressing objects: 100% (41/41), done.
remote: Total 41 (delta 30), reused 0 (delta 0)
Unpacking objects: 100% (41/41), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating cbc744d..4ae4550
Fast-forward
CMakeLists.txt | 6 +-
cmake/FindPortaudio.cmake | 1 +
package/KDE/CMakeLists.txt | 92 +-
package/NSIS/anpassungen.nsi | 57 +-
simon/doc/de/index.docbook | 4183 +++++++++++++-------
simon/doc/en/index.docbook | 2 +-
simon/doc/po/simonmanual.pot | 1510 ++++----
simon/doc/po/simonmanual_de.po | 1979 +++++-----
.../src/simonmodelmanagementui/ImportDict/dict.cpp | 2 +-
.../ImportTrainingData/importtrainingdata.cpp | 31 +-
.../ImportTrainingData/importtrainingdata.h | 2 +
version.h | 2 +-
12 files changed, 4580 insertions(+), 3287 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$

Great:

+ allowedFileTypes << "wav";
+ allowedFileTypes << "mp3";
+ allowedFileTypes << "ogg";
+ allowedFileTypes << "flac";

I am planning to release ogg files (because I don’t have unlimited webspace, and because I think that ogg quality is OK for training).

Ralf’s German dictionary 0.2.1

Monday, August 30th, 2010

How I build Ralf's German dictionary version 0.2.1: (more…)

SAMPA of Ach-Laut: [X] instead of [x]

Monday, August 30th, 2010

Here is the result after importing a dictionary that uses the IPA symbol [χ] instead of the ASCII symbol [x]:

[Screenshot: ach-laut]

The SAMPA symbol is upper-case [X], and not lower-case [x]. This is OK, I think.

The next version of Ralf's German dictionary will employ the IPA symbol [χ] instead of [x].

samview.cpp

Monday, August 30th, 2010

This is what I am doing in the Ubuntu terminal:

ubuntu@ubuntu-desktop:~$ cd Documents/201005/speech2text && git pull origin master
remote: Counting objects: 15, done.
remote: Compressing objects: 100% (8/8), done.
remote: Total 8 (delta 7), reused 0 (delta 0)
Unpacking objects: 100% (8/8), done.
From git://speech2text.git.sourceforge.net/gitroot/speech2text/speech2text
* branch master -> FETCH_HEAD
Updating 5853434..4d72b2a
Fast-forward
sam/src/samview.cpp | 2 +-
simonlib/simonmodeltest/fileresultmodel.cpp | 4 ++--
2 files changed, 3 insertions(+), 3 deletions(-)
ubuntu@ubuntu-desktop:~/Documents/201005/speech2text$ ./build_ubuntu.sh

I will see later whether the sam Edit sample button has been fixed.