Archive for the ‘import’ Category

German speech model ‘debs’

Friday, March 4th, 2011

You can import the German speech model ‘debs’ into simon. The words are taken from section debs, and can be downloaded from Voxforge.

German speech model ‘debn’

Thursday, March 3rd, 2011

Import the German speech model ‘debn’ into simon. The words are taken from section debn. You can get them from Voxforge, too.

German speech model ‘debd’

Thursday, March 3rd, 2011

It is possible to import the German speech model ‘debd’ into simon. The words are from section debd, and can be downloaded from Voxforge.

German speech model ‘deat’

Thursday, March 3rd, 2011

You can import the German speech model ‘deat’ into simon. It contains the words from section ‘deat‘. The words (and the corresponding PLS dictionary) are available at Voxforge, too.

German speech model ‘xaae’

Tuesday, March 1st, 2011

You can import the German speech model ‘xaae’ into simon. You can find the corresponding audio files at Voxforge.

German speech model ‘xaaj’

Monday, February 28th, 2011

You can import the German speech model ‘xaaj’ (license: GPLv3) into simon. You can find the words that are used within this speech model at Voxforge.

German speech model ‘deaj’

Sunday, February 27th, 2011

You can import the German speech model ‘deaj’ into simon. The speech model contains words from section deaj. The words can be downloaded from Voxforge, too.

German: how to pronounce ‘Pension’

Sunday, February 27th, 2011

According to the Wiktionary, the German word ‘Pension’ can be pronounced in three different ways:

IPA: [pãˈzi̯oːn], auch [paŋ̩ˈzi̯oːn], [pɛnˈzi̯oːn] Plural: [pãˈzi̯oːnən], auch [paŋ̩ˈzi̯oːnən], [pɛnˈzi̯oːnən]

As far as I know, simon couldn’t import the IPA phoneme /ã/ correctly.

Latin speech model ‘xaz’

Friday, February 25th, 2011

You can import the Latin speech model ‘xaz’ into simon. The words are taken from section ‘xaz’. They can be found at Voxforge, too.

Latin speech model ‘xau’

Friday, February 25th, 2011

You can import the Latin speech model ‘xau’ into simon. It contains words from section ‘xau’. You can find these words at Voxforge, too.

Latin speech model ‘xap’

Friday, February 25th, 2011

You can import the Latin speech model ‘xap’ into simon. It contains words from section ‘xap’. You can get the audio files from Voxforge, too.

Latin speech model ‘xak’

Thursday, February 24th, 2011

You can import the Latin speech model ‘xak’ into simon. The words that are used in this speech model can be found in section xak.

German speech model ‘deae’

Tuesday, February 22nd, 2011

The German speech model 'deae' contains the words from Schott’s German IPA FLAC files (section: deae). You can – as always – import it into simon.

German speech model ‘alpha’

Sunday, February 20th, 2011

The German speech model 'alpha' contains words from section alpha. You can import it into simon.

German speech model ‘charlie’

Saturday, February 19th, 2011

Please, download and import the German speech model 'charlie' into simon. It contains the words from section charlie.

German speech model ‘diego’

Saturday, February 19th, 2011

You can import the German speech model 'diego' (0.6 MB) into simon. It contains the words from section diego.

German speech model ‘echo’

Saturday, February 19th, 2011

You can import the German speech model ‘echo’ (it contains the words from section echo) into simon. License is as always GPLv3.

IPA to SAMPA: voiced bilabial fricative

Saturday, April 10th, 2010

The simon PLS import process of Ralf's German dictionary is OK (not perfect, but OK).

A few minutes ago, I imported Ralf's Spanish dictionary (version 0.1; November 06, 2009) into simon. The performance of the import process was OK. I had to wait about 30 seconds until the import of this huge dictionary with about 850.000 Spanish words was finished. Maybe it is my new computer. Or maybe simon does have better routines for big dictionaries?

So at the moment, I think that it is not necessary that I reduce the size of Ralf's Spanish dictionary.

What is necessary? The simon developers should improve the IPA to SAMPA conversion. There are the IPA phonemes that should be converted into SAMPA.

Phonemes that are needed by Ralf's Spanish dictionary:

1. Voiced bilabial fricative. IPA-number 127. IPA-text: β. X-SAMPA: B.

I have to stop here. Take a look into the simon shadow dictionary:

spanish-sampa-phonemes

simon converted the β phoneme into correct SAMPA. Here is the corresponding entry from Ralf's Spanish dictionary:

<lexeme>
<grapheme>ababa</grapheme>
<phoneme>aβaβa</phoneme>
</lexeme>

But what is wrong? There are no spaces between the phonemes in aBaBa. So the β (IPA phoneme) has been converted correctly into B (SAMPA phoneme). But where are the spaces? The SAMPA transcription should look as follows: a B a B A

I don’t know why this error occurs. This is the reason why I am stopping here.

Try to train the Polish word “JEDEN”

Thursday, November 19th, 2009

I want to import a small sample dictionary into simon (Sphinx format):

polish-sphinx

The source can be found here (I don’t know how long this link will be valid). The dictionary contains 19 Polish words (US-ASCII). Here is what you have to do next:

universal

1. Select Applications > Universal Access > simon.

import-dictionary

2. Press the Wordlist button.
3. Press Import Dictionary.

shadow-dictionary

4. You can select the target: shadow dictionary or active dictionary. For this Polish example dictionary, choose active dictionary.
5. Press the Next button.

And now it is time to choose the appropriate lexicon format:

import-sphinx

Import the dictionary (with the 19 Polish words; see the screen-shot at the beginning of this post) as SPHINX lexicon.

sphinx-automatic

You have to select the path to the Polish Sphinx dictionary. After pressing the Next button, the following message appears:

finish

The Polish Sphinx dictionary has been imported successfully. Press the Finish button.

Now let’s train a Polish word:

add-polish

a. Select the Polish word JEDEN.
b. Add to Training.
c. Train selected Words.

You can now record the Polish word with simon:

train-polish

(more…)

Error when trying to import wav files

Tuesday, October 13th, 2009

I tried to import prompts 01 (40 wav files plus corresponding prompts file) into simon. I failed several times. I am getting the following message:

importing

I made several changes to the prompts file:
- converted the utterances to uppercase (<xsl:value-of select="upper-case(.)"/>);
- used a prompts file without punctuation;
- used the prompts file without “.wav” (<xsl:value-of select="replace(@src, '.flac','')"/>).

I tried to filter out all possible error sources. Did I forget something important?