<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>testing simon &#187; Ubuntu</title>
	<atom:link href="http://spirit.blau.in/simon/category/ubuntu/feed/" rel="self" type="application/rss+xml" />
	<link>http://spirit.blau.in/simon</link>
	<description>my first steps with the simon speech recognition software</description>
	<lastBuildDate>Tue, 10 Jan 2012 14:59:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Ralf&#8217;s Interlingua dictionary</title>
		<link>http://spirit.blau.in/simon/2012/01/10/ralfs-interlingua-dictionary/</link>
		<comments>http://spirit.blau.in/simon/2012/01/10/ralfs-interlingua-dictionary/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 14:59:26 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[dictionary]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5743</guid>
		<description><![CDATA[This article explains how I create the dictionary, and how the imported result looks like in simon. A. Creation of the PLS dictionary: 1. Get spelling dictionary. 2. License is GPL. It says in the file README_en.txt: This spell check dictionary for Interlingua is licensed under GPL. [...] This hyphenation rules for Interlingua are licensed [...]]]></description>
			<content:encoded><![CDATA[<p>This article explains how I create the dictionary, and how the imported result looks like in simon.</p>
<p><strong>A. Creation of the PLS dictionary:</strong></p>
<p>1. <a href="http://extensions.services.openoffice.org/en/project/dict-ia">Get</a> spelling dictionary.<br />
2. License is <a href="http://www.gnu.org/licenses/gpl.txt">GPL</a>. It says in the file README_en.txt:</p>
<blockquote><p>This spell check dictionary for Interlingua is licensed under GPL. [...] This hyphenation rules for Interlingua are licensed under GPL.</p></blockquote>
<p>This means that I can use this spelling dictionary as source.<br />
3. Extract <a href="http://extensions.services.openoffice.org/en/download/4581">dict-ia-2010-11-29.oxt</a>.<br />
4. ISO 639-1 <a href="http://en.wikipedia.org/wiki/Interlingua">language</a> code is <code>ia</code>.<br />
5. Probably I will <a href="http://en.wikipedia.org/wiki/Interlingua#Interlingua_alphabet">use this table</a> for grapheme to phoneme conversion.</p>
<p>6. Check the encoding of ia_iso.aff and ia_iso.dic. Both files are encoded in ISO 8859-1. Probably it is best if I convert the encoding of both files into UTF-8.<br />
<code>iconv -f ISO-8859-1 -t UTF-8 < ia_iso.dic > interlingua-utf8.dic<br />
iconv -f ISO-8859-1 -t UTF-8 < ia_iso.aff > interlingua-utf8.aff</code><br />
Change the first line in interlingua-utf8.aff into SET UTF-8. Both files contain CRLF at the end of each line (Windows mode). I don&#8217;t know whether this is ok with the unmunch command. I will check it out:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Interlingua$ unmunch interlingua-utf8.dic interlingua-utf8.aff > interlingua-wordlist</code></p></blockquote>
<p>Obviously, it worked. The CRLF is part of the source files. The target file contains just a LF (Unix mode). There are a lot of duplicate entries. I think that these duplicate entries will be removed later by an <code>.xsl</code> script.</p>
<p>7. Add lexicon tags at the beginning and the end of interlingua-wordlist.</p>
<p>8. Create XML file:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Interlingua$ saxonb-xslt -s:interlingua-wordlist -xsl:'http://spirit.blau.in/simon/files/2010/04/create-xml-file.xsl' -o:interlingua.xml</code></p></blockquote>
<p>9. Create PLS dictionary:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Interlingua$ saxonb-xslt -s:interlingua.xml -xsl:'<a href='http://spirit.blau.in/simon/files/2012/01/improve-interlingua.xsl_.zip'>improve-interlingua.xsl</a>' -o:<a href="http://script.blau.in/interlingua-dictionary.xml.bz2">interlingua-dictionary.xml</a></code></p></blockquote>
<p><strong>B. <a href="http://script.blau.in/interlingua-dictionary.xml.bz2">Download</a> the dictionary. Import it into simon.</strong></p>
<p><a href="http://spirit.blau.in/simon/files/2012/01/interlingua.jpg"><img src="http://spirit.blau.in/simon/files/2012/01/interlingua-293x300.jpg" alt="" title="interlingua" width="293" height="300" class="alignleft size-medium wp-image-5746" /></a>The left column contains the words. The pronunciation column contains the corresponding SAMPA transcriptions. The Category column contains just &#8220;Unknown&#8221; entries.
<div style="clear:both"></div>
<p>Now you know how I created the dictionary and how the result looks like in simon.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2012/01/10/ralfs-interlingua-dictionary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ralf&#8217;s Arabic dictionary</title>
		<link>http://spirit.blau.in/simon/2012/01/10/ralfs-arabic-dictionary/</link>
		<comments>http://spirit.blau.in/simon/2012/01/10/ralfs-arabic-dictionary/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 11:55:50 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[dictionary]]></category>
		<category><![CDATA[Arabic]]></category>
		<category><![CDATA[PLS]]></category>
		<category><![CDATA[saxonb-xslt]]></category>
		<category><![CDATA[sed]]></category>
		<category><![CDATA[unmunch]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5726</guid>
		<description><![CDATA[This article explains the creation of an Arabic PLS dictionary and how the result looks like in simon. A. Creation of the dictionary: 1. Get Arabic spelling dictionary. 2. Check the license. Inside the file dict_ar-3.0.oxt there is a file with the name COPYING (in the docs folder). It says in the file: GPL 2.0/LGPL [...]]]></description>
			<content:encoded><![CDATA[<p>This article explains the creation of an Arabic PLS dictionary and how the result looks like in simon.</p>
<p><strong>A. Creation of the dictionary:</strong></p>
<p>1. <a href="http://extensions.services.openoffice.org/en/project/Arabicspellchecker">Get</a> Arabic spelling dictionary.<br />
2. Check the license. Inside the file <a href="http://extensions.services.openoffice.org/en/download/4955">dict_ar-3.0.oxt</a> there is a file with the name COPYING (in the docs folder). It says in the file:</p>
<blockquote><p>GPL 2.0/LGPL 2.1/MPL 1.1 tri-license</p></blockquote>
<p>This means that I can use this tri-licensed spelling dictionary as source for my future GPLv3 PLS dictionary.</p>
<p>3. Now I have to extract <code>dict_ar-3.0.oxt</code>.<br />
4. Let&#8217;s try the <code>unmunch</code> command inside the Ubuntu terminal:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Arabic$ unmunch ar.dic ar.aff > arabic</code></p></blockquote>
<p>It failed. I wasn&#8217;t able to unmunch the word list.<br />
5. I have to remove all numbers from ar.dic. This can be done with the <code>sed</code> command:</p>
<blockquote><p><code>sed 's/[0-9]*//g' ar.dic > arabic-without-numbers</code></p></blockquote>
<p>6. Remove the slash (&#8220;/&#8221;) from arabic-without-numbers with <a href="http://en.wikipedia.org/wiki/Geany">Geany</a>.<br />
7. Add lexicon tags at the beginning and the end of the file.<br />
8. Ubuntu terminal:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Arabic$ saxonb-xslt -s:arabic-without-numbers -xsl:'http://spirit.blau.in/simon/files/2010/04/create-xml-file.xsl' -o:arabic.xml</code></p></blockquote>
<p>9. ISO 639-1 language code is ar.<br />
10. Maybe I will <a href="http://en.wikipedia.org/wiki/Romanization_of_Arabic#Comparison_table">use this table</a> for the grapheme to phoneme conversion.<br />
11. Ubuntu terminal:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Arabic$ saxonb-xslt -s:arabic.xml -xsl:'<a href='http://spirit.blau.in/simon/files/2012/01/improve-arabic.xsl_.zip'>improve-arabic.xsl</a>' -o:<a href="http://script.blau.in/arabic-dictionary.xml.bz2">arabic-dictionary.xml</a></code></p></blockquote>
<p>I have to remove the number sign (&#8220;#&#8221;) with Geany from arabic.xml.</p>
<p><strong>B. <a href="http://script.blau.in/arabic-dictionary.xml.bz2">Download</a> the dictionary. Import it into <a href="http://simon-listens.blogspot.com/">simon</a>.</strong></p>
<p><a href="http://spirit.blau.in/simon/files/2012/01/arabic-pronunciation.jpg"><img src="http://spirit.blau.in/simon/files/2012/01/arabic-pronunciation-271x300.jpg" alt="" title="arabic-pronunciation" width="271" height="300" class="alignleft size-medium wp-image-5736" /></a>The left column contains 457089 Arabic words. The pronunciation column contains the corresponding SAMPA transcriptions. The third column contains just entries with &#8220;Unknown&#8221;. This is because the PLS dictionary contains no <code>role</code> attributes.
<div style="clear:both"></div>
<p>Now you know how I created the dictionary. And you know how the result looks like in simon.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2012/01/10/ralfs-arabic-dictionary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ralf&#8217;s Hebrew dictionary</title>
		<link>http://spirit.blau.in/simon/2012/01/10/ralfs-hebrew-dictionary/</link>
		<comments>http://spirit.blau.in/simon/2012/01/10/ralfs-hebrew-dictionary/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 09:43:30 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[dictionary]]></category>
		<category><![CDATA[Hebrew]]></category>
		<category><![CDATA[PLS]]></category>
		<category><![CDATA[saxonb-xslt]]></category>
		<category><![CDATA[unmunch]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5718</guid>
		<description><![CDATA[In 2009, I made some initial tests with Hebrew. Now it is time to develop a Hebrew PLS dictionary that is much bigger than the sample dictionary from 2009 (which I have deleted). This article explains how I create the dictionary, and how the result looks like when imported into simon. A. Creation of the [...]]]></description>
			<content:encoded><![CDATA[<p>In 2009, I made some <a href="http://spirit.blau.in/simon/2009/09/10/confidence-score-with-hebrew/">initial tests with Hebrew</a>. Now it is time to develop a Hebrew PLS dictionary that is much bigger than the sample dictionary from 2009 (which I have deleted). This article explains how I create the dictionary, and how the result looks like when imported into simon.</p>
<p><strong>A. Creation of the dictionary:</strong></p>
<p>1. <a href="http://extensions.services.openoffice.org/en/project/dict-he">Get</a> Hebrew spelling dictionary from OpenOffice.org.<br />
2. License is <a href="http://www.gnu.org/licenses/gpl-2.0.html">GPL</a>. There is a copyright notice inside the file <code>he_IL.aff</code>.</p>
<p>3. I tried to unmunch the dictionary in the Ubuntu terminal, but unfortunately I failed:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Hebrew$ unmunch he_IL.dic he_IL.aff > hebrew-test</code></p></blockquote>
<p>4. The source file <code>he_IL.dic</code> contains a lot of numbers. I remove them with the Ubuntu terminal:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Hebrew$ <a href="http://www.cyberciti.biz/faq/sed-remove-all-digits-input-from-input/">sed</a> 's/[0-9]*//g' he_IL.dic > hebrew-without-numbers</code></p></blockquote>
<p>With Geany, I remove the &#8220;,&#8221; (commas) and the &#8220;/&#8221; (slashes) that still are included within in the file hebrew-without-numbers. Now I have a clean word list with 43.000 Hebrew words.</p>
<p>5. Add lexicon tags at the beginning and the end of hebrew-without-numbers.<br />
6. Ubuntu terminal:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Hebrew$ saxonb-xslt -s:hebrew-without-numbers -xsl:'http://spirit.blau.in/simon/files/2010/04/create-xml-file.xsl' -o:hebrew.xml</code></p></blockquote>
<p>7. ISO 639-1 <a href="http://en.wikipedia.org/wiki/Hebrew_language">language</a> code is <code>he</code>.<br />
8. I need a table for grapheme to phoneme conversion. Maybe I will <a href="http://en.wikipedia.org/wiki/Hebrew_alphabet#Pronunciation">use this table</a>. There are several tables available at Wikipedia. I am not sure which one I should use. I have an idea: as far as I know, Yiddish and Hebrew <a href="http://spirit.blau.in/simon/2012/01/03/ralfs-yiddish-dictionary/">share the same alphabet</a>. This means I could try to use the Yiddish <a href="http://spirit.blau.in/simon/files/2012/01/improve-yiddish.xsl_.zip">improve-yiddish.xsl</a> style sheet:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Hebrew$ saxonb-xslt -s:hebrew.xml -xsl:'/home/ubuntu/Documents/2011-II/Yiddish/dictionaries/improve-yiddish.xsl' -o:hebrew-dictionary.xml</code></p></blockquote>
<p>The result is that most Hebrew letters have been converted into IPA. There is only one Hebrew letter that hasn&#8217;t been converted: [א] I will add this phone to the <code>.xsl</code> style sheet with the name <code>improve-hebrew.xsl</code>. Now I try it again:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Hebrew$ saxonb-xslt -s:hebrew.xml -xsl:'<a href='http://spirit.blau.in/simon/files/2012/01/improve-hebrew.xsl_.zip'>improve-hebrew.xsl</a>' -o:<a href="http://script.blau.in/hebrew-dictionary.xml.bz2" title="IPA phonetic dictionary, first draft">hebrew-dictionary.xml</a></code></p></blockquote>
<p>The result is not so good: Maybe I should adjust the grapheme to phoneme conversion rules for modern standard Israeli Hebrew. Or is this not necessary? I think for a first draft I can use the Yiddish transformation rules.</p>
<p><strong>B. <a href="http://script.blau.in/hebrew-dictionary.xml.bz2">Download</a> the dictionary. Import it into <a href="http://simon-listens.org/index.php?id=122&#038;L=1">simon</a> as shadow dictionary.</strong></p>
<p><a href="http://spirit.blau.in/simon/files/2012/01/hebrew-SAMPA.jpg"><img src="http://spirit.blau.in/simon/files/2012/01/hebrew-SAMPA-255x300.jpg" alt="" title="hebrew-SAMPA" width="255" height="300" class="alignleft size-medium wp-image-5723" /></a>Take a look at the result: The left column contains 43933 Hebrew words. The pronunciation column contains the corresponding SAMPA transcriptions. The category column is unemployed (or to be more exact: displays just <code>Unknown</code>) since the source PLS dictionary contains no <code>role</code> attributes.</p>
<div style="clear:both"></div>
<p>Now you know how I created the dictionary. And you know how the result looks like in simon. This dictionary uses more or less Yiddish pronunciation because I was too lazy to adjust it to modern standard Israeli Hebrew. It shouldn&#8217;t be a problem to adjust the style sheet <code>improve-hebrew.xsl</code> so that the phoneme results are better.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2012/01/10/ralfs-hebrew-dictionary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ralf&#8217;s Yiddish dictionary</title>
		<link>http://spirit.blau.in/simon/2012/01/03/ralfs-yiddish-dictionary/</link>
		<comments>http://spirit.blau.in/simon/2012/01/03/ralfs-yiddish-dictionary/#comments</comments>
		<pubDate>Tue, 03 Jan 2012 17:05:18 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[dictionary]]></category>
		<category><![CDATA[PLS]]></category>
		<category><![CDATA[saxonb-xslt]]></category>
		<category><![CDATA[unmunch]]></category>
		<category><![CDATA[yiddish]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5663</guid>
		<description><![CDATA[This article explains some details about the creation of the dictionary, and how the result looks like in simon. A. How I create Ralf's Yiddish dictionary: 1. Get spelling dictionary. 2. License is GPLv3. 3. Extract jidysz.net.ooo.spellchecker.oxt. 4. Ubuntu terminal: cd /home/ubuntu/Documents/2011-II/Yiddish/dictionaries sudo apt-get install hunspell-tools unmunch yi.dic yi.aff &#62; yiddish-wordlist 5. Add &#60;lexicon&#62; at [...]]]></description>
			<content:encoded><![CDATA[<p>This article explains some details about the creation of the dictionary, and how the result looks like in simon.</p>
<p>A. How I create <code>Ralf's Yiddish dictionary</code>:</p>
<p>1. <a href="http://extensions.services.openoffice.org/en/project/jidysz-net-ooo-spellchecker">Get</a> spelling dictionary.<br />
2. License is <code>GPLv3</code>.<br />
3. Extract <a href="http://extensions.services.openoffice.org/en/download/4324"><code>jidysz.net.ooo.spellchecker.oxt</code></a>.<br />
4. Ubuntu terminal:<br />
<code>cd /home/ubuntu/Documents/2011-II/Yiddish/dictionaries<br />
sudo apt-get install hunspell-tools<br />
unmunch yi.dic yi.aff &gt; yiddish-wordlist</code><br />
5. Add  <code>&lt;lexicon&gt;</code> at the beginning of yiddish-wordlist. Add <code>&lt;/lexicon&gt;</code> at the end of this file.<br />
6. Generate <code>.xml</code> document with lexicon, lexeme and grapheme elements:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Yiddish/dictionaries$ saxonb-xslt -s:yiddish-wordlist -xsl:'http://spirit.blau.in/simon/files/2010/04/create-xml-file.xsl' -o:yiddish.xml</code></p></blockquote>
<p>7. ISO 639-1 <a href="http://en.wikipedia.org/wiki/Yiddish_language">language code</a> is yi.<br />
8. I think I will use <a href="http://en.wikipedia.org/wiki/Yiddish_orthography#The_Yiddish_alphabet">this table</a> as source for the grapheme to phoneme mapping.<br />
9. Ubuntu terminal:</p>
<blockquote><p><code>ubuntu@ubuntu:~/Documents/2011-II/Yiddish/dictionaries$ saxonb-xslt -s:yiddish.xml -xsl:'<a href='http://spirit.blau.in/simon/files/2012/01/improve-yiddish.xsl_.zip'>improve-yiddish.xsl</a>' -o:yiddish-dictionary.xml</code></p></blockquote>
<p>B. <a href="http://script.blau.in/yiddish-dictionary.xml.bz2">Download the dictionary</a>, and <a href="http://spirit.blau.in/simon/2010/06/17/tutorial-import-german-dictionary/">import</a> it into simon.</p>
<p><a href="http://spirit.blau.in/simon/files/2012/01/yiddish.jpg"><img src="http://spirit.blau.in/simon/files/2012/01/yiddish-243x300.jpg" alt="" title="yiddish" width="243" height="300" class="alignleft size-medium wp-image-5670" /></a>Take a look at the result. The left column contains the Yiddish words. This dictionary contains 99980 words. The right column contains the corresponding SAMPA transcription.<br />
<a href="http://en.wikipedia.org/wiki/Yiddish_language">Yiddish</a> is written in the Hebrew alphabet. The <a href="http://en.wikipedia.org/wiki/Hebrew_alphabet">Hebrew alphabet</a> is written from right to left. Obviously, the corresponding SAMPA transcriptions are written from left to right. This means that the phoneme order should be fine.</p>
<div style="clear:both"></div>
<p>There are a lot of other PLS dictionaries available. <a href="http://spirit.blau.in/simon/import-pls-dictionary/">Find the PLS dictionary</a> that suits your language.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2012/01/03/ralfs-yiddish-dictionary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Import of your Grammar</title>
		<link>http://spirit.blau.in/simon/2012/01/02/import-of-your-grammar/</link>
		<comments>http://spirit.blau.in/simon/2012/01/02/import-of-your-grammar/#comments</comments>
		<pubDate>Sun, 01 Jan 2012 22:14:58 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[German]]></category>
		<category><![CDATA[grammar]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5642</guid>
		<description><![CDATA[In this post I want to write some words about the Grammar / Import function of simon 0.3. Here is what I do: 1. Import Schott&#8217;s German dictionary as active dictionary into simon. 2. Open the Grammar tab. Press the Import button. 3. Simon starts a wizard. Press the Next button. 4. Let&#8217;s try and [...]]]></description>
			<content:encoded><![CDATA[<p>In this post I want to write some words about the Grammar /  Import function of simon 0.3. Here is what I do:</p>
<p>1. Import <a href="http://spirit.blau.in/simon/2011/11/01/schott%e2%80%99s-german-dictionary-0-2-8/">Schott&#8217;s German dictionary</a> as active dictionary into simon.</p>
<p><a href="http://spirit.blau.in/simon/files/2012/01/grammar-01.jpg"><img class="alignleft size-medium wp-image-5643" title="grammar-01" src="http://spirit.blau.in/simon/files/2012/01/grammar-01-300x287.jpg" alt="" width="300" height="287" /></a>2. Open the Grammar tab. Press the <code>Import</code> button.</p>
<div style="clear:both"></div>
<p><a href="http://spirit.blau.in/simon/files/2012/01/grammar-02.jpg"><img class="alignleft size-medium wp-image-5645" title="grammar-02" src="http://spirit.blau.in/simon/files/2012/01/grammar-02-300x202.jpg" alt="" width="300" height="202" /></a>3. Simon starts a wizard. Press the <code>Next</code> button.</p>
<div style="clear:both"></div>
<p><a href="http://spirit.blau.in/simon/files/2012/01/grammar-03.jpg"><img src="http://spirit.blau.in/simon/files/2012/01/grammar-03-300x202.jpg" alt="" title="grammar-03" width="300" height="202" class="alignleft size-medium wp-image-5647" /></a>4. Let&#8217;s try and check the option <code>Also import unknown sentences</code>. I don&#8217;t know whether this is a good decision. So let&#8217;s give it a try.<br />
This is interesting: &#8220;words with more than one terminal&#8221; &#8211; <strong>is it now possible to use more than one entry for the <code>role</code> attribute?</strong> The current version of Schott&#8217;s German dictionary employs just one entry for each <code>role</code> attribute. The PLS standard allows <a href="http://www.w3.org/TR/pronunciation-lexicon/#S4.4">more entries</a>.<br />
Please, download and extract <a href="http://script.blau.in/etc/15000-german-utterances.zip">Schott&#8217;s German utterances</a>. This compressed folder <code>15000-german-utterances.zip</code> contains a plain text file with more than 15000 utterances. I am the author of these utterances, and I have licensed them under the <a href="http://script.blau.in/etc/GPL_License">GPLv3</a>. The utterances are designed to be used in conjunction with Schott&#8217;s German dictionary (former name: Ralf&#8217;s German dictionary). Every word that is included within Schott&#8217;s German utterances should be included in Schott&#8217;s German dictionary, too. I am 99% sure that this is the case, but I can&#8217;t guarantee it. If some words are missing in Schott&#8217;s German dictionary, please inform me, and I will include them within the next version of Schott&#8217;s German dictionary.<br />
You can import my German utterances using simon&#8217;s <code>Import Text</code> option (copy &#038; paste).
<div style="clear:both"></div>
<p>5. The import has been completed. There are a lot of lines that contain the <code>Unknown</code> terminal. Probably it would have been better if I wouldn&#8217;t have checked the option <code>Also import unknown sentences</code> in step 4.</p>
<p><a href="http://spirit.blau.in/simon/files/2012/01/system-monitor-zombie.jpg"><img src="http://spirit.blau.in/simon/files/2012/01/system-monitor-zombie-300x248.jpg" alt="" title="system-monitor-zombie" width="300" height="248" class="alignleft size-medium wp-image-5654" /></a>6. Because simon didn&#8217;t react any more, I forced it to quit. I tried to start simon several times, but it wouldn&#8217;t start. Now there are several simon zombie statuses displayed. I was able to end these zombie / sleeping processes. But at the moment, it seems to be impossible to start simon again (a new zombie / sleeping status is beeing created if I try to start simon again).
<div style="clear:both"></div>
<p><strong>Conclusion:</strong> I don&#8217;t recommend to check the option <code>Also import unknown sentences</code>. I tried the import Grammar function before without checking this option. Simon reacted normal, everything seemed to be fine.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2012/01/02/import-of-your-grammar/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>German speech model &#8216;deef&#8217;</title>
		<link>http://spirit.blau.in/simon/2011/08/01/german-speech-model-deef/</link>
		<comments>http://spirit.blau.in/simon/2011/08/01/german-speech-model-deef/#comments</comments>
		<pubDate>Mon, 01 Aug 2011 19:36:17 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5616</guid>
		<description><![CDATA[Visit Schott&#8217;s German IPA FLAC files (section: deef) [source 1] or Voxforge [source 2]. Get the corresponding speech model [object], and import it into simon 0.3. Watch my video about this speech model: These are the words that were recognized in the video: Randschicht Randproblem Randprobleme Randproblemen Randproblems Randpunkt Randsportart Randstellung Randträger Rechenvorrichtung Randwerbung Randwinkel [...]]]></description>
			<content:encoded><![CDATA[<p>Visit <a href="http://script.blau.in/german/deef.xml">Schott&#8217;s German IPA FLAC files (section: deef)</a> [source 1] or <a href="http://voxforge.org/home/downloads/speech/german-speech-files/german-ipa-flac-files-section-deef#fzA1VS6rGxKikVGyA9lUyg">Voxforge</a> [source 2]. Get the corresponding <a href="http://ubuntuone.com/p/17gg/">speech model</a> [object], and import it into simon 0.3. Watch my video about this speech model:</p>
<p><a href="http://www.youtube.com/watch?v=T3THE8mDZCg"><img src="http://img.youtube.com/vi/T3THE8mDZCg/2.jpg"></a></p>
<p><a href="http://www.youtube.com/watch?v=T3THE8mDZCg">Click here</a> to view the video on YouTube.</p>

<p>These are the words that were recognized in the video:</p>
<blockquote><p>Randschicht Randproblem Randprobleme Randproblemen Randproblems Randpunkt Randsportart Randstellung Randträger Rechenvorrichtung Randwerbung Randwinkel Randträger Randzone Rangabzeichen Rangabzeichens Rangelei Rangfolgen Rangliste Ranglisten Rangordnung Rangordnungen Ranguns Rangstufe Rangstufen Rappe Ratte Rapport Rapporte Rapporten Rapports Rapsfeld Rapsfelder Rapsfeldern Rapsfelds Rasenfläche Rassenkämpfen Rasenloch Rasenplätze Rasenplatz Rasenplätze Rasenplätzen Rasensport Rasenstück Rasentraktor Rasentrimmer Raserei Rasereien Rasierapparat Rasierapparaten Rasierapparats Rasierer Rasierklinge Rasierpinsel Rasierschaum Rasierseife Rasierwasser Rasierzeug Rasmussen Raspe Raspeln Rassenhass Rassenhasses Rassenkampf Rassenkonflikt Rassenkrawall Rassenkunde Rassenkämpfe Rassenkämpfen Rassenmischung Rassenmischungen Rassenproblem Rassenprobleme Rassentrennung Rassenunruhen Rassepferd Rastalocken Rastblech Rastdorn Rastdorne Rasterbild Rasterdecke Rasterdruck Rasterelektronenmikroskop Rasterfahndung Rasterfahndungen Rastergestaltung Rastermaße Rastermaßen Rastermaßes Rastern Rasterpapier Rasterpunkt Rasterpunktabfühlung Rasterpunktlesen Rasters Rasterung Rasterweite Rasterweiten Rastkappe Rastkontakt Rastlappen Rastlosigkeit Rastmechanismus Rastmoment Rastmontage Rastplatz Rastplätze Rastplätzen Rastpunkt Raststätten Rasur Ratifikationen Ratifikationsurkunde Ratifikationsurkunden Ratifizierens Ratifizierung Ratifizierungen Ratifizierungsdebatte Rechnerbaugruppe Ratingen Ration Rationalisieren Rationalisierens Rationalisierung Rationalisierungen Rationalisierungsdruck Rationalismus Rationalist Rationalisten Rationalität Rationen Rationierens Rationierung Rationierungen Ratlosigkeit Rasanz Ratsamkeit Ratsbeschluss Ratsbeschlusses Ratschlag Ratschlags Ratspräsidenten Ratspräsident Ratsmitgliedern Ratsmitglieder Ratsmitglied Ratsherren Ratsche Ratssitzung Ratssitzungen Ratstisch Ratsvorsitz Ratsvorsitzende Randverbinder Rattenfleckfieber Rattenfänger Rattengift Rattenhaus Rattenkönig Rattenloch Rattenloches Rattenplage Rattenschwanz Rattermarken Ratzinger Rauchkammer Rauch Rauchabzug Rauchbombe Rauchens Rauchen Raucher Raucherinsel Rauchern Rauchgas Raucherzimmer Raucherzone Rauchers Rauchfahne Rauchfang Rauchfass Rauchfleisch Rauchgas Rauchgasfilter Rauchgasfühler Rauchgaskanal Rauchgaswäsche Rauchgaszug Rauchgenuss Rauchgenusses Rauchglas Rauchigkeit Rauchkammer Rauchmaschine Rauchmelder Rauchpilz Rauchschwaden Rauchsäule Rauchsäulen Rauchverbot Rauchvergiftung Rauchverzehrer Rauchverzicht Rauchvorhang Rauchwand braven Raudi Rauchpilz Raufbold Rauferei Raufhandel Rauflustigkeit Raufrost Rauheiten Rauheit Rauheitsbeiwert Rauheitsbestimmung Rauhut Raum Raumabtrennung Raumakustik Raumangst Raumanzug Raumaufteilung Raumaufteilungen Raumausstatter Raumbedarfs Raumbelastung Ratsmitgliedern Raumbereich Raumbereichen Raumbereichs Raumbuch Raumfahrtagentur Raumfahrtbehörde Raumtransport Raumtransport Raumfahrtkonzern Raumfahrtkonzerns Raumfahrtprogramm Raumfahrtsparte Raumfahrttechnik Raumfahrzeuge Raumfahrzeuge Raumflug Raumflugs Raumforscher Raumforschung Raumfähre Raumgeräuschpegel Raumgeschwindigkeit Raumgestaltung Raumgewicht Raumgruppe Raumhöhe Rauminhalt Raumkapsel Raumkelle Raumklima Raumladung Raumlufttechnik Raummangel Raummaß Raumordnung Raumordnungsverfahrens Raumpfleger Raumpflegerin Raumplanung Raumprogramm Raumrichtung Raumschiff Raumschiffen Raumschiffes Raumthermostat Raumverhältnis Raumverhältnisse Raumverhältnissen Raumverhältnisses Raumwinkel Raumzeitalter Raute Rauchwaren Raupenantrieb Raupenantrieben Raupenbagger Rauschebart Rauschens Rauschgift Rauschgiftdezernat Rauschgifte Rauschgiftsucht Rauschgiftverbot Rauschgold Rauschgoldengel Rauschmittel Rauschpegel Rauschpegels Rauschunterdrückung Rauschuntergrund Rauschuntergrunds Rausschmeißer Rauschgifts Rausschmisse Raupentrieb Rauchzeichen Randwähler Ravennas Reagenzglases Reagenzgläser Reagenzpapier Reaktionsmöglichkeiten </p></blockquote>
<p>A lot of words were recognized correctly. That is not a bad result.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2011/08/01/german-speech-model-deef/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>sudo make uninstall</title>
		<link>http://spirit.blau.in/simon/2011/08/01/sudo-make-uninstall/</link>
		<comments>http://spirit.blau.in/simon/2011/08/01/sudo-make-uninstall/#comments</comments>
		<pubDate>Mon, 01 Aug 2011 15:43:38 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[Natty Narwhal]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5613</guid>
		<description><![CDATA[A few minutes ago in my Ubuntu terminal: ubuntu@ubuntu:~/Documents/2011-II/speech2text/build$ sudo make uninstall Then download of simon_0.3.0-1ubuntu8_amd64.deb. Then I installed this version with Ubuntu Software Center.]]></description>
			<content:encoded><![CDATA[<p>A few minutes ago in my Ubuntu terminal:</p>
<blockquote><p>ubuntu@ubuntu:~/Documents/2011-II/speech2text/build$ sudo make uninstall</p></blockquote>
<p>Then download of <a href="http://sourceforge.net/projects/speech2text/files/simon/0.3.0/Ubuntu/10.10/">simon_0.3.0-1ubuntu8_amd64.deb</a>. Then I installed this version with Ubuntu Software Center.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2011/08/01/sudo-make-uninstall/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>There is no PPA for Ubuntu 11.04</title>
		<link>http://spirit.blau.in/simon/2011/08/01/there-is-no-ppa-for-ubuntu-11-04/</link>
		<comments>http://spirit.blau.in/simon/2011/08/01/there-is-no-ppa-for-ubuntu-11-04/#comments</comments>
		<pubDate>Mon, 01 Aug 2011 09:22:15 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[Natty Narwhal]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5608</guid>
		<description><![CDATA[There is no PPA for Ubuntu 11.04. How is it possible to install an old PPA (Maverick)? What is the command that I have to enter to install an old PPA?]]></description>
			<content:encoded><![CDATA[<p>There is <a href="http://sourceforge.net/tracker/?func=detail&#038;aid=3316692&#038;group_id=190872&#038;atid=935103">no PPA for Ubuntu 11.04</a>. How is it possible to install an old PPA (<a href="http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/maverick/">Maverick</a>)? What is the command that I have to enter to install an old PPA?</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2011/08/01/there-is-no-ppa-for-ubuntu-11-04/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Export test result with sam</title>
		<link>http://spirit.blau.in/simon/2011/08/01/export-test-result-with-sam/</link>
		<comments>http://spirit.blau.in/simon/2011/08/01/export-test-result-with-sam/#comments</comments>
		<pubDate>Mon, 01 Aug 2011 07:31:30 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[sam]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5600</guid>
		<description><![CDATA[Yesterday, I installed Qwt 6, and then built simon 0.3.60. It was difficult, but in the end it worked out fine. And look, sam offers now an Export test result button (top right of the screen shot): I want to export the following information: Filename, Expected result, Actual result, Recognition rate (below 50%). The resulting [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday, I <a href="http://spirit.blau.in/simon/2011/07/31/remove-the-package-simon/#comment-392">installed Qwt 6</a>, and then built <strong>simon 0.3.60</strong>. It was difficult, but in the end it worked out fine. And look, sam offers now an <code>Export test result</code> button (top right of the screen shot):</p>
<p><a href="http://spirit.blau.in/simon/files/2011/08/export-test-result.jpg"><img src="http://spirit.blau.in/simon/files/2011/08/export-test-result-300x188.jpg" alt="" title="export-test-result" width="300" height="188" class="alignleft size-medium wp-image-5601" /></a>I want to export the following information: Filename, Expected result, Actual result, Recognition rate (below 50%). The resulting document should be a simple text file (or XML file or whatever). Is this possible with the current <code>Export test result</code> function of simon?</p>
<div style="clear:both"></div>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2011/08/01/export-test-result-with-sam/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ppa.launchpad.net &#8211; where is natty?</title>
		<link>http://spirit.blau.in/simon/2011/07/31/ppa-launchpad-net-where-is-natty/</link>
		<comments>http://spirit.blau.in/simon/2011/07/31/ppa-launchpad-net-where-is-natty/#comments</comments>
		<pubDate>Sun, 31 Jul 2011 19:22:27 +0000</pubDate>
		<dc:creator>producer</dc:creator>
				<category><![CDATA[Ubuntu]]></category>
		<category><![CDATA[Natty Narwhal]]></category>

		<guid isPermaLink="false">http://spirit.blau.in/simon/?p=5594</guid>
		<description><![CDATA[This is what I did a few minutes ago: ubuntu@ubuntu:~/Documents/2011-II/speech2text$ sudo add-apt-repository ppa:grasch-simon-listens/simon Then I typed into the Ubuntu terminal: ubuntu@ubuntu:~/Documents/2011-II/speech2text$ sudo apt-get update And then the following message appeared: [...] W: Failed to fetch http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/natty/main/source/Sources 404 Not Found W: Failed to fetch http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/natty/main/binary-amd64/Packages 404 Not Found Then I took a look into http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/. Obviously [...]]]></description>
			<content:encoded><![CDATA[<p>This is what I did a few minutes ago:</p>
<blockquote><p>ubuntu@ubuntu:~/Documents/2011-II/speech2text$ sudo add-apt-repository ppa:grasch-simon-listens/simon</p></blockquote>
<p>Then I typed into the Ubuntu terminal:</p>
<blockquote><p>ubuntu@ubuntu:~/Documents/2011-II/speech2text$ sudo apt-get update</p></blockquote>
<p>And then the following message appeared:</p>
<blockquote><p>[...] W: Failed to fetch http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/natty/main/source/Sources  404  Not Found</p>
<p>W: Failed to fetch http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/natty/main/binary-amd64/Packages  404  Not Found</p></blockquote>
<p>Then I took a look into http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/. Obviously there isn&#8217;t a directory called natty.</p>
<p>I want to get the newest version of simon/sam. Where can I find it?<br />
&#8211;> It is not <a href="http://sourceforge.net/projects/speech2text/files/">here</a>. This version is from October 2010.<br />
&#8211;> At <a href="http://ppa.launchpad.net/grasch-simon-listens/simon/ubuntu/dists/">ppa.launchpad.net/grasch-simon-listens/</a> I find only versions made for lucid and maverick.<br />
&#8211;> I <a href="http://spirit.blau.in/simon/2011/07/31/remove-the-package-simon/">wasn&#8217;t successful with git</a> because some <a href="http://spirit.blau.in/simon/2011/07/31/remove-the-package-simon/#comment-390">problem with qwt 6</a> came up.</p>
<p>I want to see whether the newest version of sam has an <a href="http://sourceforge.net/tracker/?func=detail&#038;aid=3059424&#038;group_id=190872&#038;atid=935106">&#8216;Export test results&#8217; button</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://spirit.blau.in/simon/2011/07/31/ppa-launchpad-net-where-is-natty/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

