<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.developerdotstar.com/community" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>developer.* Blogs - My Experience with Speech Recognition: From Yesteryear to Tomorrow (Part 1) - Comments</title>
 <link>http://www.developerdotstar.com/community/node/232</link>
 <description>Comments for &quot;My Experience with Speech Recognition: From Yesteryear to Tomorrow (Part 1)&quot;</description>
 <language>en</language>
<item>
 <title>Speech Recognition: Do Your Homework</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-970</link>
 <description>&lt;p&gt;Editor&#039;s note: I am closing further comment on this thread since it has unfortunately only become a home for &quot;help me with my homework&quot; messages. (Gunish, I think you could make a nice living helping students with their speech recognition homework assignments.)&lt;/p&gt;
&lt;p&gt;All the best,&lt;br /&gt;
Dan&lt;/p&gt;
</description>
 <pubDate>Mon, 27 Mar 2006 04:19:25 -0800</pubDate>
 <dc:creator>Daniel Read</dc:creator>
 <guid isPermaLink="false">comment 970 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>Speech recognition with ANN + GA</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-968</link>
 <description>&lt;p&gt;Now i&#039;m doing project for final bachelor course of Comp Sci - it is &quot;Speech recognition based on Artificial Neural network and Genetic Algorithm&quot;. I use VC++ for coding, and some main modules are completed, it&#039;s enough to demo. But now i want to build a new module that can be draw spectrum frequency of origion .wav file and after applied FFT. Who has source code of above module, plz help me. plz to me by email &lt;a href=&quot;mailto:letamn@gmail.com&quot;&gt;letamn@gmail.com&lt;/a&gt; . Thanks a lot. Mapcon&lt;/p&gt;
</description>
 <pubDate>Mon, 27 Mar 2006 03:16:55 -0800</pubDate>
 <dc:creator>mapcon</dc:creator>
 <guid isPermaLink="false">comment 968 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>Learn about Speech Recognition in VB Dot NET</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-807</link>
 <description>&lt;p&gt;Plz sir send me detailed info and some idea how to program for speech recognition in VB Dot NET&lt;/p&gt;
</description>
 <pubDate>Wed, 01 Feb 2006 20:51:38 -0800</pubDate>
 <dc:creator>Dawood Khan</dc:creator>
 <guid isPermaLink="false">comment 807 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>speech recognition using VB.net</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-771</link>
 <description>&lt;p&gt;how to write coding for speech recognition using VB.net&lt;/p&gt;
</description>
 <pubDate>Tue, 17 Jan 2006 17:09:51 -0800</pubDate>
 <dc:creator>sarathi</dc:creator>
 <guid isPermaLink="false">comment 771 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>i need urgently</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-755</link>
 <description>&lt;p&gt;hello sir how to recognize speech from engine in vb.net&lt;/p&gt;
</description>
 <pubDate>Tue, 10 Jan 2006 01:25:45 -0800</pubDate>
 <dc:creator>irulapparaj</dc:creator>
 <guid isPermaLink="false">comment 755 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>Thanks! But how ?</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-457</link>
 <description>&lt;p&gt;hey Garima,&lt;/p&gt;
&lt;p&gt;Always Wonder how you never fail to amaze me,There are so many &#039;Hows &#039; that i want answered :&lt;/p&gt;
&lt;p&gt;How are you?&lt;br /&gt;
( Hell i wanna know that )&lt;/p&gt;
&lt;p&gt;How did you find me out?&lt;br /&gt;
(i though we were never gonna communicate again all of our lives!)&lt;/p&gt;
&lt;p&gt;How did u know i got a new job?&lt;br /&gt;
( yeah i got a great job, as a Sr. Lead Software Specialist (.Net)&lt;br /&gt;
at &lt;a href=&quot;http://www.gatesix.com&quot; title=&quot;www.gatesix.com&quot;&gt;www.gatesix.com&lt;/a&gt; the company is not that big but its a great startup. I gotta Build a Entire .NET Development Infrastructure and Team by my self, phew!)&lt;/p&gt;
&lt;p&gt;send me a mail at &lt;a href=&quot;mailto:gunish.chawla@gatesix.com&quot;&gt;gunish.chawla@gatesix.com&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;dont get lost now!&lt;/p&gt;
&lt;p&gt;-Gunish&lt;/p&gt;
</description>
 <pubDate>Tue, 19 Jul 2005 06:28:46 -0700</pubDate>
 <dc:creator>Gunish</dc:creator>
 <guid isPermaLink="false">comment 457 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>When is the Next Part gonne be out!</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-453</link>
 <description>&lt;p&gt;Hi Gunish,&lt;br /&gt;
I found this article very intreaguing, i hope that the next part comes out soon, altough the english was not very flamboyish but still it was worth reading,&lt;br /&gt;
anyways, best of luck with your new job, hope to read more soon!&lt;br /&gt;
Garima&lt;/p&gt;
</description>
 <pubDate>Mon, 18 Jul 2005 06:41:33 -0700</pubDate>
 <dc:creator>Garima</dc:creator>
 <guid isPermaLink="false">comment 453 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>Our (same) boat definately has a HOLE !</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-440</link>
 <description>&lt;p&gt;The reason i chose microsoft technologies over other such as Java is simple, &#039;to avoid reinventing the wheel&#039;. Microsoft has always provided trmendous support over their own stuff than anybody else, really. and yes dealing with API&#039;s is really an complicated issue which certainly requires more than a &#039;schozmo&#039; developer like me! lol. &lt;/p&gt;
&lt;p&gt;Anyways and you are perfectly right about pointing toward Neural Nets as the area under review by maximum developers, weather be it Handwriting recognition or any other human interface parser. The Issue that i have come across is that Neural Nets are STILL not used by Microsoft in doing Speech Recognition in the SASDK&#039;s Telephone English Speech recognition Server, nor the ON-NOTE PDA , tablet pc - handwriting recognition algorithm, my contribution being i am close to perfecting simulation over a neural net model of speech recognition patters, i have constantly employed neural nets in various idea over many years from networking to vision parsing but everything is miserable not even near the mark of acceptable range.&lt;br /&gt;
Instead of giving up i have tried to use them in many places and out of all the applications that can DIGEST a neural net, speech recognition has been the most promising.&lt;br /&gt;
My next part of the article is supposed to talk about how the SASDk works and how this can be considerably improved by employing neural nets to not SYNTHESISES speech but to recognize one ... i am typically using a 3 layer network based on the KOHNEN , GROSSBERG Model of neural nets, its far from acceptable standards but still i do believe that this is how the future models will definitely be built!&lt;/p&gt;
&lt;p&gt;About the language recognition... i would recommend you towards a very insignificant product out on the net but with a very effective algorithm. These are called yahoo Crackers, typically work to steal yahoo ID&#039;s passwords by many techniques such as brute forcing etc. but since the introduction of the &quot;Enter the Letters Printed Above&quot; scheme in almost all the websites , including Developer Dot Star, hackers have started building algorithms that can read  distorted English.. i have definitely come across at least two crackers that were able to constantly Brute Force yahoo by successfully overriding the &quot;Enter the Keywords Printed Above&quot; issue.&lt;br /&gt;
the introduction or curvature such as a 3d- ball beneath the x-y alphabet plane introduces a significant amount of distortion in the alphabets which normal recognizers fail to process. This defiantly is a challenging problem, and results of recognition algorithms only get there half way across... the recognition ratio being around just 53%,&lt;br /&gt;
i am sure this is what you mean in the Chinese language recognition.&lt;br /&gt;
i have plans to work on this issue when i get to the part of implementing Vision in desktops... moreover .... i have once seen a program at The Discover Channel, which showed an Robot At the MIT AI Lab which would &#039;LEARN TO RECOGNIZE and DIFFERENCIATE&#039; between different shapes such as a Ball or a Cube, this was one thing that was not algorithmically implemented but was done on the base of a Self Learning Expert System clubbed with a neural network... the robot eventually tries to GRAB the ball and if the ball moves away, and it is not able to reach it, it extends its arm.. this also caused the robot to realize the length of Itâ€™s OWN arm, a very interesting situation from AI&#039;s point of view other that the classical Sheep Dog Simulation or the PICK up a glass of water and Put it BACK, situation..&lt;br /&gt;
at this point i would certainly like to say that there is no PERFECT Algorithmic way to address this problem, thus we should be working more towards &#039;Writing Programs that can write programs to write Programs&#039; Methodology...&lt;br /&gt;
Indeed this is the only way we can Fill up the hole in our BOAT by coming up with a solution that Microsoft is NOT Interests in developing or investing.&lt;br /&gt;
-&lt;br /&gt;
Gunish&lt;/p&gt;
</description>
 <pubDate>Sat, 02 Jul 2005 11:07:59 -0700</pubDate>
 <dc:creator>Gunish</dc:creator>
 <guid isPermaLink="false">comment 440 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>Thanks for an interesting article</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-439</link>
 <description>&lt;p&gt;I have been pondering a related problem, Gunish: accurately recognizing Chinese handwritten characters.&lt;/p&gt;
&lt;p&gt;In the past, I thought the answer was to raise the &quot;level&quot; of the geometry, where geometry consists of &quot;metric&quot; geometry (study of properties such as length which are not invariant under simple changes), &quot;projective&quot; geometry (study of invariants under projection and perspective) and &quot;topology&quot; (study of invariants under continuous, nonbreaking changes in shape).&lt;/p&gt;
&lt;p&gt;I thought one might have a shot at recognizing handwritten letters in various languages by considering only topological features. The trouble is that (for example) the English letter A is topologically equivalent to O, therefore you need a separate geometry for &quot;things that stick out like limbs&quot; and &quot;things that come to a point with respect to the rest of the letter&quot;.&lt;/p&gt;
&lt;p&gt;At that point it seemed that the recognizer would have to be its own mathematician and dynamically evolve mathematical theories for recognizing variations of a letter.&lt;/p&gt;
&lt;p&gt;It would in fact have the same sort of problems I faced in learning to write Chinese, where I would make embarassing blunders in calligraphy on the new Shenzen subway, pointed out by six year olds. As it happens, for example, the second bar of the symbol for the number three has to be shorter than the first bar.&lt;/p&gt;
&lt;p&gt;Yet, when you examine signage in Hong Kong, the letters are systematically distorted!&lt;/p&gt;
&lt;p&gt;Where we&#039;ve gotten to with respect to both voice and handwriting recognition is the idea of &quot;training&quot; the machine, but consider some kid dumped by his parents at Lo Wu in 1955, who grows up in Hong Kong, and learns to read signage because it&#039;s that, or starve.&lt;/p&gt;
&lt;p&gt;He&#039;s forming scientific theories and testing them.&lt;/p&gt;
&lt;p&gt;A teacher of mine at Princeton, Gil Harman, did work in Lisp on models that would form theories and test them but as far as I know this work is commercialized only the form of neural nets.&lt;/p&gt;
&lt;p&gt;Anyway, certainly sounds like you are on track to something. My only warning, apart from the above considerations, is that relying on Microsoft APIs to be stable over multiple releases of Windows is a bad idea. Microsoft&#039;s policy in the past has been to adopt favored companies, and give them information on changes to APIs, and new APIs, only on condition that they become official Microsoft certified sites.&lt;/p&gt;
&lt;p&gt;This is why there are no API calls at all anywhere in the software for Build Your Own .Net Language and Compiler. Instead, the utilities.DLL and windowsUtilities.DLL do the best they can to provide needed functionality exclusively in terms of the documented, and presumably stable, behavior of Visual Basic .Net.&lt;/p&gt;
&lt;p&gt;This has its own dangers. I use &quot;legacy&quot; character input and output, for example, in file2String and string2File to translate files to and from strings. There&#039;s a chance that in some future release, the functionality exposed by Microsoft.VisualBasic may in some cases be downgraded out of existence if it is inconvenient to support.&lt;/p&gt;
&lt;p&gt;However, wrapping the function in a clear, transparent, Saran wrap like &quot;file2String&quot; makes it obvious that this is the goal, and, the code can be replaced.&lt;/p&gt;
&lt;p&gt;Since Build Your Own, I have started always qualifying ANY reference to functionality that is in Microsoft.VisualBasic with &quot;Microsoft.VisualBasic.x&quot; so as to be able to find possible exposures. Soon, it shall be time to simply remove the reference and the imports even from VB.Net projects because in many cases the functionality is available in more reliable (and, more internationalizable) form elsewhere.&lt;/p&gt;
&lt;p&gt;Or, to junk VB. C# is .Net &quot;equivalent&quot; but VB encourages legacy and USA centric ways of thinking.&lt;/p&gt;
&lt;p&gt;But, I have decided, not before a VB version of Spinoza is complete, because it is quite late in my own personal game, and developing a compiler for a new language is my highest priority.&lt;/p&gt;
&lt;p&gt;You and I are in the same general boat. You are relying on Microsoft software, and I am relying on VB.Net having a future in the global marketplace.&lt;/p&gt;
&lt;p&gt;However, my experience has long been that you can drive yourself batshit by backing up continually to do things &quot;perfectly&quot;.&lt;/p&gt;
</description>
 <pubDate>Sat, 02 Jul 2005 06:41:08 -0700</pubDate>
 <dc:creator>Edward G Nilges</dc:creator>
 <guid isPermaLink="false">comment 439 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>hi,
u must be the same guy w</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-438</link>
 <description>&lt;p&gt;hi,&lt;br /&gt;
u must be the same guy who i met at OIST Software expo... because there also yopu had brought an AI Project that was working in human interaction through speech,well i asked u there itself how did u make it and u said &#039;its a long and booring story&#039; i guesses you didnt want to talk then. i had decided that time that one day i would find you and make you teach me how to work around with AI and speech in particular,so i guess i HAVE found you and now i want you to help me out, post your contact number or something so that i can call you and get in touch with you!&lt;br /&gt;
-&lt;br /&gt;
Love ur work&lt;br /&gt;
Swetha&lt;/p&gt;
</description>
 <pubDate>Sat, 02 Jul 2005 00:44:37 -0700</pubDate>
 <dc:creator>Swetha Medhani</dc:creator>
 <guid isPermaLink="false">comment 438 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>lot of experience with speech</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-437</link>
 <description>&lt;p&gt;lot of experience with speech recognition i see!&lt;br /&gt;
when did u start working on it anyways&lt;br /&gt;
... i think i met you at a software contest hel at NRI Institute ...&lt;br /&gt;
and i am sure that you dont remember me ... but i was preety impressed by your imagination and futuristic vision.&lt;br /&gt;
i hope that you post the next part soon enough so that i can get the complete idea of it !&lt;br /&gt;
-Pooja&lt;/p&gt;
</description>
 <pubDate>Sat, 02 Jul 2005 00:39:49 -0700</pubDate>
 <dc:creator>Pooja</dc:creator>
 <guid isPermaLink="false">comment 437 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>hey this is ankush,, 
nice o</title>
 <link>http://www.developerdotstar.com/community/node/232#comment-436</link>
 <description>&lt;p&gt;hey this is ankush,,&lt;br /&gt;
nice one ... was very informative .... i hope you will be writing the next part soon though!&lt;/p&gt;
</description>
 <pubDate>Sat, 02 Jul 2005 00:37:10 -0700</pubDate>
 <dc:creator>Guest</dc:creator>
 <guid isPermaLink="false">comment 436 at http://www.developerdotstar.com/community</guid>
</item>
<item>
 <title>My Experience with Speech Recognition: From Yesteryear to Tomorrow (Part 1)</title>
 <link>http://www.developerdotstar.com/community/node/232</link>
 <description>&lt;p&gt;I am in no way suggesting that voice recognition systems have become perfect, but what I am implying is that we have started to tread the right path...&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.developerdotstar.com/community/node/232&quot;&gt;read more&lt;/a&gt;&lt;/p&gt;</description>
 <comments>http://www.developerdotstar.com/community/node/232#comment</comments>
 <category domain="http://www.developerdotstar.com/community/taxonomy/term/20">Software Development</category>
 <pubDate>Wed, 22 Jun 2005 11:06:45 -0700</pubDate>
 <dc:creator>Gunish Rai Chawla</dc:creator>
 <guid isPermaLink="false">232 at http://www.developerdotstar.com/community</guid>
</item>
</channel>
</rss>
