<![CDATA[Gizmodo: voice recognition]]> http://tags.gizmodo.com/assets/base/img/thumbs140x140/gizmodo.com.png <![CDATA[Gizmodo: voice recognition]]> http://gizmodo.com/tag/voicerecognition http://gizmodo.com/tag/voicerecognition <![CDATA[ASUS Puts Voice Recognition in 20-Inch Touchscreen EeeTop ET2002, Now Car Ready]]> ASUS likes to throw crap against the wall to see what sticks and in this case they put voice recognition into its newest EeeTop. The software is accurate, but I'm not seeing the need for it in a 20-inch PC.

I do have to say the voice recognition seems to be on the ball from the video; it is actually impressive when Sascha, the German chap in the vid, butchers the pronunciation of "Kanye West" (around 3:46 in video) and the computer lady still understands it and plays a song.

But why do you need this voice functionality baked into an all-in-one with such a big screen where you are bound to have your hands somewhat free to control the touchscreen? Though I guess, it could be useful for when in a kitchen before cooking up a MacGyver Chef creation. [Netbook News]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5346048&view=rss&microfeed=true
<![CDATA[Asus Eees Getting Voice Recognition This Year]]> Who knows if it will work, but Asus wants to put "an end to the keyboard." So a variety of Eee products will roll out with voice recognition capabilities later this year.

According to Asus CEO Jerry Shen:

...internally we have one team dedicated to studying voice recognition. Touch and gesture input is universal, whereas language is not universal...The first Eee PC or Eee Top products implementing voice-recognition and features will be ready by Q3/Q4 2009 – with our dedicated development team working with third parties in both Japan and the US and reporting directly to me. So this is something we will see very soon...

Hmm, it's a fine idea for home (if it works). But I'd rather not be talking to my netbook at Starbucks. [TechRadar]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5181923&view=rss&microfeed=true
<![CDATA[Google Adding Advanced Voice Search to the iPhone]]> Google has created an app for the iPhone that will give the handset advanced voice recognition, reports John Markoff from the NY Times. The app can answer location related questions (Finding the nearest Starbucks), give driving directions, respond to generic questions, and even search local data from the address book.

It works by recording a soundbite, uploading it to Google's servers, which will crunch the data and return an answer "within seconds on a fast wireless network". Saul compared the function to that offered on Yahoo's, and found Google's to be more accurate, but still return junk results sometimes. AT&T and Microsoft also have handset voice recognition beyond simple dialing that many other phones offer. The app will be free and might be available to download as soon as Friday. [NYT]

Related: Gizmodo's Essential Iphone Apps

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5086741&view=rss&microfeed=true
<![CDATA[Microsoft to Develop Voice-Recognition iPhone App Via Subsidiary Tellme]]> Voice recognition is one if the iPhone gaping-est of holes, but it will soon be plugged by none other than Microsoft. But before you look forward to putting your multitouching fingers all over those glorious nine letters on your iPhone screen, know that MS is only indirectly involved via Tellme, a company they acquired in early 2007 which has a similar app for Blackberries already. But it's a start. And the app looks like it would be pretty useful.

Press a button to start recording and say what you're searching for—the name of a business, for instance, and Tellme will search and locate matches in your vicinity. Great for when you can't stop and type in locations manually. Unfortunately the app doesn't appear to access your contacts for good ol' standard voice-dialing (at least in the BlackBerry version), so that's something we'll still have to wait for. Along with mobile MS Office for iPhone [Yahoo News/CNET]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5056274&view=rss&microfeed=true
<![CDATA[BlackBerry Pearl Gets Google Maps with Voice Search]]> Google has just released a new version of their maps application that supports voice recognition search on the BlackBerry Pearl. That means Pearl users can load Google Maps, hold a side button, say "pizza, i said PIZZA...PEE ZZ UH" and have the application magically find tacos in your immediate vicinity. This upgrade doesn't look to be automatic, so hit the link to have Google text message your phone for the "experimental" update. [Google Maps via Gear Diary]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5021821&view=rss&microfeed=true
<![CDATA[New Yorker: Why We Won't Have Fully Conversational Robots]]> John Seabrook wrote a recent feature in The New Yorker about interactive-voice-response systems (I.V.R.) commonly used with customer service and tech support telephone hotlines. Seabrook spent time at B.B.N. Technologies watching these systems transcribe callers' words and analyzing the tone of voice for emotions present. While breaking down the history of automated telephone services and voice recognition innovations, he attempts to tackle the larger question of whether or not we can create a fully conversational, quasi-conscious robot, akin to 2001: A Space Odyssey's Hal 9000. Judging from the number of experts interviewed for the piece, the answer is a resounding no.

  • While machines that could accurately reproduce the sound of human speech, such as Wolfgang von Kempelen's talking head, have been around since the late 1700s, no device has been able to learn the syntactical rules necessary for generating conversation.
  • Secondly, the act of hearing and interpreting is more difficult to instill in a machine because of the on-the-fly signal processing that would be required. The complexity of the ear allows it to pick up on the most subtle nuances in sound (according to the article, people can distinguish between hot and cold coffee just by hearing it poured into a glass.
  • Roger Schank is a philosopher-programmer who has spent his professional live trying to create a conscious computer that not only has a memory, but can also learn. After years in the field, Schank is skeptical it will ever happen. He says replicating idle chatter and the sheer complexity of speech in general is beyond the abilities of current scientists.
  • Steven Pinkner, a Harvard cognitive scientist, says that natural speech could rely on the breadth of one's knowledge, which is "extraordinarily difficult" to endow to a computer.
  • R&D efforts in speech recognition began in the 1950s and '60s, but researchers are still hung up on the number of ways to communicate the word yes. Speech engineers for Nuance found that Southerners in the U.S. tend to add "sir" or "ma'am" to responses where as Northerners do not. And "Valley Girl" speak tends to make computers interpret declarative statements as questions.
  • Finding it difficult to make a computer able to "learn," scientists turned to brute-force computing and algorithms that relied upon mass amounts of data. But in 1969, high-ranking Bell Labs staffer John Pierce wrote that a speech machine that could recognize, but not understand, was utterly pointless.
  • The big emphasis on speech recognition has now moved to emotional analysis, which still uses algorithms to estimate a caller's state of mind. Stanford researcher Elizabeth Shriberg says its impossible to compare emotions in acted speech to emotions in real speech. The escalation of anger, for example, happens in smaller, more subtle increments with authentic speech.
  • The most promising breakthrough in emotional recognition is an agression detector that has been deployed through out parts of Europe. Sound Intelligence were able to recreate the processes of the inner ear on a computer, which spawned a device that could learn the sounds of different objects in action and identify them. The Dutch city of Groningen has placed this technology in its pubs, where if it detects excessively aggressive speech in the pub, it will alert the nearest police station. But as Seabrook comments, "This is no HAL."
  • Other research labs, like the Speech Analysis and Interpretation Laboratory, have turned to facial recognition to glean emotional insight, but have come up dry. "Emotions aren't discrete," lab chief Shrikanth Narayanan told the New Yorker. "They are a continuum, and it isnt clear to any one perceiver where one emotion ends and the other begins." To add insult to injury, there hasn't been any real demand for emotional recognition outside the call center arena.

So while we might not ever see a robot become a Nobel Laureate, there is one lession to be learned from this New Yorker piece — never talk freely while on hold with customer service. Seabrook learned while at B.B.N. Technologies that they still record the call while you're on hold to assess your emotional state. After a profanity-laced tirade, one annoyed caller took a couple of hits from his bong, waited a little longer, and hung up. [The New Yorker]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5019369&view=rss&microfeed=true
<![CDATA[Asimo Understands Multiple People Yelling At Once, Has Future on Wall Street]]> Honda research engineers have given Asimo the ability to understand three voices at once, thanks to an array of eight microphones that can recognize each voice individually. The recognition software, HARK, can process the commands with 70-80 percent accuracy and the microphones are placed all over Asimo's head and body for spatial recognition purposes. The current application for this technology is using Asimo as a judge for verbal Paper-Rock-Scissors, where everyone calls out their answer at once, and Asimo decides who said what, and who wins. Though a great technical feat, this feels less glamorous than the robot's stint as orchestra conductor, no? [New Scientist]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5015260&view=rss&microfeed=true
<![CDATA[In Car, Hands-Free Text Messaging Using Microsoft Sync]]> Microsoft has teamed up with voice recognition software firm Nuance to develop its own hands-free text software for use with Microsoft Sync. The software would allow users to dictate SMS messages and possibly emails later on down the line. This would undoubtedly result in far less cellphone-related accidents, not to mention instances where I flip off dudes in Fords. Expect the software to be available in Ford models already equipped with Sync starting in early 2008. [Tech.co.uk]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=308429&view=rss&microfeed=true
<![CDATA[ThePudding.com Phone Service Listens to Your Calls, Makes You Watch Ads]]> It sounds like a double-whammy of a bad idea: a free phone service that determines which ads to target to you by applying speech-recognition to all your conversations. To make things worse, the home page of ThePudding.com insults potential customers by saying it's "a breakthrough technology that makes your phone calls interesting." Hey, my phone calls are a thrill a minute.

Although it will offer service, ThePudding isn't trying to claim a piece of the pie that Skype, Vonage and the cable companies have been wrassling over for years. According to the AP, it hopes to "license its speech-recognition service to other companies that use Voice over Internet Protocol." But AP tech writer Peter Svensson had mixed results when testing ThePudding's speech recognition:

"Relevant ads appeared when this reporter talked about restaurants and computers, but the software was oddly insistent that he should seek a career as a social worker, showing multiple ads and links pointing to that field."
The description of the service inspires such Kubrickian paranoia, I could have just as easily used that classic image of Alex strapped to the chair, eyelids peeled back with clamps. Welcome to the future, my little droogies. AP]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=302929&view=rss&microfeed=true
<![CDATA[UPDATED: iPhone Speech Recognition Demo from VoiceSignal]]>
We've heard of VoiceSignal speech recognition for lots of other phones, but now VoiceSignal sent us a video that allegedly shows it working for the first time on the iPhone. According to the guy in the clip, a couple of VoiceSignal engineers designed this app, but all we see it doing so far is controlling music on the iPhone.

It's for real (see update below). Sure will be nice to be able to use speech commands with the iPhone, telling it to call so-and-so on those mic-equipped earbuds while keeping the phone in the pocket.

UPDATE: We got this exclusive info from Chris LeBlanc at VoiceSignal: "It works just like our other apps, so it's speaker independent and needs no training at all to recognize names, numbers and general speech. We will demo the continuous voicemode on the iPhone soon, I've already seen it — and it's nearly ready." Chris has promised us a demo copy, so we'll give you a first look as soon as it arrives. [VoiceSignal]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=292344&view=rss&microfeed=true
<![CDATA[Tinkering with Voice-Controlled Helicopters]]> Those crazy Brits are flying voice-controlled helicopters now, using a Direct Voice Input system by QinetiQ that lets pilots fly their Gazelle helicopters by simply yelling at them. So far the makers claim 90% "effective speech control" of the helicopter's "non-safety critical avionic functions," but we're just wondering what avionic functions on a helicopter aren't safety-critical. And let's see—to shoot the guns, you yell "bang!", right?

The system is speaker-independent, so it doesn't need to be trained for each pilot's particular voice or way of speaking. Now this means that backseat drivers can actually have some real power. Let's just hope carmakers aren't tempted to try this shit.

Speech recognition technology allows voice control of aircraft systems [Gizmag]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=271419&view=rss&microfeed=true
<![CDATA[Windows Vista Voice Recognition Crashes and Burns]]>

What happens when a Microsoft employee demos the speech recognition in Windows Vista before it's ready? Check it out for yourselves. It's not a pretty sight.

Our buddies at Jalopnik pointed us to the flakey Windows Vista voice recognition demo shown at a recent event. We tried to record it off of CNBC but our computers melted over the weekend, luckily not in the Dell way.

Update: Reader Guillermo tells us that the engineer who's "responsible" for the bug that caused the demo meltdown posted an update on his site.

Fun with Vista's speech recognition [Guardian Technology Blog]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=191020&view=rss&microfeed=true