<![CDATA[Gizmodo: speech]]> http://tags.gizmodo.com/assets/base/img/thumbs140x140/gizmodo.com.png <![CDATA[Gizmodo: speech]]> http://gizmodo.com/tag/speech http://gizmodo.com/tag/speech <![CDATA[Amazon Caves to Snippy Authors: Kindle's Text-to-Voice Feature Now Optional]]> We reported the Author's Guild's litigious anger about the Kindle 2's text-to-voice feature, claiming it violates audio book copyrights. It's an arguable position, but Amazon has gone ahead and caved to the literate man.

Amazon has backed down for the Guild's comfort rather than for any legal motivation, since it's pretty clear that a robotic reading isn't the same as an audio book. "We strongly believe many rights holders will be more comfortable with the text-to-speech feature if they are in the driver's seat," Amazon said.

But instead of removing the feature, Amazon has simply made it optional, so publishers can decide for themselves what's best for their property. That means it's unlikely the much-advertised (though not necessarily much-loved) feature is going to be too useful for Kindle 2 owners. We're not thrilled, since the Guild has no real legal leg to stand on and seems to need Amazon more than Amazon needs them, but then again, we're not fancy-pants authors. [CNET]

I'm a fancy-pants author and I say this is crap! -JC

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5162143&view=rss&microfeed=true
<![CDATA[Microsoft's Stevie Ballmer, Sony's Sir Howie Stringer Headlining CES Keynotes]]> With Bill Gates' part time transition in full effect, Steve Ballmer will be picking up Keynote duties at 2009's CES, but only for the pre show speech. Opening night honors go to the charming and witty Sir Howard Stringer, who is like Sony's CEO answer to Benny Hill. I'd say I'm satisfied with these two as speakers, considering how entertaining they've been on previous occasions. [CES]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5043821&view=rss&microfeed=true
<![CDATA[AT&T Bringing (a Tiny, Frustrating Bit Of) Speech Recognition to the iPhone, Others]]> AT&T's Speech Mashups is a web-based service that will bring voice-activated search to the iPhone, as well as other Edge and 3G handsets. Instead of managing speech recognition on the actual handset, Speech Mashups sends the audio sample to the server, processes it and sends back a text transcription or command to your phone. Unfortunately for iPhone owners, this does not mean voice dialing or speech-to-text app support. Not at all.

AT&T is not currently planning to use this tech to manipulate current iPhone apps (Contacts? Maps? Mail?) but instead will deploy it in web services for a number of net-enabled handsets. This is a somewhat curious choice for AT&T, but it would be difficult to implement system-wide speech recognition without either modifying existing apps or running a (currently disallowed) background service to catch commands. Speech Mashups will be an interesting service for the other handsets it shows up on, but they already have simple voice commands. By building secondary voice capabilities like this for a phone without basic ones, AT&T has inadvertently highlighted one of Apple's most irritating restrictions on iPhone development. [Gadget Lab]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=5028357&view=rss&microfeed=true
<![CDATA[Scientist Studies Cars That Speak and Listen Like KITT]]> KITTdash1.jpgI used to want a car just like KITT when I was a kid, though perhaps without that slightly annoying nasal voice. And now a Stanford scientist is looking at whether nattering to your vehicle may be good for you, the car, other road users and even (darnit) your insurers or advertising execs.

"Your car could know a heck of a lot about you," says Clifford Nass, things like where you go, what speed you drive, what music you listen to. And, of course, companies will pay for that info, but Nass is also wondering if your car could speak back to you, would it result in you driving more safely, or even cheer you up if you were depressed?

Nass' research is interesting to insurers, of course, because if your car could detect that you consistently drove too fast or were often angrily swearing at the traffic while you wrenched at the wheel, they might be tempted to up your premiums. But Nass also sees our future cars being like dumber versions of KITT, behaving more like a co-pilot and actually speaking back to us. Perhaps pointing out that there's a pedestrian in the road ahead, or warning you that you're driving too aggressively.

Some of Nass' results are fascinating:
•Depressed drivers drive better when their car reacts as if it too is depressed
•Cars that speak in complex sentences tend to make you pay attention, rather than zoning out
•Older drivers prefer younger voices
•The car's voice is important: BMW had to recall Series 5 cars because male German drivers didn't listen to its female voice
•Drivers feel closer to the computer if they believe it's embedded in the car, rather than a remote device.

That last one seems like an extension of our personification of cars: I've named every one of my cars, and I bet you guys do too. And while a talking, monitoring, advice-giving car may be some people's idea of a nightmare, you can see that the idea has potential for improving safety: "You can imagine how much it costs for anti-lock brakes. But what if you could simply change the voice in the car?" points out Nass. What do you think, guys? Talking cars FTW? ... or KITT-like cars as nightmare? [Physorg]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=388377&view=rss&microfeed=true
<![CDATA[Computer Learns Baby Talk, Pacifies Programmers]]> Stanford researchers have developed a program that can, at a most basic level, learn language. In studying whether the brain is hard-wired with preset sounds or if it acquires the basics of speech dynamically, a computer program was designed to learn speech. After listening to hours of Japanese and English mother-to-baby talk, the computer was able to learn the basic vowel sounds just as a baby. The computer performed so well that its accuracy was measured between 80-90%, depending on the software architecture (and whether it was "nappie time").

Language acquisition has long been seen as a specialty of the brain—a strength of the species, if you will. To re-create one of our most distinct qualities in a lab setting is either impressive for computers or pitiful for humans. Then again, be comforted knowing that when computers do speak, they'll sound just like Arnold Schwarzenegger, anyway. [arstechnica]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=282242&view=rss&microfeed=true
<![CDATA[SpinVox Winds Your Voicemails into Text For Easy Reading]]> We had a brief chat with a SpinVox co-founder today and he told us all about this speech-to-text service. SpinVox, when integrated with a cellphone or landline provider, can take your voicemail messages and automatically transcribe them into text that gets sent to your email or your phone as a text message.

This is actually a pretty cool service, seeing as other transcription services we've seen are either expensive or strange to use. SpinVox has lined up Cincinnati Bell and Skype, and are working on some deals with major carriers now (no details yet). We know many people who don't bother listening to voicemails because it requires dialing in, pressing buttons, and listening—these are very lazy people.

Other cool SpinVox usages are sending memos and broadcast messages from your phone by calling a number and speaking. Sounds like a great way to send messages to your husband to pick up some tampons.

Product Page [Spinvox]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=244592&view=rss&microfeed=true
<![CDATA[Microsoft Discovers Secret to Speech Recognition]]>

Microsoft [Thanks Eric!]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=237176&view=rss&microfeed=true
<![CDATA[Asus Z801 Karaoke Phone]]> Asus' Z801 Karaoke phone could be the coolest phone we've ever seen. Not for the looks, the design, the amount of megapixels, or even a fast data connection, but for its Karaoke feature. When you play back an MP3 on the Z801, you can sing along and the phone will speech-to-text what you say and display the words right on the phone.

If this feature indeed works the way they describe, this could be the best phone ever. If not, it's just another phone with a 2-megapixel camera that's available only in China.

PhoneDaily [via Mobile Mentalism]

]]>
http://gizmodo.com/index.php?op=postcommentfeed&postId=223060&view=rss&microfeed=true