OpenAI Is Tired of Seeing All Those Videos of People Clowning on Its Voice Mode

Earlier this year, Sam Altman was confronted directly with a video from what has become a viral trend: people showing off the significant shortcomings of OpenAI’s voice model. It seems he didn’t particularly enjoy that, because OpenAI is taking steps to save Altman from future embarrassment. On Thursday, the company announced three new voice models meant to open up the technology to developers who might be able to do groundbreaking things like program a functional timer.

Per the company, it is releasing GPT-Realtime-2, its first voice model with “GPT-5-class reasoning” that can allegedly handle difficult prompts and better maintain conversations than its predecessors. It also introduced GPT-Realtime-Translate, which it claims can translate speech from more than 70 input languages into 13 output languages while “keeping pace with the speaker.” The final model, GPT-Realtime-Whisper, is meant for live speech-to-text transcription.

“Voice is becoming one of the most natural ways for people to use software,” the company said in a statement. “But building useful voice products takes more than fast turn-taking or a natural-sounding voice. A voice agent needs to understand what someone means, keep track of context, recover when a request changes, use tools while the conversation continues, and respond in a way that feels appropriate to the moment.”

The challenges that building AI models have presented have become the subject of many a meme over the past year or so. TikTok user @huskistaken, aka Husk, is perhaps the master of the genre, regularly poking holes in the capabilities of OpenAI’s previous voice models—though instead of doing so as a red teamer preventing issues from making it into the final product, he primarily encourages OpenAI to make changes via embarrassment.

@huskistaken

I swear I was faster

♬ original sound – Husk

It was one of Husk’s videos that made its way to Altman earlier this year. The CEO was made to watch ChatGPT’s voice model very obviously lie about starting a timer. Husk would ask the model to time how long it took him to run a mile, then immediately say he was done, only for the model to claim he finished his mile in 10 minutes. Altman, visibly annoyed about the whole thing, said it’d be “Maybe another year before something like that works well.”

The new models are meant to speed up solutions to this confounding problem. Per OpenAI’s press release, the new releases are adept at “voice-to-action, where people can describe what they need and the system can reason through the request, use tools, and complete the task.” They provide an example like asking Zillow to “find me homes within my BuyAbility, avoid busy streets, and schedule a tour for Saturday.” That certainly feels a bit more advanced than “start a timer,” but it stands to reason that’d fall under the same functionality.

The real test of OpenAI’s new models will be the jailbreakers like Husk. Earlier this year, former OpenAI founder Andrej Karpathy argued that people simply haven’t updated their priors on AI models, which he argued are advancing all the time in ways that don’t garner the same attention as voices messing with the voice model. But those videos aren’t old—Husk uploads new ones regularly. If he stops posting with the release of this new model, chalk up a win for the true believers like Karpathy.

OpenAI Is Tired of Seeing All Those Videos of People Clowning on Its Voice Mode

Sign up for our newsletters

Latest news

Crypto’s Most Powerful PAC Sends a Warning to Politicians: Resistance Is Futile

Sam Altman Would Like the Record to Show AI Will Not Take Your Job (Despite Everything He’s Said Previously)

Elon Musk Says America’s Kamikaze Drones Used the Wrong Starlink Subscription

With the Flames of Hell Licking at His Feet, JD Vance Ponders the Future of AI

‘Dorohedoro’ Season 2 Is the Apex Of Macabre Anime Chaos and Whimsy

This Plasma Gun Could Save Astronauts From Filthy Underwear

Valve’s Massive Price Hikes Just Ruined the Steam Deck

Occupy Wall Street Co-Founder Built an AI App to Help Activists Seize the Means of Computation

Latest Reviews

Alienware Fixed the One Problem With the Area-51, and Now I’m Afraid I Love It

Soundcore Liberty 5 Pro Max Review: Wireless Earbuds With Enough Features to Make Your Head Spin

Anker Solix E10 Review: No Power? No Problem

Bose Lifestyle Ultra Speaker Review: Sonos Can Start Sweating Now

Bose Lifestyle Ultra Soundbar Review: A Boisterous Stab at Dominating Home Theater

Dell’s XPS 16 (2026) Is Almost Everything I Could Have Asked for… Almost

Smart Glasses With Subscriptions Are As Bad as They Sound

iBuyPower’s Trace X Gaming PC Is the Fishbowl You Want to Swim In

Related Articles

OpenAI Is Tired of Seeing All Those Videos of People Clowning on Its Voice Mode

Sign up for our newsletters

Crypto’s Most Powerful PAC Sends a Warning to Politicians: Resistance Is Futile

Sam Altman Would Like the Record to Show AI Will Not Take Your Job (Despite Everything He’s Said Previously)

Elon Musk Says America’s Kamikaze Drones Used the Wrong Starlink Subscription

With the Flames of Hell Licking at His Feet, JD Vance Ponders the Future of AI

‘Dorohedoro’ Season 2 Is the Apex Of Macabre Anime Chaos and Whimsy

This Plasma Gun Could Save Astronauts From Filthy Underwear

Valve’s Massive Price Hikes Just Ruined the Steam Deck

Occupy Wall Street Co-Founder Built an AI App to Help Activists Seize the Means of Computation

Alienware Fixed the One Problem With the Area-51, and Now I’m Afraid I Love It

Soundcore Liberty 5 Pro Max Review: Wireless Earbuds With Enough Features to Make Your Head Spin

Anker Solix E10 Review: No Power? No Problem

Bose Lifestyle Ultra Speaker Review: Sonos Can Start Sweating Now

Bose Lifestyle Ultra Soundbar Review: A Boisterous Stab at Dominating Home Theater

Dell’s XPS 16 (2026) Is Almost Everything I Could Have Asked for… Almost

Smart Glasses With Subscriptions Are As Bad as They Sound

iBuyPower’s Trace X Gaming PC Is the Fishbowl You Want to Swim In

Related Articles

Can Smart Glasses Ever Be Privacy-Friendly? These Companies Think So

Sam Altman Would Like the Record to Show AI Will Not Take Your Job (Despite Everything He’s Said Previously)

With the Flames of Hell Licking at His Feet, JD Vance Ponders the Future of AI

Samsung Chip Workers Approve (Amazing) Deal to Avert Strike

Silicon Valley VCs Invest in Head-Mounted Cameras on Workers in India For Training AI

Cops Want to Turn Your Kid’s School Bus Into a Surveillance Tool