Tech News

An Algorithm Generated Eerily Accurate Portraits Based Only On Someone’s Voice

By Melanie Ehrenkranz Published June 7, 2019, 1:10 pm ET

Reading time 3 minutes

Technology can learn a lot about us, whether we like it or not. It can figure out what we like, where we’ve been, how we feel. It can even make us say or do things we’ve never said or done. And according to new research, it can start to figure out what you look like based simply on the sound of your voice.

MIT researchers published a paper last month called Speech2Face: Learning the Face Behind a Voice which explores how an algorithm can generate a face based on a short audio recording of that person. It’s not an exact depiction of the speaker, but based on images in the paper, the system was able to create an image of a front-facing face with a neutral expression with accurate gender, race, and age.

The researchers trained the deep neural network on millions of educational YouTube clips with over 100,000 different speakers, according to the paper. While the researchers note that their method doesn’t generate exact images of a person based on these short audio clips, the examples shown in the study do indicate that the resulting portraits eerily resemble what the person actually looks like. It’s not necessarily similar enough that you’d be able to identify someone based on the image, but it does signal the new reality that even in a rudimentary form, an algorithm can guess—and generate—what someone looks like based exclusively on their voice.

The researchers do address ethical considerations in the paper, namely around the fact that their system doesn’t reveal the “true identity of a person” but rather creates “average-looking faces.” This is to ensure that it isn’t an invasion of privacy. However, the researchers did raise some thorny ethical questions with the type of data they used for their model. One of the individuals included in the dataset told Slate that he didn’t remember signing a waiver for the YouTube video he was featured in that ended up being fed through the algorithm. But the videos are publicly available information, and so legally, this type of consent wasn’t required.

“Since my image and voice were singled out as an example in the Speech2Face paper, rather than just used as a data point in a statistical study, it would have been polite to reach out to inform me or ask for my permission,” Nick Sullivan, head of cryptography at Cloudflare who was used in the study, told Slate.

The researchers also indicate in their study that the dataset that they used isn’t an accurate representation of the world population since it was just pulling from a specific subset of videos on YouTube. It’s therefore biased—a common issue among machine learning datasets.

It’s certainly nice that the researchers pointed out the ethical considerations with their work. However, as advancements in technology go, they won’t always be iterated on and deployed by teams or individuals with good intentions. There are of course a number of ways in which this type of system can be exploited, and if someone figures out a way to create even more realistic depictions of someone based simply on an audio recording, it points to a future in which anonymity becomes increasingly difficult to achieve. Whether you like it or not.

Share this story

Sign up for our newsletters

Subscribe and interact with our community, get up to date with our customised Newsletters and much more.

An Algorithm Generated Eerily Accurate Portraits Based Only On Someone’s Voice

Sign up for our newsletters

Latest news

Xbox Might Have Way to Win Over PlayStation Fans as Sony Ditches Discs

‘Kong x Godzilla: The Ride’ Immerses You in the Monsterverse

How to Watch France vs Spain Livestream Free from Anywhere

New York Issues the Nation’s First Statewide Moratorium on New Large Data Centers

‘The Mandalorian and Grogu’ Is Finally Coming Home

Anker Goes All-In on Soundcore Earbuds, With Space A40 at Record Low as a 5x Cheaper Alternative to AirPods Pro

‘Channel Zero: No-End House’ Holds Up as a Liminal-Space Nightmare

Samsung Dolby Soundbar Is Cheaper Than Off-Brand Speakers at 50% Off, Comes With Wireless Subwoofer

Latest Reviews

The Best Budget Laptops Under $1,000 for Back to School

Roborock Saros 20 Review: Jack of All Trades, Master of Most

You Know What Your Bathroom Needs? A Smart Mirror With Party Lighting

Narwal Freo Z10 Turbo Review: Midrange Vacuum, High-End Performance

X by Xreal a01+ Review: AR Glasses That Are Light on Your Face (and Wallet)

Razer Blade 16 (2026) Review: A Gaming Laptop You Can Actually Call ‘Portable’

Lenovo IdeaPad Slim 5x Gen 11 Review: Solid ARM at a Budget Price

Nothing Ear 3a Review: You Can Skip the Flagship

Related Articles

An Algorithm Generated Eerily Accurate Portraits Based Only On Someone’s Voice

Sign up for our newsletters

Xbox Might Have Way to Win Over PlayStation Fans as Sony Ditches Discs

‘Kong x Godzilla: The Ride’ Immerses You in the Monsterverse

How to Watch France vs Spain Livestream Free from Anywhere

New York Issues the Nation’s First Statewide Moratorium on New Large Data Centers

‘The Mandalorian and Grogu’ Is Finally Coming Home

Anker Goes All-In on Soundcore Earbuds, With Space A40 at Record Low as a 5x Cheaper Alternative to AirPods Pro

‘Channel Zero: No-End House’ Holds Up as a Liminal-Space Nightmare

Samsung Dolby Soundbar Is Cheaper Than Off-Brand Speakers at 50% Off, Comes With Wireless Subwoofer

The Best Budget Laptops Under $1,000 for Back to School

Roborock Saros 20 Review: Jack of All Trades, Master of Most

You Know What Your Bathroom Needs? A Smart Mirror With Party Lighting

Narwal Freo Z10 Turbo Review: Midrange Vacuum, High-End Performance

X by Xreal a01+ Review: AR Glasses That Are Light on Your Face (and Wallet)

Razer Blade 16 (2026) Review: A Gaming Laptop You Can Actually Call ‘Portable’

Lenovo IdeaPad Slim 5x Gen 11 Review: Solid ARM at a Budget Price

Nothing Ear 3a Review: You Can Skip the Flagship

Related Articles

The Best Budget Laptops Under $1,000 for Back to School

The Best Tech to Level Up Summer 2026

Xbox Might Have Way to Win Over PlayStation Fans as Sony Ditches Discs

DHS Cybersecurity Reportedly Has an ‘I’m Sure It’s Nothing’ Problem

Anthropic Says Claude’s Values Are Different Depending on Which Language You’re Using

Flock Says ‘We Hope to Resume’ Work With LAPD After Getting Dropped