Meta Releases AI to Translate Dozens of Languages Using Speech and Text

The SeamlessM4T AI model can understand languages from voice or text and translate using either mode of communication.

By Mack DeGeurin Published August 22, 2023, 12:26 pm ET

Reading time 2 minutes

Meta took a step towards a universal language translator on Tuesday with the release of its new Seamless M4T AI model, which the company says can quickly and efficiently understand language from speech or text in up to 100 languages and generate translation in either mode of communication. Multiple tech companies have released similar advanced AI translation models in recent months.

In a blog post, Meta describes its new translation system as “the first all-in-one multimodal and multilingual AI translation model” capable of speech recognition and speech-to-text translation for nearly 100 different languages. The model can also interpret speech and text and spit back out translated spoken words for 36 and 35 languages respectively. Seamless M4T can also reportedly understand when users change languages mid-sentence, which could help when using a model to translate people who mix parts of languages together when they speak, which language researchers refer to as codeswitching.

“SeamlessM4T is a unified multilingual model, meaning that it doesn’t rely on intermediate models to produce results,” Meta Research Scientist Manager Paco Guzmán told Gizmodo. “Other cascaded systems for spoken translation often do: speech recognition, text translation, text-to-speech generation. SeamlessM4T does it in a single go.”

In a video demo, Guzmán spoke the sentence “our goal is to create a more connected world.” The model quickly recognized the language spoken was English and then translated that into Russian. A computerized Russian voice spat the sentence back out with a more or less human timbre.

Unlike other past translation models, SeamlessM4T uses one single system which Meta believes will ultimately result in reduced errors and delays and increased quality. Meta compared this all-in-one translator approach to the Babel fish universal translator in The Hitchhiker’s Guide to The Galaxy. For now, you won’t have to shove this one in your ear.

Meta is releasing Seamless M4TT under a Creative Commons license so other translators and AI researchers can build off of it. The company is also releasing the metadata of SeamlessAlign, which contains over 270,000 hours of mined speech and text. Meta claims it’s the largest dataset of its kind.

Though much of new AI in recent months has pointed out the unreliability of using large language models for delivering accuratefactual information, language translation is something these models are actually well-suited for. Seamless M4T is made possible by Meta’s previous iterations in translation models. One of those broke new ground by successfully translating the primarily spoken language Hokkien into spoken words, a first for a new model. More recently, the company released its Massively Multilingual Speech system, which Meta claims can provide automatic speech detection and language identification for more than 1,100 languages.

Explore more on these topics

Meta Speech recognition

Share this story

Sign up for our newsletters

Subscribe and interact with our community, get up to date with our customised Newsletters and much more.

Meta Releases AI to Translate Dozens of Languages Using Speech and Text

Sign up for our newsletters

Latest news

The Truth Is Out There, We Need This ‘X-Files’ Lego Set

Iran Claims It ‘Destroyed’ Amazon’s Central Data Infrastructure In Bahrain

Ryan Reynolds Confirms a New ‘Deadpool’ Movie Is in the Works

The AI Copyright Lawsuits Have Finally Produced an Actual Payout

Apple Plans to Defeat RAM Prices by Letting You Lease a Mac

Ozempic Is Quietly Becoming the Go-to Weight Loss Treatment for Teens

‘RoboCop’ Is Officially Making His Return, Thanks to Amazon

The Final ‘Spider-Man: Brand New Day’ Trailer Takes Us Back in Time

Latest Reviews

‘Splatoon Raiders’ Isn’t What the Switch 2 Needs Right Now

Alienware AW3426DW Review: Gaming Monitors Get Thrown a Curveball

Anker Solix S2000 Review: The Little 2kWh Battery That Could

SwitchBot Home Dashboard Review: An E Ink Smart Display for the Weather-Obsessed

Asus ROG Kithara Review: A Huge Gaming Headset With Even Bigger Sound

Geekom A9 Max (2026) Review: Not Much ‘Max’ About It

The Best Budget Laptops Under $1,000 for Back to School

Roborock Saros 20 Review: Jack of All Trades, Master of Most

Related Articles

Meta Releases AI to Translate Dozens of Languages Using Speech and Text

Sign up for our newsletters

The Truth Is Out There, We Need This ‘X-Files’ Lego Set

Iran Claims It ‘Destroyed’ Amazon’s Central Data Infrastructure In Bahrain

Ryan Reynolds Confirms a New ‘Deadpool’ Movie Is in the Works

The AI Copyright Lawsuits Have Finally Produced an Actual Payout

Apple Plans to Defeat RAM Prices by Letting You Lease a Mac

Ozempic Is Quietly Becoming the Go-to Weight Loss Treatment for Teens

‘RoboCop’ Is Officially Making His Return, Thanks to Amazon

The Final ‘Spider-Man: Brand New Day’ Trailer Takes Us Back in Time

‘Splatoon Raiders’ Isn’t What the Switch 2 Needs Right Now

Alienware AW3426DW Review: Gaming Monitors Get Thrown a Curveball

Anker Solix S2000 Review: The Little 2kWh Battery That Could

SwitchBot Home Dashboard Review: An E Ink Smart Display for the Weather-Obsessed

Asus ROG Kithara Review: A Huge Gaming Headset With Even Bigger Sound

Geekom A9 Max (2026) Review: Not Much ‘Max’ About It

The Best Budget Laptops Under $1,000 for Back to School

Roborock Saros 20 Review: Jack of All Trades, Master of Most

Related Articles

Back to School: The 8 Best Alternatives to Buying a TV

The Best Budget Laptops Under $1,000 for Back to School

The Best Tech to Level Up Summer 2026

Meta’s Oversight Board Finds Top AI Models Are Hesitant to Criticize Repressive Governments

Meta Sued For Allegedly Using Discriminatory AI In Layoff Decisions

Smart Glasses Backlash Is Reaching New Celebrity Heights