Google's Image Recognition Software Can Now Describe Entire Scenes

Image recognition was already good—but it’s getting way, way better. A research collaboration between Google and Stanford University is producing software that increasingly describes the entire scene portrayed in a picture, not just individual objects.

The New York Times reports that algorithms written by the team attempt to explain what’s happening in images—in language that actually makes sense. So it spits out sentences like “a group of young people playing a game of frisbee” or “a person riding a motorcycle on a dirt road.”

It does that using two neural networks: one deals with image recognition, the other with natural language processing. The system uses computer learning, so it’s fed a series of captioned images and it gradually learns how sentences relate to what the image shows. The resulting software is, according to the team, about twice as accurate as any software to have gone before it.

It’s not, however, perfect. Check, for instance, the image above: it often makes small mistakes and, occasionally, it gets things completely wrong. Clearly there’s room for improvement, then, but it’s evident that image recognition is improving apace.

And, perhaps unsurprisingly given Google’s involved, the natural application is in search. Such an algorithm could easily return relevant images when you type in “three cats eating ice cream sundaes in a billiard room” in a way that current technology just can’t manage. And isn’t that what we all want? (Better search, I mean, not the cats. Well, maybe the cats.) [Google Research Blog, Stanford University via New York Times]

Google’s Image Recognition Software Can Now Describe Entire Scenes

Sign up for our newsletters

Latest news

I Gave the Hardest Cryptic Crossword I Could Find to a Bunch of LLMs

This Startup Wants to Build a Better Microwave Beep

Researcher Faces Investigation for Concealing 6-Year-Old Girl’s Death in Gene-Editing Trial

Dario Amodei Says He’s Not Against Open Models, He’s Against Selling Chips to China

Streaming Gaming Handhelds Would Make Sense if They Were Actually Good

Proton VPN Drops to $2.99, With 147+ Countries and Every Netflix Library

The Always-Busy Mike Flanagan May Next Tackle ‘Warhammer 40K’

Astronomers Spotted a Wandering Black Hole at the Edge of Its Galaxy. Then It Found a Snack

Latest Reviews

Framework Laptop 13 Pro Review: The Best Modular Laptop Ever Made

Oura Ring 5 Review: The Best Smart Ring Right Now, and It’s Not Close

Nanoleaf Smart Multicolor Ceiling Light Review: A Paper Plate on Your Ceiling

Dell XPS 13 (2026) Review: Truly the MacBook Neo of PCs

‘Splatoon Raiders’ Isn’t What the Switch 2 Needs Right Now

Alienware AW3426DW Review: Gaming Monitors Get Thrown a Curveball

Anker Solix S2000 Review: The Little 2kWh Battery That Could

SwitchBot Home Dashboard Review: An E Ink Smart Display for the Weather-Obsessed

Related Articles