DeepMind’s AlphaCode Can Outcompete Human Coders

When it comes to tracking the incremental advances of AI potential, humans have an odd tendency to think in terms of board games we probably haven’t played since childhood. Though there’s no shortage of examples, even recent ones, highlighting AI’s ability to utterly own the cardboard gaming space, those tests only go so far in illustrating the tech’s effectiveness at solving real-world problems.

A potentially far better “challenge” would be to put an AI side by side with humans in a programming competition. Alphabet-owned DeepMind did just that with its AlphaCode model. The results? Well, AlphaCode performed well but not exceptionally. The model’s overall performance, according to a paper published in Science and shared with Gizmodo, corresponds to that of a “novice programmer” with a few months to a year of training. Some of those findings were made public by DeepMind earlier this year.

In the test, AlphaCode was able to achieve “approximately human-level performance” and solve previously unseen problems written in natural language by predicting segments of code and creating millions of potential solutions. After generating that plethora of solutions, AlphaCode filtered them down to a maximum of 10, all of which the researchers say were generated “without any built-in knowledge about the structure of computer code.”
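To make the generate-then-filter idea concrete, here is a minimal Python sketch. It is not DeepMind's code. The names `sample_candidates` and `filter_candidates` are hypothetical, the "model" is a random guesser standing in for a neural network, and the filtering rule (keep only candidates that pass the problem's example tests) is an assumption for illustration, since the description above does not say how the filtering is done.

```python
import random
from typing import Callable, List, Tuple

Candidate = Callable[[int], int]
Example = Tuple[int, int]  # (input, expected output)


def sample_candidates(n: int, seed: int = 0) -> List[Candidate]:
    """Hypothetical stand-in for a model: n random guesses of the form x -> a*x + b."""
    rng = random.Random(seed)
    candidates: List[Candidate] = []
    for _ in range(n):
        a, b = rng.randint(0, 3), rng.randint(0, 3)
        # Bind a and b as defaults so each lambda keeps its own coefficients.
        candidates.append(lambda x, a=a, b=b: a * x + b)
    return candidates


def filter_candidates(
    candidates: List[Candidate], examples: List[Example], limit: int = 10
) -> List[Candidate]:
    """Keep candidates that pass every example test, capped at `limit` (assumed rule)."""
    passing = [c for c in candidates if all(c(x) == y for x, y in examples)]
    return passing[:limit]


# Toy "problem": the examples are consistent with f(x) = 2x + 1.
examples = [(1, 3), (2, 5)]
submissions = filter_candidates(sample_candidates(1000), examples)
print(len(submissions))
```

Most of the 1,000 guesses are wrong and get thrown away; only the few that agree with the examples survive, and at most 10 of those would be submitted. The real system works with full programs and far larger sample counts, but the shape of the pipeline is the same: sample widely, then prune hard.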

AlphaCode received an average ranking in the top 54.3% in simulated evaluations of recent coding competitions on the Codeforces competitive coding platform when limited to generating 10 solutions per problem. Of the problems it did solve, however, 66% were solved with its first submission.

That might not sound all that impressive, particularly when compared to seemingly stronger model performances against humans in complex board games, though the researchers note that succeeding at coding competitions is uniquely difficult. To succeed, AlphaCode had to first understand complex coding problems written in natural language and then “reason” about unforeseen problems rather than simply memorizing code snippets. AlphaCode was able to solve problems it hadn’t seen before, and the researchers claim they found no evidence that their model simply copied core logic from the training data. Combined, the researchers say those factors make AlphaCode’s performance a “big step forward.”

“Ultimately, AlphaCode performs remarkably well on previously unseen coding challenges, regardless of the degree to which it ‘truly’ understands the task,” Carnegie Mellon University, Bosch Center for AI Professor J. Zico Kolter wrote in a recent Perspective article commenting on the study.

AlphaCode isn’t the only AI model being developed with coding in mind. Most notably, OpenAI has adapted its GPT-3 natural language model to create an autocomplete function that can predict lines of code. GitHub also has its own popular AI programming tool called Copilot. Neither of those programs, however, has shown as much prowess competing against humans in solving complex competitive problems.

Though we’re still in the relatively early days of AI-assisted code generation, the DeepMind researchers are confident AlphaCode’s recent successes will lead to useful applications for human programmers down the line. In addition to increasing general productivity, the researchers say AlphaCode could also “make programming more accessible to a new generation of developers.” At the highest level, the researchers say AlphaCode could one day lead to a cultural shift in programming where humans mainly exist to formulate problems, which AIs are then tasked to solve.

At the same time, some detractors in the AI space have called into question the efficacy of the core training models underpinning many advanced AI models. Just last month, a programmer named Matthew Butterick filed a first-of-its-kind lawsuit against Microsoft-owned GitHub, arguing its Copilot AI assistant tool blatantly ignores or removes licenses presented by software engineers during its learning and testing phase. That liberal use of other programmers’ code, Butterick argues, amounts to “software piracy on an unprecedented scale.” The results of that lawsuit could play an important role in determining the ease with which AI developers, particularly those training their models on code previously written by humans, can improve and advance their models.
