If We Want Robots to Be Good, We May Need to Destroy Their Self-Confidence

We’ve all worried about artificial intelligence reaching a point at which its cognitive ability is so far beyond ours that it turns against us. But what if we just turned the AI into a spineless weenie that longs for our approval? Researchers are suggesting that could be a great step towards improving the algorithms, even if they aren’t out to murder us.

In a new paper, a team of scientists has begun to explore the practical (and philosophical) question of how much self-confidence AI should have. Dylan Hadfield-Menell, a researcher at the University of California, Berkeley, and one of the authors of the paper, tells New Scientist that Facebook’s newsfeed algorithm is a perfect example of machine confidence gone awry. The algorithm is good at serving up what it believes you’ll click on, but it’s so busy deciding whether it can get your engagement that it doesn’t ask whether it should. Hadfield-Menell feels that the AI would be better at making choices and identifying fake news if it were programmed to seek out human oversight.

To put some data behind this idea, Hadfield-Menell’s team created a mathematical model they call the “off-switch game.” The premise is simple: a robot has an off switch and a task; a human can turn off the robot whenever they want, but the robot can override the human only if it believes it should. “Confidence” could mean a lot of things in AI. It could mean that the AI has been trained to assume its sensors are more reliable than a human’s perception, and that if a situation is unsafe, the human should not be allowed to switch it off. It could mean that the AI knows more about productivity goals and that the human will be fired if the process isn’t completed. Depending on the task, it will probably mean a ton of factors are being considered.

The study doesn’t come to any conclusions about “how much” confidence is too much; that really has to be decided case by case. It does lay out some theoretical models in which the AI’s confidence is based on its perception of its own utility and its lack of confidence in human decision making.
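
To make that trade-off concrete, here is a toy sketch of the idea in Python. It is not the team’s actual model: the three options (act, switch off, defer to the human), the payoff rules, and the `human_error_rate` parameter are all simplifying assumptions made for illustration. The robot holds a set of guesses about how useful its task is, and compares acting on its own, shutting down, and letting a possibly mistaken human decide.

```python
def expected_values(belief_samples, human_error_rate=0.0):
    """Score three robot policies against the robot's own belief.

    belief_samples: hypothetical guesses at the utility of the robot's task
        (positive = helpful, negative = harmful). Illustrative only.
    human_error_rate: assumed probability that the human makes the wrong
        call about the off switch. Illustrative only.
    """
    n = len(belief_samples)

    # Policy 1: override the human and act. Payoff is the task's utility.
    act = sum(belief_samples) / n

    # Policy 2: switch itself off. Nothing gained, nothing lost.
    switch_off = 0.0

    # Policy 3: defer. A correct human lets the task run only when it is
    # helpful (payoff max(u, 0)); a mistaken human does the opposite
    # (payoff min(u, 0)).
    defer = sum(
        (1 - human_error_rate) * max(u, 0.0) + human_error_rate * min(u, 0.0)
        for u in belief_samples
    ) / n

    return {"act": act, "switch_off": switch_off, "defer": defer}


def best_policy(belief_samples, human_error_rate=0.0):
    """Return the name of the policy with the highest expected value."""
    values = expected_values(belief_samples, human_error_rate)
    return max(values, key=values.get)


if __name__ == "__main__":
    uncertain_belief = [-1.0, 2.0]  # the robot is unsure whether the task helps
    for error_rate in (0.0, 0.5):
        print(error_rate, best_policy(uncertain_belief, error_rate))
```

In this sketch, a robot that is unsure about its own usefulness does best by deferring to a reliable human, while the same robot paired with an unreliable human starts to prefer overriding. That mirrors the two ingredients described above: the machine’s view of its own utility and its doubts about human decision making.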

The model allows us to see some hypothetical outcomes of what happens when an AI has too much or too little confidence. But more importantly, it’s putting a spotlight on this issue. Especially in these nascent days of artificial intelligence, our algorithms need all the human guidance they can get. A lot of that is being accomplished through machine learning, with all of us acting as guinea pigs while we use our devices. But machine learning isn’t great for everything. For quite a while, the top search result on Google for the question “Did the Holocaust happen?” was a link to the white supremacist website Stormfront. Google eventually conceded that its algorithm wasn’t showing the best judgment and fixed the problem.

Hadfield-Menell and his colleagues maintain that AI will need to be able to override humans in many situations. A child shouldn’t be allowed to override a self-driving car’s navigation systems. A future breathalyzer app should be able to stop you from sending that 3 AM tweet. There are no answers here, just more questions.

The team plans to continue working on the problem of AI confidence, using larger datasets from which the machine can make judgments about its own utility. For now, it’s a problem that we can still control. Unfortunately, the self-confidence of human innovators is untameable.

[Cornell University via New Scientist]
