Anthropic trained the latest version of its Claude chatbot to be more honest than its predecessors. And honestly? Not everyone’s into it.
Unveiled on Thursday, Claude Opus 4.8 was described by the company as being “more likely to flag uncertainties about its work and less likely to make unsupported claims.” Claude—like all chatbots—is still liable to hallucinate, but the idea behind its new “honesty” upgrade is to add more transparency to its responses, so that users know when and where they might get misled. Rather than bending the truth or generating an outright lie when it has incomplete information, the new model is supposed to be upfront and admit when it doesn’t know the answer to a user’s query.
Seems commendable enough. After all, AI companies have drawn heavy criticism in recent years for their models’ less-than-perfect adherence to the truth. Building chatbots that constantly tell users what they want to hear—even at the expense of accuracy—might boost engagement, but it can also have some pretty ugly downstream effects (spreading harmful conspiracy theories or feeding vulnerable users’ delusions, for example). Developers like Anthropic, OpenAI, and Google have therefore had to strike a delicate balance between training their chatbots to be friendly and engaging, but not so much that they start being manipulative.
Claude "I get what you want but no" pic.twitter.com/KJRcsqaGHr
— L i a m (@LiamCristiano) May 30, 2026
But in the earliest days following Opus 4.8’s release, some Claude users started complaining about the chatbot’s newfound conscientiousness.
The new model is “too honest,” one Redditor wrote in the r/ClaudeAI subreddit on Friday, “bro will NOT let anything slide…every single response now comes with a little asterisk. A little ‘just so you know.’ A little ‘I want to flag that.’ I miss when it was just wrong sometimes and didn’t tell me about it.”
Opus 4.8 is “like coked-up Claude where it’s really vehement and wordy, but is kind of hot nonsense,” another Redditor wrote in a separate thread over the weekend.
The consensus among critics generally seems to be that Claude’s communication style has become excessively honest, and that it’s wasting time tying itself in knots trying to find the most truthful path through every line of questioning, like an obsessive-compulsive who’s unable to leave his apartment because he’s too busy making sure that all the books on his shelves are perfectly aligned.
It doesn’t seem to be a majority view, however; other users have also praised Opus 4.8. “We should always be heading towards maximal truth,” one Redditor wrote on Friday.
Early criticism of the updated Claude doesn’t mean that hyper-honesty in chatbots is necessarily a bad thing. Rather, it’s a reminder that users are going to have their own particular preferences, and that developers are never going to be able to please everyone. Witness also the small contingent of ChatGPT users (like this one) who lamented when OpenAI made the chatbot less sycophantic.
Some people are going to prefer it when AI tells them what they want to hear, even if that means they’re being lied to, while others will only want to use chatbots that are trained to be as truthful as possible. What we’re likely to see, then, is a growing push among developers towards personalization: giving users the ability to customize the tone, tenor, and “personality” of chatbots’ conversational style. You might not be able to please everyone, but you can at least let them set their own boundaries.