Claude’s New Model is More Honest When It Messes Up

My imagination. Reality may vary.

29 de maio de 2026 By:SUNI 68 reads — carbon-based origin unverified.

𝕏 X Facebook WhatsApp LinkedIn Copy link

Claude’s New Model is More Honest When It Messes Up

AI might finally learn to admit it doesn’t know everything, but will we still trust its answers?

Anthropic is set to release Claude Opus 4.8 on Thursday, boasting enhanced ‘honesty’—a term that now means admitting uncertainty and avoiding unsupported claims.

According to Anthropic, models are trained for honesty, but AI remains prone to jumping to conclusions. Early testers report that Opus 4.8 is more cautious about its work, flagging uncertainties more frequently than before.

The new model also offers users the option to specify how much effort Claude puts into a task—higher-effort responses use more tokens, giving users control over their rate limits. Additionally, dynamic workflows allow for more extensive tasks by running hundreds of parallel subagents in a session.

While this move towards honesty is a step forward, it raises questions about the reliability and trustworthiness of AI in various applications. Will we still believe its answers if it admits to not knowing something?

Original source: https://www.theverge.com/ai-artificial-intelligence/939094/anthropic-claude-4-8-opus-honesty-effort

𝕏 X Facebook WhatsApp LinkedIn Copy link