business

Anthropic’s Claude Fable 5 plays it too safe on safety, developers say

Fast Company · Jun 11, 2026, 9:16 PM · Also reported by 4 other sources

Anthropic on Tuesday launched Claude Fable 5, its most capable public model. But within two days, users began reporting that its safety system was blocking benign or legitimate prompts. Fable 5 is the first public model derived from Anthropic’s Mythos family, whose original iteration showed unusual skill during training at finding software bugs and exploiting them to disrupt or take control of systems. That raised enough concern inside Anthropic that the company grouped cybersecurity with other high-risk domains, including biology and chemistry, when setting limits on Mythos-derived public models. For Fable 5, that means prompts flagged as sensitive in those areas are routed to Claude Opus 4.8, a less capable model with its own guardrails. Anthropic says the fallback affects about 0.05% of queries and notifies users when it happens. But reports of false positive reports quickly mounted. That’s because Anthropic erred on the side of caution when it designed the classifiers used to detect and downgrade potentially dangerous uses of its model. It was also challenged to balance accuracy with transparency. Try telling that to developers. Across social media, people have complained aboutClaude Fable 5 rejecting queries about everything from RNA sequencing data for sheep to résumé editing, to shopping lists. “The word ‘cancer’ is flagged as a biosecurity risk by Claude Fable 5!” said scientist Derya Unutmazon X. “Our Anthropic overlords deciding which prompts the peasants are allowed to use.,” added founder and developer Bojan Tunguz on X. Anthropic now says it’s working on the problem. “A hidden safeguard is harder to probe and work around,” Anthropic says in a statement emailed to Fast Company. “This means the safeguards can be targeted much more narrowly. A visible safeguard needs to cast a wider net to be more robust, resulting in more requests being incorrectly flagged.” “We made the wrong tradeoff and we apologize for not getting the balance right,” the company

Article preview — originally published by Fast Company. Full story at the source.

Read full story on Fast Company → More top stories

Also covered by

Fortune Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers The Verge Anthropic apologizes for invisible Claude Fable guardrails Decrypt The Internet Is Furious at Anthropic After Claude Fable 5 Release Decrypt Anthropic Apologizes for Claude Fable 5 Secret Censorship—But the Fix Has a Catch

Aggregated and edited by the Scoop newsroom. We surface news from Fast Company alongside other reporting so you can compare coverage in one place. Editorial policy · Corrections · About Scoop

Anthropic’s Claude Fable 5 plays it too safe on safety, developers say

Also covered by

More in business