Anthropic Offers Mythos Upgrade for Cyber Partners and a ‘Safe’ Version for the Rest of You
Key takeaways
- These requests will instead be rerouted to an older AI model, Claude Opus 4.8.
- We just ended up feeling like this was the best product choice for users to get the maximum value out of Fable 5.”
- For now, Penn says that the protective mechanism is built to err on the side of caution, meaning some user queries may be routed to the less capable AI model even if they’re benign.
Why this matters: a development in AI with implications for how people work, create, and decide.
Anthropic CEO Dario Amodei.Photograph: Ludovic Marin/Getty Images Comment Loader Save Story Save this story Comment Loader Save Story Save this story Anthropic released two new AI models called Claude Fable 5 and Claude Mythos 5 on Tuesday, which the company says have greater capabilities than the Mythos Preview model it released in April to a limited set of tech industry partners. Anthropic has said the initial, limited release stemmed from concerns that the model’s capabilities could be exploited by bad actors to develop hacking tools that could catch defenders off guard.
Anthropic is currently only releasing Claude Mythos 5 to a limited set of industry partners, many of which received access to Mythos Preview, and the company says it is collaborating with the US government on the rollout.
Claude Fable 5, which is being publicly released, uses the same underlying model as Mythos 5, but will have “guardrails” in place at launch, the company said Tuesday, that will block the model from answering many user questions related to cybersecurity, biology, and chemistry. These requests will instead be rerouted to an older AI model, Claude Opus 4.8. If Anthropic suspects a user is trying to conduct distillation—training a smaller AI model off a larger AI model’s responses—on Claude Fable 5, those requests will also be rerouted to Claude Opus 4.8, the company says.