The Media Copilot

How AI is changing Media, journalism and content creation

Anthropic restores Fable 5 access, launches Sonnet 5, as Washington scrutiny deepens

The new models show improved cyber-evaluation results despite not being trained for security tasks, but red flags around the underlying safeguards remain a live legal and policy issue in Washington

Anthropic's relationship with the Pentagon has been strained since a "supply chain risk" designation in early 2026. (Credit: NazeerArt - stock.adobe.com)
Jul 1, 2026

By Romy Abu-Fadel

Anthropic restored global access to Claude Fable 5 on Wednesday, one day after launching Claude Sonnet 5, closing out an 18-day export-control suspension that had cut off both of the company’s newest flagship models since mid-June. Anthropic also restored access to the more powerful Mythos 5 model for a set of U.S. organizations approved under its Glasswing cybersecurity program.

The back-to-back announcements come at a sensitive moment for the company. Increasingly capable models, such as OpenAI’s GPT-5.6 family, are drawing heightened scrutiny from the U.S. government over national security and cybersecurity concerns.

The suspension began June 12, when the U.S. government applied export controls to Fable 5 and Mythos 5 after Amazon researchers reported a technique that allowed Fable 5 to identify software vulnerabilities and, in one case, produce code demonstrating how to exploit one.

Anthropic said it had found that several less-capable models, including Opus 4.8 and OpenAI’s GPT-5.5, could replicate the same behavior, and that the bypass didn’t expose any capability unique to Mythos 5. The company nonetheless spent the following weeks working with the government to fix it, training a new safety classifier that it says blocks the specific technique “in over 99% of cases.”

Anthropic acknowledged the new classifier could flag more benign coding and debugging requests as it errs toward caution. Newsrooms and media teams running Claude Code or agentic research workflows should expect occasional false-positive refusals on security-adjacent tasks going forward.

Fable 5 is rolling out today across Claude Platform, Claude.ai, Claude Code and Claude Cowork. Subscribers on Pro, Max, Team and select Enterprise plans can use it for up to half of their weekly usage limits through July 7. After that, access will shift to usage credits.

Sonnet 5, launched late on June 30, is Anthropic’s middle ground between capability and safety. According to the company, the model performs close to the former top model, Opus 4.8, but at a lower price: $2 per million input tokens and $10 per million output tokens. The price will rise to $3 and $15, respectively, on Sept. 1.
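
For teams budgeting around the Sept. 1 change, the arithmetic can be sketched in a few lines of Python. This is a rough illustration, not an official calculator: the per-million-token rates are the ones quoted above, while the monthly token volumes, the `Rates` class and the `cost_usd` function are hypothetical names and numbers chosen for the example.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Rates:
    """Price in U.S. dollars per million tokens."""
    input_usd: float
    output_usd: float


# Rates as quoted in the article.
LAUNCH_RATES = Rates(input_usd=2.0, output_usd=10.0)
SEPT_1_RATES = Rates(input_usd=3.0, output_usd=15.0)


def cost_usd(input_tokens: int, output_tokens: int, rates: Rates) -> float:
    """Return the total cost of a workload at the given rates."""
    total = input_tokens * rates.input_usd + output_tokens * rates.output_usd
    return total / TOKENS_PER_MILLION


# Hypothetical monthly workload for a small newsroom (assumed figures).
monthly_input_tokens = 40_000_000
monthly_output_tokens = 5_000_000

before = cost_usd(monthly_input_tokens, monthly_output_tokens, LAUNCH_RATES)
after = cost_usd(monthly_input_tokens, monthly_output_tokens, SEPT_1_RATES)
increase = (after - before) / before

print(f"Launch pricing:  ${before:,.2f} per month")
print(f"Sept. 1 pricing: ${after:,.2f} per month")
print(f"Increase:        {increase:.0%}")
```

Because the input and output rates both rise by the same proportion (from $2 to $3 and from $10 to $15), the bill for any fixed mix of input and output tokens goes up by 50%, whatever the workload.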

Security concerns

But the same capabilities that make AI systems more useful can also make them more dangerous, raising questions about whether safety measures are keeping pace with rapidly advancing model performance.  

In its announcement, Anthropic sought to reassure users that Fable 5 has adequate security measures and Sonnet 5 poses a relatively low risk. The company said the Sonnet model is better than its predecessor at refusing malicious requests and resisting prompt injection attacks, a common technique used to manipulate AI systems into bypassing their safeguards. 

However, Anthropic said that while Sonnet 5 exhibited fewer undesirable behaviors overall than Sonnet 4.6, it showed higher rates of misaligned behavior than both Opus 4.8 and Claude Mythos Preview, which have stricter safety controls.

“Sonnet 5 was never able to develop a full working exploit, but it does show a slightly higher rate of partial success than Sonnet 4.6. This latter change is likely due to improvements in general intelligence rather than specific training,” Anthropic said in its press release. 

These results suggest that improvements in general reasoning and problem-solving abilities may also increase the model’s capacity to assist with offensive cyber activities, although the company emphasized that Sonnet 5 was unable to develop a complete exploit for Firefox vulnerabilities.

The findings reflect a broader concern among governments and security researchers: AI models do not necessarily need specialized cybersecurity training to become more useful to attackers. As reasoning and problem-solving abilities improve, models may naturally become more effective at identifying vulnerabilities, generating attack strategies and assisting with technical exploitation.

“Because we judged that the overall level of cybersecurity risk from Sonnet 5 was low, the safeguards are less strict than those launched with Fable 5, which block a much wider range of cybersecurity tasks,” Anthropic said. 

The company said Mythos 5, which is still restricted to a small set of companies and organizations in Glasswing, “can be used to find and exploit software vulnerabilities more effectively than any other model — and all but the most skilled human security experts.” 

Fable 5, however, launched with what Anthropic called the strongest safeguards it has ever applied to a model, after doubling its safety research staff in the month before launch. Researchers from the Commerce Department’s Center for AI Standards and Innovation tested both the original and updated safeguards and, Anthropic said, “agree that they are extraordinarily strong.”

U.S. government collaboration

With the lifting of the Mythos and Fable restrictions, Anthropic is further deepening its cooperation with the U.S. government. This marks an abrupt turnaround for the company, which has had a turbulent relationship with the Trump administration since the Pentagon labeled the company a supply chain risk in late February. The feud arose over Anthropic’s opposition to the use of its Claude models for mass domestic surveillance or fully autonomous weapons systems. 

Reuters reported that Commerce Secretary Howard Lutnick said in a letter to Anthropic that the company would work with the government on safety protocols for Mythos, Fable and future models, and would disclose any malicious activity it detects. However, Lutnick warned that the department “reserves the right to reevaluate the decisions made in this letter and the necessity of reimposing a license requirement, should circumstances change or should Anthropic fail to adhere to its commitments.”

“Our hope is that this collaboration … will serve as the basis for systematic rules for the whole industry,” Anthropic said in its Fable 5 release, “and even offer the beginnings of a template for effective global coordination on the risks and benefits of AI.”

The company is currently appealing the supply-chain risk designation in the D.C. Circuit Court of Appeals. It’s unclear how today’s announcement will affect the suit.

Contributors

  • Romy Abu-Fadel: Author

    Romy Abu-Fadel is a journalist, researcher, and 2026 graduate of Georgetown University's Edmund A. Walsh School of Foreign Service. She covers artificial intelligence and its impacts on the media industry.

  • Christopher Allbritton: Editor

    Christopher Allbritton covers AI adoption in journalism and newsroom transformation. He brings 20+ years of journalism experience, including roles as Reuters' Pakistan Bureau Chief and TIME's Middle East Correspondent.

Category: News | Tags: anthropic, Pentagon, restrictions



The Media Copilot is an independent media organization covering the intersection of AI and media. Founded by journalist Pete Pachal, we produce journalism, analysis, and courses meant to help newsrooms and PR professionals navigate the growing presence of AI in our media ecosystem.
