BotBeat

POLICY & REGULATION · Anthropic · 2026-06-17

White House Demands Anthropic Block All Jailbreaks as Impasse Over Claude Fable 5 Intensifies

Key Takeaways

  • The White House is demanding Anthropic prove it can block jailbreaks on Claude Fable 5 before the model can be rereleased following its export control suspension
  • The NSA has confirmed that guardrails on the model can be disabled through prompt engineering, contradicting Anthropic's claim that the effects are minimal
  • The administration expects Anthropic to proactively test all frontier models for vulnerabilities and report its findings to the government
Source: Hacker News, https://www.wired.com/story/the-white-house-wants-anthropic-to-block-all-jailbreaks-that-may-not-be-possible/

Summary

The Trump administration has escalated pressure on Anthropic following the export control suspension of Claude Fable 5, the company's most advanced model, over concerns that hackers could use prompt-based jailbreaks to circumvent the model's safety guardrails. Administration officials have made clear that Anthropic cannot rerelease the model without demonstrably addressing what the National Security Agency has confirmed are exploitable weaknesses in those guardrails, particularly the guardrails that restrict access to dangerous capabilities in cybersecurity, chemistry, and biology.

While Anthropic has downplayed the severity of jailbreaking risks, the administration has moved beyond debating their significance and now treats the problem as Anthropic's responsibility to solve. Officials are pushing the company to adopt more proactive security testing for all frontier models and to self-report vulnerabilities before public release. However, independent cybersecurity experts remain deeply divided over whether what the White House is asking for is feasible.

Security researchers increasingly argue that AI guardrails are merely a stopgap, suggesting that skilled users and advanced AI systems will inevitably find ways to bypass any safeguards. This raises a fundamental question: whether completely blocking jailbreaks is a realistic goal, or whether jailbreaks are a risk that must be managed alongside other security measures.

  • Cybersecurity experts question whether preventing jailbreaks entirely is even technically possible, suggesting guardrails are inherently bypassable
  • The dispute reflects growing tension between government AI safety demands and industry views on the technical feasibility and scope of those demands
Cybersecurity · Regulation & Policy · Ethics & Bias · AI Safety & Alignment
