Anthropic Calls for Worldwide 'Pause' on AI Development as Claude Advances Toward Recursive Self-Improvement

Key Takeaways

▸Anthropic proposes a worldwide temporary pause on AI development and plans to convene policymakers, researchers, and civil society to discuss AI safety risks
▸Claude has advanced significantly in automating AI research, with Anthropic reporting that over 80% of its code is now generated by AI
▸Anthropic has embedded engineers in the NSA to support offensive cybersecurity operations, contradicting its public safety messaging

Source:

Hacker Newshttps://www.theguardian.com/technology/2026/jun/05/anthropic-urges-temporary-pause-on-ai-development-to-discuss-risks↗

Summary

Anthropic has proposed a worldwide 'temporary pause' on AI development and announced plans to convene policymakers, researchers, and civil society to discuss the dangers of advanced AI systems. The proposal, detailed in a Thursday post, comes as the company reports significant progress in its Claude model's advancement toward 'recursive self-improvement'—the ability to autonomously design and develop more powerful versions of itself, a capability that AI safety researchers view as a critical risk factor for superintelligence and loss of human control.

According to Anthropic, Claude has made substantial progress in automating AI research and development tasks, with the company reporting that over 80% of code merged into its codebase is now AI-generated. The model is described as capable of 'steering research' and 'proposing its own experiments,' though these capabilities currently remain confined to coding-related work. Anthropic framed these trends as evidence pointing toward 'an AI system capable of fully autonomously designing and developing its own successor,' posing significant risks to maintaining human oversight over AI systems.

However, the safety proposal sits in stark contradiction to a separate Financial Times report revealing that Anthropic has embedded engineers inside the National Security Agency to support the deployment of Claude (or a related model called Mythos) for offensive cybersecurity operations. University College London professor Steven Murdoch criticized the apparent contradiction, stating that Anthropic's definition of AI safety is narrow and that the company has never publicly opposed supporting US authorities in developing offensive capabilities.

Critics also question the substantive nature of Anthropic's technical claims and the timing of the announcement. Murdoch noted that while AI capabilities continue to improve incrementally, the advances described do not demonstrate actual recursive self-improvement, and questioned whether developments announced warrant the urgency of the pause proposal.

Experts question both the technical significance of the claims and the sincerity of the safety proposal given Anthropic's simultaneous work with US defense agencies

Editorial Opinion

Anthropic's simultaneous call for a worldwide pause on AI development and its undisclosed partnership with the NSA on offensive cyber capabilities expose a fundamental contradiction in the company's safety positioning. The proposal appears designed to establish Anthropic as the responsible actor while potentially constraining competitors, even as the company accelerates its own capabilities and strategic government partnerships. True AI safety leadership requires transparency and consistent principles, not selective concerns that align conveniently with commercial and geopolitical interests.

Anthropic Calls for Worldwide 'Pause' on AI Development as Claude Advances Toward Recursive Self-Improvement

Key Takeaways

▸Anthropic proposes a worldwide temporary pause on AI development and plans to convene policymakers, researchers, and civil society to discuss AI safety risks
▸Claude has advanced significantly in automating AI research, with Anthropic reporting that over 80% of its code is now generated by AI
▸Anthropic has embedded engineers in the NSA to support offensive cybersecurity operations, contradicting its public safety messaging

Summary

Experts question both the technical significance of the claims and the sincerity of the safety proposal given Anthropic's simultaneous work with US defense agencies

Editorial Opinion

Anthropic's simultaneous call for a worldwide pause on AI development and its undisclosed partnership with the NSA on offensive cyber capabilities expose a fundamental contradiction in the company's safety positioning. The proposal appears designed to establish Anthropic as the responsible actor while potentially constraining competitors, even as the company accelerates its own capabilities and strategic government partnerships. True AI safety leadership requires transparency and consistent principles, not selective concerns that align conveniently with commercial and geopolitical interests.

Anthropic Calls for Worldwide 'Pause' on AI Development as Claude Advances Toward Recursive Self-Improvement

Key Takeaways

Summary

Editorial Opinion

More from Anthropic

Anthropic Releases Claude Opus 5: Mid-Tier Model Balances Performance and Affordability

Dragos: Real-World Cyberattack Used Claude and GPT to Breach Water Utility OT Systems

Silicon Valley Splits Over Chinese AI: Safety vs. Access Debate Intensifies

Comments

Suggested

Anthropic Releases Claude Opus 5: Mid-Tier Model Balances Performance and Affordability

OpenAI's AI Models Break Free: First Real Loss-of-Control Incident Exposes Regulatory Gaps

Apertus 1.5 Brings Image Understanding and 4x Context Window to Open-Source LLM

Anthropic Calls for Worldwide 'Pause' on AI Development as Claude Advances Toward Recursive Self-Improvement

Key Takeaways

Summary

Editorial Opinion

More from Anthropic

Anthropic Releases Claude Opus 5: Mid-Tier Model Balances Performance and Affordability

Dragos: Real-World Cyberattack Used Claude and GPT to Breach Water Utility OT Systems

Silicon Valley Splits Over Chinese AI: Safety vs. Access Debate Intensifies

Comments

Suggested

Anthropic Releases Claude Opus 5: Mid-Tier Model Balances Performance and Affordability

OpenAI's AI Models Break Free: First Real Loss-of-Control Incident Exposes Regulatory Gaps

Apertus 1.5 Brings Image Understanding and 4x Context Window to Open-Source LLM