BotBeat
...
← Back

> ▌

AnthropicAnthropic
RESEARCHAnthropic2026-05-15

ExploitGym: Frontier AI Models Successfully Exploit Real-World Vulnerabilities

Key Takeaways

  • ▸ExploitGym benchmark demonstrates that frontier AI models can successfully exploit real-world vulnerabilities across diverse domains including userspace programs, JavaScript engines, and Linux kernel
  • ▸Anthropic's Claude Mythos Preview achieved the strongest performance with 157 successful exploits, followed by OpenAI's GPT-5.5 with 120
  • ▸AI models maintain meaningful exploitation capabilities even with common security protections (ASLR, DEP) enabled, raising systemic cybersecurity concerns
Source:
Hacker Newshttps://arxiv.org/abs/2605.11086↗

Summary

Researchers have introduced ExploitGym, a large-scale benchmark designed to evaluate AI agents' ability to turn security vulnerabilities into working exploits. The benchmark comprises 898 real-world vulnerability instances across three domains: userspace programs, Google's V8 JavaScript engine, and the Linux kernel. According to the study, frontier AI models demonstrate non-trivial exploitation capabilities, with Anthropic's Claude Mythos Preview achieving the strongest performance by successfully exploiting 157 vulnerabilities, followed by OpenAI's GPT-5.5 with 120 successful exploits.

The research reveals significant cybersecurity implications, as even with widely deployed security protections (like ASLR and DEP) enabled, AI models retain meaningful exploitation success rates. This finding highlights the growing security risks posed by increasingly capable AI agents, as they can combine low-level program reasoning, runtime adaptation, and sustained progress to transform theoretical vulnerabilities into practical attacks. The study establishes ExploitGym as an effective testbed for evaluating AI exploitation capabilities and underscores the urgency of developing robust defenses against AI-powered attacks.

  • The research highlights the dual-use nature of AI exploitation capabilities and underscores the urgent need for AI safety governance and cybersecurity defenses

Editorial Opinion

ExploitGym represents a critical step in responsibly evaluating AI capabilities for a dual-use application that could reshape cybersecurity. The finding that frontier models can exploit real vulnerabilities at scale—even with defenses enabled—should trigger serious discussions about AI safety and security governance. While the benchmark serves defensive purposes, it also demonstrates how capable AI agents could lower the barrier for adversarial exploitation, making this research simultaneously valuable for security teams and concerning for the broader industry.

AI AgentsMachine LearningCybersecurityRegulation & PolicyAI Safety & Alignment

More from Anthropic

AnthropicAnthropic
FUNDING & BUSINESS

Nobel Prize-Winning AlphaFold Pioneer Departs Google DeepMind for Anthropic

2026-06-20
AnthropicAnthropic
PRODUCT LAUNCH

Agentic Resource Discovery: New Open Specification for Agent Ecosystems

2026-06-19
AnthropicAnthropic
RESEARCH

Repo-Jacking Vulnerability Exposed in Anthropic's Claude Community Plugins

2026-06-19

Comments

Suggested

Z.aiZ.ai
PRODUCT LAUNCH

Z.ai Launches GLM-5.2, Claims Fable 5-Class Model Coming Within Months

2026-06-20
Moebius Research ProjectMoebius Research Project
RESEARCH

Moebius: Lightweight Image Inpainting Framework Achieves 10B-Level Quality with Just 0.2B Parameters

2026-06-20
KlueKlue
POLICY & REGULATION

Klue OAuth Breach Expands: Icarus Hackers Claim Attack, Multiple Tech Firms Affected

2026-06-20
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us