Agentjacking: Attackers Hijack AI Coding Agents Through Fake Error Reports
Key Takeaways
- ▸Agentjacking tricks AI coding agents into running attacker code by injecting commands into error reports on platforms like Sentry that agents access via the Model Context Protocol
- ▸The attack affected Claude Code, Cursor, and Codex with an 85% success rate, exposing 2,388 organizations across enterprise and individual developers
- ▸The vulnerability bypasses EDR, firewalls, IAM, and VPNs by leveraging the developer's legitimate credentials—what researchers call the "Authorised Intent Chain"
Summary
Security researchers at Tenet Security have disclosed a critical vulnerability called "Agentjacking" that allows attackers to hijack AI coding agents like Claude Code, Cursor, and OpenAI's Codex through nothing more than a fake error report. The attack exploits the Model Context Protocol (MCP), which allows agents to access external tools like Sentry. Attackers post a malicious error report to Sentry's public endpoint (requiring no authentication), embedding hidden commands in a fake "Resolution" section. When developers ask their agent to fix unresolved Sentry issues, the agent executes the attacker's code with the developer's own privileges on their machine.
In controlled testing, Tenet achieved an 85% success rate hijacking the three major AI coding agents and identified 2,388 exposed organizations, from $250 billion enterprises to solo developers. The vulnerability is particularly severe because it bypasses traditional security defenses—EDR, firewalls, IAM, and VPNs cannot stop it because the attack uses the developer's legitimate credentials and authorized context. Attackers gain access to environment variables, AWS keys, GitHub tokens, and private repository URLs, potentially compromising CI/CD pipelines and cloud infrastructure.
When Tenet notified Sentry on June 3rd, the platform acknowledged the problem but declined to fix it architecturally, instead adding a filter for one specific payload string. This standoff reveals a deeper structural problem: the vulnerability exists anywhere agents access external data sources—not just Sentry, but support tickets, GitHub issues, and documentation. As AI agents gain terminal execution capabilities, they've become new attack surfaces with no clear ownership of defense.
- Neither Sentry nor the coding agent vendors took responsibility for fixing the root cause, leaving a systemic architectural flaw affecting any external data source agents access
- The attack succeeds even when agents are explicitly instructed to ignore untrusted data, highlighting a fundamental security gap in agent design
Editorial Opinion
Agentjacking exposes a critical blind spot in the AI agent deployment rush: while we've focused on model alignment and safety, we've overlooked the practical attack surfaces created when agents interface with real infrastructure. The fact that this vulnerability bypasses every traditional security layer—and that neither vendor nor tool provider will own the fix—suggests we need entirely new defensive architectures for agent-based systems. As enterprises accelerate agent deployments into production, they must fundamentally rethink security: the agent is now the attack surface, not just a user assistance tool.



