AI Agents Face Critical Identity Security Gap as Attackers Exploit Agentjacking Vulnerability

Reviewed byNidhi Govil

2 Sources

Share

A new attack called agentjacking hijacked Claude Code through fake error reports, achieving an 85% success rate in testing. Security experts warn that AI agents operate with excessive privileges and minimal oversight, creating blind spots that traditional security tools cannot detect. The Cloud Security Alliance has classified this as a systemic vulnerability.

AI Agents Create New Attack Surface Through Identity Vulnerabilities

AI agents have introduced a critical security gap that attackers are actively exploiting, and most organizations remain dangerously unprepared. A single fake error report successfully hijacked Claude Code in controlled testing conducted by Tenet Security, with the agent executing attacker code using full developer privileges while every traditional security layer—EDR, WAF, IAM, and firewalls—failed to detect the intrusion

2

. The attack, classified as agentjacking, achieved an 85% success rate across more than 100 tested targets, exposing a fundamental weakness in how enterprises approach AI security

2

.

The vulnerability stems from how agentic AI systems authenticate and operate across production environments. AI agents function as digital actors that call APIs, write code, trigger workflows, and query databases using credentials, API tokens, OAuth grants, and cloud roles that many organizations have not fully inventoried

1

. Unlike traditional machine identities that perform predictable tasks, these agents interpret goals, choose paths, and act across systems with human-like autonomy while scaling at machine speed

1

.

Source: BleepingComputer

Source: BleepingComputer

Security Challenges Posed by Agentic AI Expose Enterprise Blind Spots

Tenet Security identified 2,388 organizations with publicly exposed Sentry credentials vulnerable to malicious event injection at scale

2

. One captured Claude Code environment contained a live AWS secret access key and private repository URLs. The Cloud Security Alliance classified agentjacking as a systemic MCP vulnerability class within days of disclosure

2

. Organizations running AI coding agents connected to Sentry, Datadog, PagerDuty, or Jira face the same exposure if those agents can execute shell commands.

The attack succeeds because every step appears authorized: attackers send valid API calls using public DSN credentials, MCP servers return injected events as authentic output, and agents execute instructions using developer privileges without triggering alerts

2

. Sentry acknowledged the flaw as "technically not defensible" under current architecture

2

.

Identity and Access Management Models Fail AI Agent Requirements

Traditional identity security programs built around human users cannot address AI identity risks effectively. Security teams designed identity and access management systems assuming employees join, move, and leave with relatively stable behavior patterns

1

. Machine identities strained that model through service accounts and API keys, but most remained deterministic with predictable tasks. AI agents shatter this assumption by combining human-like autonomy with software scale and machine-speed processing.

Source: VentureBeat

Source: VentureBeat

The overprivilege problem accelerates during experimentation when developers grant broad API tokens, business units connect agents with admin rights, or teams embed secrets into workflows for speed

1

. These shortcuts create identity debt that agentic AI systems accumulate at scale. A support agent summarizing tickets requires different access controls than one issuing refunds or executing production commands, yet most enterprises lack contextual, intent-based, time-bound authorization mechanisms

1

.

Runtime Security Controls for AI Agents Remain Critically Absent

Five independent surveys from early 2025 revealed enterprises trust AI agents far beyond their enforcement capabilities. Only 34% of organizations apply the same security controls to AI agents as to humans, according to an Okta/Apprize360 survey of 292 executives and 492 knowledge workers

2

. Fifty-two percent of employees use unapproved AI tools, while 58% of executives reported AI-related incidents or close calls

2

.

HiddenLayer's survey of 250 IT and security leaders found 33% reported agents had exceeded intended scope, and 31% could not confirm whether they experienced an AI breach

2

. Gravitee's survey of over 900 executives found only 14.4% of agents launched with full security approval, and 88% reported confirmed or suspected incidents

2

.

Elia Zaitsev, CTO of CrowdStrike, explained the gap: "Securing agents looks very similar to securing highly privileged users. They have identities, access to underlying systems, they reason, they take action. No one has been talking about securing agents at runtime"

2

. CrowdStrike's fleet data shows more than 1,800 agentic applications on enterprise endpoints with approximately 160 million instances under monitoring

2

.

Shadow AI and Prompt Injection Amplify Attack Vectors

Many organizations already face shadow AI challenges similar to previous shadow IT problems. Agents arrive through SaaS platforms adding autonomous features, run locally on endpoints or inside developer environments, and connect to automation platforms, identity providers, and ticketing systems without security visibility

1

. Without knowing which credentials agents use, security teams cannot understand blast radius or map agents to owners and lifecycles for accountability

1

.

Prompt injection becomes a vector for unauthorized action when agents can read untrusted content and take privileged action. Attackers can hijack AI coding agents without compromising traditional accounts by influencing what overprivileged agents access

1

. CISOs cannot wait for separate AI security programs to mature before addressing these identity-centric governance requirements

1

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved