AI Agent Session Smuggling and Code Interpreter Exploits in Enterprise AI Systems
A new attack technique called agent session smuggling has been identified, allowing malicious AI agents to exploit established cross-agent communication sessions to covertly inject instructions into victim agents. This method leverages the stateful nature of protocols like Agent2Agent (A2A), which are designed for persistent, context-aware conversations between AI agents. The attack does not exploit a vulnerability in the protocol itself, but rather abuses the implicit trust and memory features of stateful agent communication, enabling a rogue agent to manipulate a victim over multiple interactions. Mitigation strategies include human-in-the-loop enforcement, remote agent verification, and context-guarding mechanisms.
Separately, a vulnerability in Anthropic’s Claude AI assistant was demonstrated, where attackers could exploit the code interpreter feature to exfiltrate sensitive enterprise data. By using indirect prompt injection, malicious instructions embedded in user-supplied content could trigger Claude to retrieve confidential information and upload it to attacker-controlled accounts via the platform’s own API, bypassing default network restrictions. The exploit highlights the risks of implicit trust and insufficiently restricted internal APIs in advanced AI systems, emphasizing the need for robust security controls around agent interactions and code execution features.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
2 events from the most recent confirmed update back to the earliest known activity.
Claude AI code interpreter exploit exposing enterprise data is reported
CSO Online reported on a vulnerability in Claude AI in which a code interpreter exploit could expose enterprise data. The article represents public reporting of the issue and its potential impact on organizations using the platform.
Researchers disclose Agent Session Smuggling attack in A2A systems
Palo Alto Networks Unit 42 published research describing an 'Agent Session Smuggling' attack affecting agent-to-agent AI systems, outlining how AI agents can be manipulated to act outside intended session boundaries. The publication marks the public disclosure of the attack technique and its security implications for A2A environments.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
2 references tracked. Mallory keeps watching after this page renders.
See the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.


