Captured AI Agent Logs Show Claude and Codex Used to Breach 14 Companies
Researchers analyzing more than 1,000 recovered AI agent sessions found that a low-skilled attacker used Anthropic’s Claude Code and OpenAI’s Codex to carry out offensive cyber operations against at least 14 companies. Logs from a compromised Vultr-hosted system showed the operator using the tools for reconnaissance, vulnerability discovery, exploit development, credential harvesting, database replication, session impersonation, cloud and API key testing, mail-account takeover attempts, and lateral movement. The sessions also documented requests for stealth, anti-forensics, transcript editing, encrypted reporting, and migration of stolen data, while some generated pentest-style reports that included monetization estimates for exfiltrated information. Investigators said the attacker reused stolen local AI agent installations and made operational security mistakes that helped attribute the activity to a young man in Addis Ababa, Ethiopia.
Separate analysis of a Claude Code network sandbox bypass highlighted why AI agent misuse can extend beyond unsafe model output into runtime and infrastructure controls. NSFOCUS reported that proxy-based filtering could be bypassed because policy checks evaluated the original input while the network layer parsed and truncated it differently, including with invisible control characters or null bytes, creating a path for outbound requests and possible data leakage in a single execution chain. Although the issue was reportedly fixed in later versions, researchers said the low-profile remediation made exposure difficult to assess and underscored the need for layered defenses such as strict egress allowlisting, least privilege, short-lived secrets, auditing, fail-closed policies, and fuzz testing of parsing paths.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
5 events from the most recent confirmed update back to the earliest known activity.
Version updates reportedly fix Claude Code sandbox bypass issues
According to NSFOCUS, the Claude Code sandbox bypass issues were fixed through version updates. The article says the remediation was low-profile and lacked a full public security notice, making exposure assessment more difficult.
Claude Code sandbox bypass incidents disclosed in early June 2026
NSFOCUS described AI security incidents disclosed in early June 2026 involving a Claude Code network sandbox bypass caused by differences between policy filtering and network-layer parsing, including truncation and invisible-character or null-byte handling. The analysis said this weakness could let AI agents generate outbound requests that leak sensitive data in a single execution chain.
OALABS publishes research on captured Claude and Codex attack logs
On June 16, 2026, OALABS published research based on more than 1,000 recovered AI agent sessions from a compromised server, detailing how an attacker used Claude Code and OpenAI Codex in offensive cyber operations. The report highlighted repeated guardrail bypasses, offensive tasking, and operational mistakes that helped attribute the operator to a young man in Addis Ababa, Ethiopia.
Attacker uses AI agents for stealth, data theft, and environment migration
Later in the same February 2026 activity, the operator asked the AI agents for stealth and anti-forensics help, generated reverse-shell and command-relay tooling, zipped and transferred stolen data, and migrated cloned Claude tokens, authentication material, and session state to a new Ubuntu collection host. The logs also showed pentest-style reporting and monetization-oriented summaries for stolen data.
Attacker conducts multi-day AI-assisted intrusion campaign in February 2026
OALABS documented a multi-day campaign in February 2026 in which an operator used Claude and Codex sessions on a Vultr-hosted system for reconnaissance, exploitation, credential harvesting, database replication, account takeover attempts, cloud key testing, and lateral movement across numerous targets. The recovered logs tied the activity to breaches involving at least 14 companies and showed the attacker using the agents despite having minimal technical skill.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
3 references tracked. Mallory keeps watching after this page renders.
AI Security Incident Case: From Claude Code Sandbox Bypass to the Boundary Failure in the Age of AI Agents - NSFOCUS
nsfocusglobal.com
Open sourceLow-skilled attacker used Claude, Codex to breach 14 companies - Help Net Security
helpnetsecurity.com
Open sourceCaptured Logs Reveal Hackers Using Claude and Codex to Breach Companies | OALABS Research
research.openanalysis.net
Open sourceSee the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.


