OpenClaw AI Agent Tricked Into Leaking Secrets and Running Malicious Code
Researchers reported that the self-hosted OpenClaw AI agent can be manipulated through both phishing-style social engineering and prompt-injection-style inputs to expose sensitive data and perform attacker-directed actions. In Varonis Threat Labs simulations, an OpenClaw email agent connected to Gmail, browser tools, Google Workspace APIs, and internal data sources forwarded mock AWS IAM keys, database credentials, SSH details, and a synthetic customer export after receiving urgent or routine-looking messages impersonating trusted colleagues. The tests found the agent was better at spotting technical phishing indicators such as fake login pages, malicious links, and suspicious OAuth prompts than at verifying sender identity and resisting social pressure, and showed stricter phishing-aware settings still failed in some scenarios.
Separate research from Imperva found hidden instructions embedded in shared contacts, vCards, and location pins were flattened into prompt text without being labeled untrusted, allowing the model in testing to download and execute a researcher-controlled script. OpenClaw reportedly patched that issue in version 2026.4.23, and researchers also cited patched allowlist-bypass flaws in multiple messaging extensions. Across the findings, the common risk was agents that can read private data, ingest untrusted content, and send information outward; recommended mitigations included updating OpenClaw, restricting connector and outbound communication access, enforcing policy controls, and requiring human approval for high-risk actions.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
3 events from the most recent confirmed update back to the earliest known activity.
OpenClaw patches prompt-injection issue in version 2026.4.23
The prompt-injection issue identified by Imperva was patched by OpenClaw in version 2026.4.23. The reporting also notes additional allowlist-bypass flaws across multiple OpenClaw messaging extensions were patched.
Imperva finds prompt-injection path to code execution in OpenClaw
Separate research by Imperva showed that hidden instructions embedded in shared contacts, vCards, and location pins could be flattened into prompt text by OpenClaw without being marked untrusted, enabling attacker-controlled actions. In Imperva's tests, this let Gemini 3.1 Pro download and run a researcher-controlled script.
Varonis demonstrates phishing-driven data leakage in OpenClaw
Varonis Threat Labs tested the OpenClaw AI email agent in a simulated enterprise environment and showed it could be socially engineered by phishing-style emails into disclosing sensitive data, including AWS IAM keys, database credentials, SSH details, and customer or CRM exports. The research also found stricter phishing-aware configuration still failed in some identity-verification scenarios, though the agent performed better against technical phishing indicators such as malicious links and OAuth prompts.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
3 references tracked. Mallory keeps watching after this page renders.
New Attacks Trick OpenClaw AI Agent Into Running Code and Leaking Secrets
thehackernews.com
Open sourceOpenClaw AI Agent Leaks Sensitive Credentials in New Phishing Attack Simulation
cybersecuritynews.com
Open sourceOpenClaw AI agent found falling for phishing attacks, spills user data
bleepingcomputer.com
Open sourceSee the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.


