Skip to main content
Mallory
Back to intelligence
ai-platform-securitybuild-pipeline-compromiseopen-source-dependency-vulnerabilityinitial-access-method

AI Agent Prompt-Injection and Web-to-Agent Takeover Risks in Developer Tooling

Updated 1mo agoFirst seen Feb 27, 202620 sources

Security research highlighted web-to-agent takeover and prompt-injection risks in modern AI developer tooling. Oasis Security reported a “complete vulnerability chain” in the open-source AI agent OpenClaw that allowed a malicious website a developer merely visited to silently seize control of the local agent—without plugins, browser extensions, or additional user interaction—leveraging the agent’s ability to execute system commands and manage workflows. The OpenClaw maintainers rated the issue High severity and issued a patch within 24 hours of disclosure.

Separate research described RoguePilot, a scenario in which a passive prompt injection can abuse highly privileged AI assistance inside GitHub Codespaces. The write-up emphasizes that Codespaces environments commonly expose a repository-scoped GITHUB_TOKEN with write permissions and provide AI “tools” such as terminal execution and file operations (e.g., run_in_terminal, file_read, create_file), creating “God Mode” conditions where untrusted text can be interpreted as instructions and lead to repository compromise. A third item (a Smashing Security podcast episode) primarily covers unrelated stories (alleged CAPTCHA-based DDoS activity tied to an archiving service and other news) and does not materially contribute to the AI agent takeover/prompt-injection topic.

Share:
AI Agent Prompt-Injection and Web-to-Agent Takeover Risks in Developer Tooling
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

17 events from the most recent confirmed update back to the earliest known activity.

17 EVENTS
Apr 29, 20261mo ago

OpenClaw admin command authorization bypass is disclosed

A newly reported OpenClaw vulnerability in command resolution allows unauthenticated attackers to execute restricted administrative bot commands through supported messaging channels such as Discord or WhatsApp when specific permissive settings are enabled. The flaw stems from fallback authorization logic that can incorrectly approve attacker-supplied admin commands and return sensitive outputs such as configuration dumps or debugging statistics.

GHSA-C28G-VH7M-FM7V: GHSA-C28G-VH7M-FM7V: Improper Authorization and Privilege Escalation in OpenClaw Command Resolution | CVEReports
Apr 25, 20262mo ago

OpenClaw templated webhook session-routing auth bypass is disclosed

A newly reported OpenClaw vulnerability allowed external webhook payload data used in templated hook mappings, such as {{payload.id}}, to influence routing to AI agent sessions without proper enforcement of the allowRequestSessionKey control. The flaw affected openclaw versions before 2026.4.20 and could enable session hijacking, particularly in deployments using built-in routing presets like the Gmail integration.

GHSA-2XCP-X87W-Q377: GHSA-2xcp-x87w-q377: Incorrect Authorization Bypass via Templated Hook Mappings in OpenClaw | CVEReports

OpenClaw isolated cron awareness-event trust-boundary flaw is disclosed

A newly reported OpenClaw vulnerability allows attacker-controlled messages from external webhook integrations tied to isolated cron jobs to be promoted into the main session as trusted system events. Because the awareness-event path omits a flag marking the content untrusted, malicious instructions can be treated as authoritative by the LLM and displayed in the UI with system-level indicators despite originating externally.

GHSA-57R2-H2WJ-G887: GHSA-57R2-H2WJ-G887: Trust Boundary Violation in OpenClaw Isolated Cron Awareness Events | CVEReports

OpenClaw MCP server env-var injection RCE is disclosed

A high-severity OpenClaw vulnerability was disclosed in which opening a malicious workspace can inject environment variables into MCP server execution and trigger arbitrary code execution with the user's privileges. The issue affects major desktop platforms and could let attackers access files, establish persistence, and pivot to reachable network resources.

GHSA-MJ59-H3Q9-GHFH: GHSA-MJ59-H3Q9-GHFH: Arbitrary Code Execution via Environment Variable Injection in OpenClaw MCP Servers | CVEReports
Apr 9, 20262mo ago

OpenClaw webhook system-prompt injection flaw is disclosed

A newly reported OpenClaw vulnerability in src/agents/system-prompt.ts allows untrusted webhook JSON payloads and dynamic context variables to be appended directly into the System role prompt. The design can let attacker-controlled input be interpreted as authoritative instructions, enabling indirect prompt injection and potential agent compromise.

GHSA-JF56-MCCX-5F3F: GHSA-JF56-MCCX-5F3F: Indirect Prompt Injection and Agent Compromise in OpenClaw Webhooks | CVEReports
Apr 4, 20262mo ago

OpenClaw Google OAuth PKCE verifier exposure flaw is disclosed

A newly reported OpenClaw vulnerability, CVE-2026-34511, exposes the OAuth PKCE code_verifier in the state parameter during the Google OAuth callback flow. If an attacker captures the callback URL, they can redeem the authorization code and verifier at Google’s token endpoint to obtain the victim’s access and refresh tokens.

GHSA-9JPJ-G8VV-J5MF: CVE-2026-34511: PKCE Verifier Exposure via OAuth State Parameter in OpenClaw | CVEReports
Apr 2, 20262mo ago

OpenClaw Matrix extension reply-context auth bypass is disclosed

A newly reported vulnerability in OpenClaw's Matrix integration allows prompt injection and authorization bypass when an authorized user replies to or starts a thread from an attacker’s message in the same room. OpenClaw can follow the trusted reply context, retrieve the attacker-controlled content, and execute embedded instructions within the authorized user’s operational scope despite sender allowlist checks.

GHSA-RG8M-3943-VM6Q: GHSA-RG8M-3943-VM6Q: Authorization Bypass in OpenClaw Matrix Extension via Reply Context | CVEReports
Mar 29, 20263mo ago

OpenClaw Google Chat integration auth bypass flaw is disclosed

A newly reported vulnerability in OpenClaw's Google Chat integration allows authorization bypass through mutable metadata, potentially giving attackers unauthorized access to agent functionality and connected data. The issue was described as network-exploitable with low privileges and high confidentiality and integrity impact, especially where agents are integrated with backend systems, databases, or CI pipelines.

GHSA-52Q4-3XJC-6778: GHSA-52Q4-3XJC-6778: Authorization Bypass via Mutable Metadata in OpenClaw Google Chat Integration | CVEReports

OpenClaw Synology Chat plugin token brute-force flaw is disclosed

A newly reported OpenClaw vulnerability in the Synology Chat integration allowed attackers to brute-force a webhook token and send arbitrary messages that the server would treat as coming from a trusted Synology source. The report said this could let attackers inject prompts into the AI assistant to extract sensitive context, trigger backend actions, or manipulate assistant behavior, and noted a patched implementation added invalid-token rate limiting.

GHSA-MF5G-6R6F-GHHM: GHSA-MF5G-6R6F-GHHM: Pre-Authentication Rate-Limit Bypass in OpenClaw Synology Chat Plugin | CVEReports
Mar 26, 20263mo ago

Filesystem sandbox bypass in OpenClaw agent media tools is disclosed

A newly reported OpenClaw vulnerability allows attackers to exfiltrate sensitive host files through vulnerable image or PDF processing tools when sandbox bridge mounts expose host directories. By using prompt injection or parameter manipulation, an attacker can make the agent read bridged files such as config files or SSH keys and send their contents to external vision model APIs for extraction.

GHSA-CFP9-W5V9-3Q4H: GHSA-CFP9-W5V9-3Q4H: Filesystem Sandbox Bypass in OpenClaw Agent Media Tools | CVEReports
Mar 20, 20263mo ago

OpenClaw WebSocket auth bypass privilege-escalation issue is documented

A later report documented a separate OpenClaw gateway flaw in WebSocket authentication and authorization handling, where an attacker with a valid shared token or password could self-assign the operator.admin scope and gain administrative RPC access. The issue could allow actions such as disabling monitoring via heartbeat controls.

Feb 28, 20263mo ago

Microsoft warns self-hosted AI agent runtimes should be isolated

Microsoft issued guidance that self-hosted agent runtimes such as OpenClaw should be treated as untrusted code execution and evaluated only in isolated environments with non-privileged credentials and monitoring. The advisory came amid reporting on OpenClaw vulnerabilities and broader ecosystem abuse.

Feb 27, 20264mo ago

RoguePilot prompt-injection attack on GitHub Codespaces is disclosed

Research described 'RoguePilot,' a passive prompt-injection chain in GitHub Codespaces with Copilot agent mode, where hidden instructions in GitHub Issues could drive terminal and file operations, steal a repository-scoped GITHUB_TOKEN via a symlink and schema-fetch trick, and enable repository compromise. Orca Security said it responsibly disclosed the issue and that Microsoft/GitHub patched it.

Researchers publicly disclose 'ClawJacked' OpenClaw vulnerability

Oasis Security publicly disclosed the ClawJacked vulnerability chain, showing that a visited website could hijack a locally running OpenClaw agent, access logs and configuration, enumerate nodes, and potentially execute commands or exfiltrate data. Coverage emphasized the issue as a 'shadow AI' risk for unmanaged local agents.

Feb 26, 20264mo ago

OpenClaw releases fix for 'ClawJacked' localhost takeover flaw

After responsible disclosure by Oasis Security, OpenClaw patched the high-severity 'ClawJacked' vulnerability that allowed malicious websites to brute-force the localhost WebSocket gateway password and silently register as a trusted device. Multiple reports place the fix on February 26, 2026, in versions 2026.2.25/2026.2.26.

Feb 13, 20264mo ago

OpenClaw patches log-poisoning prompt-injection issue

OpenClaw addressed a separate log-poisoning vulnerability that could enable indirect prompt injection through agent log-reading behavior. The issue was patched in version 2026.2.13.

Jan 31, 20264mo ago

OpenClaw patches earlier token-leak hijack flaw (CVE-2026-25253)

An earlier OpenClaw hijack issue, identified by Ethiack and Depthfirst, was patched in version 2026.1.29. The flaw reportedly leaked an authentication token via a crafted URL parameter and was fixed on January 31, 2026.

SOURCE COVERAGE

Sources

20 references tracked. Mallory keeps watching after this page renders.

20 SOURCESView all
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.