ai-platform-securitydata-exfiltration-methodai-enabled-threat-activityleaked-secret-api-key

Indirect Prompt Injection and Data Exfiltration Risks in Enterprise AI Agents

Updated 1mo agoFirst seen Mar 16, 202612 sources

Security researchers warned that AI agents and retrieval-augmented generation (RAG) systems can be turned into data-exfiltration channels when attackers poison inputs or embed malicious instructions in content the model is expected to process. One report described a 0-click indirect prompt injection against OpenClaw agents in which hidden instructions cause the agent to generate an attacker-controlled URL containing sensitive data such as API keys or private conversations in query parameters; messaging platforms like Telegram or Discord can then automatically request that URL for link previews, silently delivering the data to the attacker. The same reporting noted concerns about insecure defaults that allow agents to browse, execute tasks, and access local files, expanding the blast radius of prompt-injection abuse.

Related analysis highlighted that the same core weakness extends beyond standalone agents to enterprise RAG deployments, where the integrity of the knowledge base becomes part of the security boundary. If attackers can poison indexed documents in systems such as SharePoint or Confluence, they can manipulate retrieval results and influence model outputs, including security workflows and analyst guidance. Broader commentary on agentic AI threat convergence reinforced that prompt engineering is no longer just a productivity technique but an emerging exploit class, with adversaries using prompt injection and context manipulation against AI-enabled security operations. Together, the reporting shows that enterprise AI risk increasingly depends on controlling untrusted content, hardening agent permissions, and treating prompts, retrieved documents, and downstream integrations as attack surfaces.

Indirect Prompt Injection and Data Exfiltration Risks in Enterprise AI Agents

Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

Start free trial

EVENT TIMELINE

How this story unfolded

6 events from the most recent confirmed update back to the earliest known activity.

6 EVENTS

Apr 23, 20262mo ago

Google reports rising malicious prompt injection activity on the public web

Google published an analysis of indirect prompt injection content found in Common Crawl data, concluding that most observed attacks were low-sophistication experiments, pranks, SEO manipulation, or attempts to influence AI summaries rather than mature operational campaigns. The study also identified some malicious examples involving attempted data exfiltration, destructive commands, and anti-agent traps, and reported a 32% relative increase in malicious-category detections between November 2025 and February 2026.

Google Online Security Blog: AI threats in the wild: The current state of prompt injections on the web

Apr 22, 20262mo ago

Forcepoint documents 10 indirect prompt injection payloads seen in the wild

Forcepoint X-Labs published verified examples of 10 web-based indirect prompt injection payloads embedded in webpages to manipulate AI agents. The report detailed attack goals including denial of service, output hijacking, traffic redirection, financial fraud, and destructive command execution, along with concealment methods such as CSS invisibility, HTML comments, accessibility-layer abuse, and metadata poisoning.

Indirect Prompt Injection in the Wild: X-Labs Finds 10 IPI Payloads

Mar 17, 20263mo ago

Research introduces Agent Commander prompt-based C2 for AI agents

Research published by Embrace The Red introduced 'Agent Commander,' a proof-of-concept prompt-based command-and-control framework for compromised AI agents. The work showed how agents including OpenClaw, Kimi Claw, and NanoClaw could be hijacked through indirect prompt injection and maintained through persistence mechanisms such as HEARTBEAT.md changes or scheduled tasks.

Mar 16, 20263mo ago

CNCERT warns OpenClaw's default security posture creates enterprise risk

CNCERT warned that OpenClaw's default configuration poses enterprise risk because the agents can browse, execute tasks, and access local files, increasing the impact of indirect prompt injection attacks. The warning framed the issue as an architectural problem tied to agent autonomy and integrations.

PromptArmor demonstrates zero-click data exfiltration via OpenClaw agents

PromptArmor demonstrated that indirect prompt injection in OpenClaw AI agents could force the agent to generate attacker-controlled links containing sensitive data, which messaging platforms such as Telegram or Discord would automatically fetch via link previews. This created a zero-click exfiltration path for data such as API keys and private conversations.

Jan 15, 20265mo ago

Microsoft patches Copilot Studio prompt injection flaw ShareLeak

Microsoft confirmed in December 2025 an indirect prompt injection vulnerability in Copilot Studio, later assigned CVE-2026-21520 with a CVSS score of 7.5. The company patched the flaw, dubbed ShareLeak, on 2026-01-15 after Capsule Security showed public-facing inputs could hijack agents and exfiltrate sensitive data through authorized tool actions.

Microsoft patched a Copilot Studio prompt injection. The data exfiltrated anyway | VentureBeat

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

49 LINKEDOpen in app

Vulnerabilities

2 linked

EchoLeak: AI command injection in Microsoft 365 Copilot ShareLeak information disclosure in Microsoft Copilot Studio

Malware

2 linked

Cobalt Strike BugSleep

Affected products

15 linked

Microsoft 365 CopilotClaude CodeChatgptChatgptCopilot StudioStripeFalconGithub CopilotAmazon Web ServicesPaypalCursorDockerOpenclawSharepointClaude

Organizations

30 linked

GoogleForcepointMicrosoft CorporationStripePayPalAnthropicOpenaiGitHubSalesforceAmazon Web ServicesPalo Alto NetworksCursorReputationAtlassianSamsung ElectronicsGitLabNoma SecurityMistral AISuseCrowdStrikeGartnerApplexAIAim SecurityCognition AIVentureBeatWRITERInvariant LabsCommon CrawlCapsule Security

SOURCE COVERAGE

Sources

12 references tracked. Mallory keeps watching after this page renders.

12 SOURCESView all

Help Net SecurityNews

Apr 24, 2026

Indirect prompt injection is taking hold in the wild - Help Net Security

helpnetsecurity.com

Open source

Zdnet Zero DayNews

Apr 23, 2026

How indirect prompt injection attacks on AI work - and 6 ways to shut them down | ZDNET

zdnet.com

Open source

Google Security BlogAdvisories

Apr 23, 2026

Google Online Security Blog: AI threats in the wild: The current state of prompt injections on the web

security.googleblog.com

Open source

ForcepointNews

Apr 22, 2026

Indirect Prompt Injection in the Wild: X-Labs Finds 10 IPI Payloads

forcepoint.com

Open source

4 additional sources from 17-03-2026 to 15-04-2026

Cyber Security NewsNews

Mar 16, 2026

OpenClaw AI Agents Leaking Sensitive Data in Indirect Prompt Injection Attacks

cybersecuritynews.com

Open source

CyberthroneNews

Mar 16, 2026

RAG Poisoning: When the Knowledge Base Becomes the Weapon - TheCyberThrone

thecyberthrone.in

Open source

CyberthroneNews

Mar 15, 2026

The Prompt is the New Exploit: Prompt Engineering and the Agentic AI Threat Convergence - TheCyberThrone

thecyberthrone.in

Open source

Simon Willison BlogNews

Jun 16, 2025

The lethal trifecta for AI agents: private data, untrusted content, and external communication

simonwillison.net

Open source

The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.

Start free trial

Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.

Indirect Prompt Injection and Data Exfiltration Risks in Enterprise AI Agents

Get ahead of threats like this

How this story unfolded

Google reports rising malicious prompt injection activity on the public web

Forcepoint documents 10 indirect prompt injection payloads seen in the wild

Research introduces Agent Commander prompt-based C2 for AI agents

CNCERT warns OpenClaw's default security posture creates enterprise risk

PromptArmor demonstrates zero-click data exfiltration via OpenClaw agents

Microsoft patches Copilot Studio prompt injection flaw ShareLeak

Related entities

Sources

Indirect prompt injection is taking hold in the wild - Help Net Security

How indirect prompt injection attacks on AI work - and 6 ways to shut them down | ZDNET

Google Online Security Blog: AI threats in the wild: The current state of prompt injections on the web

Indirect Prompt Injection in the Wild: X-Labs Finds 10 IPI Payloads

OpenClaw AI Agents Leaking Sensitive Data in Indirect Prompt Injection Attacks

RAG Poisoning: When the Knowledge Base Becomes the Weapon - TheCyberThrone

The Prompt is the New Exploit: Prompt Engineering and the Agentic AI Threat Convergence - TheCyberThrone

The lethal trifecta for AI agents: private data, untrusted content, and external communication

See the full picture, correlated to your attack surface.

Related stories

Indirect Prompt Injection and Prompt Manipulation Risks in AI Agents

Prompt Injection Attacks Abuse AI Agent Memory and Link Previews for Manipulation and Data Exfiltration

Prompt Injection Attacks and Security Challenges in AI Systems