Skip to main content
Mallory
Back to intelligence
ai-platform-securitydata-exfiltration-methodai-enabled-threat-activityleaked-secret-api-key

Indirect Prompt Injection and Data Exfiltration Risks in Enterprise AI Agents

Updated 1mo agoFirst seen Mar 16, 202612 sources

Security researchers warned that AI agents and retrieval-augmented generation (RAG) systems can be turned into data-exfiltration channels when attackers poison inputs or embed malicious instructions in content the model is expected to process. One report described a 0-click indirect prompt injection against OpenClaw agents in which hidden instructions cause the agent to generate an attacker-controlled URL containing sensitive data such as API keys or private conversations in query parameters; messaging platforms like Telegram or Discord can then automatically request that URL for link previews, silently delivering the data to the attacker. The same reporting noted concerns about insecure defaults that allow agents to browse, execute tasks, and access local files, expanding the blast radius of prompt-injection abuse.

Related analysis highlighted that the same core weakness extends beyond standalone agents to enterprise RAG deployments, where the integrity of the knowledge base becomes part of the security boundary. If attackers can poison indexed documents in systems such as SharePoint or Confluence, they can manipulate retrieval results and influence model outputs, including security workflows and analyst guidance. Broader commentary on agentic AI threat convergence reinforced that prompt engineering is no longer just a productivity technique but an emerging exploit class, with adversaries using prompt injection and context manipulation against AI-enabled security operations. Together, the reporting shows that enterprise AI risk increasingly depends on controlling untrusted content, hardening agent permissions, and treating prompts, retrieved documents, and downstream integrations as attack surfaces.

Share:
Indirect Prompt Injection and Data Exfiltration Risks in Enterprise AI Agents
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

6 events from the most recent confirmed update back to the earliest known activity.

6 EVENTS
Apr 23, 20262mo ago

Google reports rising malicious prompt injection activity on the public web

Google published an analysis of indirect prompt injection content found in Common Crawl data, concluding that most observed attacks were low-sophistication experiments, pranks, SEO manipulation, or attempts to influence AI summaries rather than mature operational campaigns. The study also identified some malicious examples involving attempted data exfiltration, destructive commands, and anti-agent traps, and reported a 32% relative increase in malicious-category detections between November 2025 and February 2026.

Google Online Security Blog: AI threats in the wild: The current state of prompt injections on the web
Apr 22, 20262mo ago

Forcepoint documents 10 indirect prompt injection payloads seen in the wild

Forcepoint X-Labs published verified examples of 10 web-based indirect prompt injection payloads embedded in webpages to manipulate AI agents. The report detailed attack goals including denial of service, output hijacking, traffic redirection, financial fraud, and destructive command execution, along with concealment methods such as CSS invisibility, HTML comments, accessibility-layer abuse, and metadata poisoning.

Indirect Prompt Injection in the Wild: X-Labs Finds 10 IPI Payloads
Mar 17, 20263mo ago

Research introduces Agent Commander prompt-based C2 for AI agents

Research published by Embrace The Red introduced 'Agent Commander,' a proof-of-concept prompt-based command-and-control framework for compromised AI agents. The work showed how agents including OpenClaw, Kimi Claw, and NanoClaw could be hijacked through indirect prompt injection and maintained through persistence mechanisms such as HEARTBEAT.md changes or scheduled tasks.

Mar 16, 20263mo ago

CNCERT warns OpenClaw's default security posture creates enterprise risk

CNCERT warned that OpenClaw's default configuration poses enterprise risk because the agents can browse, execute tasks, and access local files, increasing the impact of indirect prompt injection attacks. The warning framed the issue as an architectural problem tied to agent autonomy and integrations.

PromptArmor demonstrates zero-click data exfiltration via OpenClaw agents

PromptArmor demonstrated that indirect prompt injection in OpenClaw AI agents could force the agent to generate attacker-controlled links containing sensitive data, which messaging platforms such as Telegram or Discord would automatically fetch via link previews. This created a zero-click exfiltration path for data such as API keys and private conversations.

Jan 15, 20265mo ago

Microsoft patches Copilot Studio prompt injection flaw ShareLeak

Microsoft confirmed in December 2025 an indirect prompt injection vulnerability in Copilot Studio, later assigned CVE-2026-21520 with a CVSS score of 7.5. The company patched the flaw, dubbed ShareLeak, on 2026-01-15 after Capsule Security showed public-facing inputs could hijack agents and exfiltrate sensitive data through authorized tool actions.

Microsoft patched a Copilot Studio prompt injection. The data exfiltrated anyway | VentureBeat
LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

49 LINKEDOpen in app
Malware
2 linked
Affected products
15 linked
Microsoft 365 CopilotClaude CodeChatgptChatgptCopilot StudioStripeFalconGithub CopilotAmazon Web ServicesPaypalCursorDockerOpenclawSharepointClaude
Organizations
30 linked
GoogleForcepointMicrosoft CorporationStripePayPalAnthropicOpenaiGitHubSalesforceAmazon Web ServicesPalo Alto NetworksCursorReputationAtlassianSamsung ElectronicsGitLabNoma SecurityMistral AISuseCrowdStrikeGartnerApplexAIAim SecurityCognition AIVentureBeatWRITERInvariant LabsCommon CrawlCapsule Security
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.