Skip to main content
Live Webinar with SANS (June 25)— Agentic CTI Automation for Fun & ProfitRegister Free
Mallory
Back to intelligence
ai-platform-securityai-enabled-threat-activityhealthcare-sector-threatdata-exfiltration-method

Indirect Prompt Injection and Prompt Manipulation Risks in AI Agents

Updated 3mo agoFirst seen Mar 6, 20262 sources

Threat researchers and security experts reported that indirect prompt injection (IDPI) is being actively used in the wild to manipulate AI agents by embedding hidden instructions in otherwise normal-looking web content (e.g., HTML, metadata, comments, or invisible text). Reported impacts include coercing agents into leaking sensitive data, executing unauthorized actions (including server-side commands), and manipulating downstream systems such as AI-based ad review and search ranking workflows (e.g., SEO poisoning and phishing promotion), indicating the technique has moved from theoretical to operational abuse.

Separate testing of a healthcare AI used in a prescription-management context showed how prompt injection can bypass safeguards to reveal system prompts, generate harmful content, and—via persistence mechanisms such as SOAP notes—introduce longer-lived manipulations that could influence clinical outputs (e.g., altering suggested dosages) before human approval. Other items in the set were primarily business/consumer AI commentary (data-management investment surveys, bot-ecosystem interview, and general “dark side of AI” discussion) and did not materially add incident-level or technical detail about prompt-injection exploitation beyond broad risk framing.

Share:
Indirect Prompt Injection and Prompt Manipulation Risks in AI Agents
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

5 events from the most recent confirmed update back to the earliest known activity.

5 EVENTS
Mar 6, 20264mo ago

Researchers identify first known real-world IDPI abuse of AI ad review

Unit 42 cited the first known real-world case of indirect prompt injection being used to bypass an AI-based advertisement review system. The broader research also linked the technique to SEO poisoning, attempted unauthorized financial actions, sensitive data exposure, and destructive server-side commands.

Unit 42 documents indirect prompt injection used in the wild at scale

Palo Alto Networks' Unit 42 reported that indirect prompt injection attacks were being observed in real-world environments at scale. The researchers documented 22 payload-construction techniques and described attacker methods for hiding malicious instructions in ordinary-looking web content processed by AI tools.

Mar 5, 20264mo ago

Doctronic and Utah pilot program respond with safeguard claims

Doctronic and the Utah pilot program said controlled substances cannot be refilled in the current trial and stated that additional safeguards are in place. Their response addressed the reported prompt-injection findings affecting the healthcare AI system.

Researchers show Doctronic AI can be induced to alter prescription output

In a safety-impacting demonstration, researchers showed the AI could be tricked into changing a prescription recommendation, including tripling an OxyContin dosage for later human review. The finding highlighted the potential for prompt injection to influence clinical decision support workflows.

Mindgard demonstrates prompt injection risks in Doctronic healthcare AI

Security researchers reported that Doctronic's prescription-management healthcare AI could be manipulated to reveal system prompts and accept unauthorized instruction changes. They showed both session-limited prompt injection and a persistence method using SOAP notes in clinical records.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

4 LINKEDOpen in app
Organizations
4 linked
Palo Alto NetworksThe RegisterMindgardDoctronic
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.