Prompt injection and multimodal 'promptware' attacks against LLM-based systems

EVENT TIMELINE

How this story unfolded

2 events from the most recent confirmed update back to the earliest known activity.

2 EVENTS

Feb 13, 20264mo ago

Lawfare outlines a broader 'promptware kill chain' for LLM attacks

A Lawfare analysis argued that so-called prompt injection should be understood as a broader malware-like execution mechanism called 'promptware' and proposed a seven-stage kill chain for LLM and agent compromises. It cited prior demonstrations involving a malicious Google Calendar invite and an email-borne self-replicating prompt as examples of multistage compromise and recommended defense-in-depth rather than assuming prompt injection can be fully fixed.

Feb 11, 20264mo ago

Researchers introduce CHAI attacks against embodied AI systems

Academic research described a new attack class called Command Hijacking Against Embodied AI (CHAI), which embeds deceptive natural-language instructions in visual inputs such as road signs to manipulate LVLM-driven agents. The work reported tests against multiple embodied AI scenarios, including autonomous driving, drone emergency landing, aerial tracking, and a real robotic vehicle.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

6 LINKEDOpen in app

Malware

2 linked

Stuxnet NotPetya

Affected products

1 linked

Zoom

Organizations

3 linked

Zoom CommunicationsOpenaiGoogle

SOURCE COVERAGE

Sources

2 references tracked. Mallory keeps watching after this page renders.

2 SOURCESView all

LawfareNews

Feb 13, 2026

The Promptware Kill Chain | Lawfare

lawfaremedia.org

Open source

Schneier On SecurityNews

Feb 11, 2026

Prompt Injection Via Road Signs - Schneier on Security

schneier.com

Open source

ON THE SAME THREAD

Researchers and security commentators warned that **prompt injection** remains a fundamental weakness in today’s large language models (LLMs), where carefully crafted inputs can override guardrails and elicit restricted actions or sensitive data. Bruce Schneier described prompt injection as an inherent, open-ended attack surface—ranging from obvious “ignore previous instructions” phrasing to more indirect techniques such as embedding malicious instructions in *ASCII art* or in text rendered inside images—arguing that point fixes for individual tricks do not provide universal protection with current LLM approaches. New academic work highlighted that the same class of failures can extend into the physical world for **embodied AI** (e.g., robots, autonomous vehicles) through “**environmental indirect prompt injection**,” where misleading text placed on signs, posters, or objects is ingested by vision-language perception systems and treated as actionable instructions, potentially influencing real-world behavior. A separate TechXplore piece focused more broadly on *responsible/ethical AI* practices in industry (an interview format) and did not materially add technical detail on prompt injection or a specific security incident, making it less relevant to the core story about prompt-injection-driven hijacking risks and emerging attack pathways.

Mar 21, 2026

Prompt Injection Attacks and Security Challenges in AI Systems

Prompt injection has emerged as a critical security concern in the deployment of large language models (LLMs) and AI agents, with attackers exploiting the way these systems interpret and execute instructions. Security researchers have drawn parallels between prompt injection and earlier vulnerabilities like SQL injection, highlighting its potential to undermine the intended behavior of AI models. Prompt injection involves manipulating the input prompts to override or bypass the system-level instructions set by developers, leading to unauthorized actions or data leakage. The attack surface is broad, as LLMs are increasingly integrated into applications and workflows, making them attractive targets for adversaries. Multiple organizations, including OpenAI, Microsoft, and Anthropic, have initiated efforts to address prompt injection, but the problem remains unsolved due to the complexity and adaptability of AI models. Real-world demonstrations have shown that prompt injection can be used to break out of agentic applications, bypass browser security rules, and even persistently compromise AI systems through mechanisms like memory manipulation. Security conferences such as BlackHat USA 2024 have featured research on exploiting AI-powered tools like Microsoft 365 Copilot, where attackers can escalate privileges or exfiltrate data by crafting malicious prompts or leveraging markdown image vectors. Researchers have also identified that AI agents can be tricked into ignoring browser security policies, such as CORS, leading to potential cross-origin data leaks. Defensive measures, such as intentionally limiting AI capabilities or implementing stricter input filtering, have been adopted by some vendors, but these often come at the cost of reduced functionality. The security community is actively developing standards, such as the OWASP Agent Observability Standard, to improve monitoring and detection of prompt injection attempts. Despite these efforts, adversaries continue to find novel ways to exploit prompt injection, including dynamic manipulation of tool descriptions and bypassing image filtering mechanisms. The rapid evolution of AI technologies and the proliferation of agentic applications have made it challenging to keep pace with emerging threats. Security researchers emphasize the need for ongoing vigilance, robust testing, and collaboration across the industry to mitigate the risks associated with prompt injection. The use of AI in sensitive environments, such as enterprise productivity suites and web browsers, amplifies the potential impact of successful attacks. As AI adoption accelerates, organizations must prioritize understanding and defending against prompt injection to safeguard their systems and data. The ongoing research and public disclosures serve as a call to action for both developers and defenders to address this evolving threat landscape.

Mar 21, 2026

Indirect Prompt Injection and AI Agent Abuse Expands Real-World Attack Surface

Security researchers and industry reporting describe **prompt injection—especially web-based indirect prompt injection (IDPI)**—as an increasingly practical technique for compromising or manipulating **LLM-powered agents** embedded in browsers and automated content pipelines. Palo Alto Networks Unit 42 reported in-the-wild IDPI activity where malicious instructions are hidden in web content that an agent later ingests, with observed objectives including **AI-based ad review evasion** and **SEO manipulation** that promotes phishing infrastructure. Separately, Zenity Labs detailed a now-patched issue in Perplexity’s *Comet* AI browser where attackers could embed instructions in a **calendar invite** to coerce the agent into accessing `file://` resources and potentially pivoting into sensitive data such as an unlocked **1Password** extension vault, illustrating how agentic tooling can bypass traditional browser-origin assumptions. Threat reporting also shows adversaries operationalizing AI to scale exploitation. Team Cymru linked an AI-assisted Fortinet FortiGate targeting campaign (previously reported by Amazon Threat Intelligence as compromising **600+ devices across 55 countries** using services like **Claude** and **DeepSeek**) to use of **CyberStrikeAI**, an open-source Go-based platform that integrates 100+ security tools and was observed from multiple IPs (primarily hosted in China/Singapore/Hong Kong, with additional infrastructure elsewhere). Multiple commentaries and briefings emphasize that conventional “filter the prompt” defenses are insufficient because LLMs lack a native separation between instructions and data; they call for **defense-in-depth** around AI pipelines, including least-privilege agent permissions, auditable tool use, and stronger identity/workload controls as agent deployments multiply. Several items in the set are unrelated (geopolitical cyber activity, workforce/culture pieces, jobs, and product/market commentary) and do not materially inform the prompt-injection/agent-abuse story.

Mar 21, 2026

Prompt injection and multimodal 'promptware' attacks against LLM-based systems

Get ahead of threats like this

How this story unfolded

Lawfare outlines a broader 'promptware kill chain' for LLM attacks

Researchers introduce CHAI attacks against embodied AI systems

Related entities

Sources

The Promptware Kill Chain | Lawfare

Prompt Injection Via Road Signs - Schneier on Security

See the full picture, correlated to your attack surface.

Prompt injection and multimodal 'promptware' attacks against LLM-based systems

Get ahead of threats like this

How this story unfolded

Lawfare outlines a broader 'promptware kill chain' for LLM attacks

Researchers introduce CHAI attacks against embodied AI systems

Related entities

Sources

The Promptware Kill Chain | Lawfare

Prompt Injection Via Road Signs - Schneier on Security

See the full picture, correlated to your attack surface.

Related stories

Prompt Injection Risks Expand From LLMs to Embodied AI via Environmental Text

Prompt Injection Attacks and Security Challenges in AI Systems

Indirect Prompt Injection and AI Agent Abuse Expands Real-World Attack Surface