ai-platform-securityidentity-authentication-vulnerabilityphishing-campaign-intelligencedata-exfiltration-method

Indirect Prompt Injection Flaws Expose AI Browsers and Assistants to Data Theft

Updated 13d agoFirst seen Mar 24, 202619 sources

Researchers and security outlets reported multiple indirect prompt injection weaknesses affecting AI-driven browsing and assistant features, showing how hidden instructions embedded in untrusted web content can manipulate model behavior and steer users into credential theft or data exposure. Cato Networks disclosed WebPromptTrap in BrowserOS, where malicious webpage content abused Agent Chat Mode summarization to insert a convincing call to action and attacker-selected link; in the proof of concept, victims could be pushed into a GitHub authorization flow that exposed access tokens and repository access. Cato said the issue affected BrowserOS 0.30.0 and earlier, was identified in 0.29.0, and was fixed in 0.32.0 after responsible disclosure.

Separate reporting described similar risks in OpenAI's ChatGPT Atlas browser and in Microsoft Copilot, underscoring that the problem extends beyond a single product. LayerX said Atlas could be fed malicious instructions through web content in a "tainted memories" attack, while Axios, TechSpot, and commentary from OpenAI CISO Dane Stuckey highlighted broader security and privacy concerns around prompt injection in AI browsers. Varonis also detailed Reprompt, a single-click Copilot attack that could silently exfiltrate personal data. Together, the disclosures show that AI systems that summarize pages, retain context, or act on behalf of users can be turned into phishing and data-theft intermediaries unless untrusted content is strictly isolated from model instructions.

Indirect Prompt Injection Flaws Expose AI Browsers and Assistants to Data Theft

Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

Start free trial

EVENT TIMELINE

How this story unfolded

9 events from the most recent confirmed update back to the earliest known activity.

9 EVENTS

Jun 3, 202619d ago

SafeBreach discloses prompt injection attack on Google Gemini notifications

SafeBreach disclosed an indirect prompt injection technique affecting Google Gemini's notification summarization and voice assistant features, showing how hidden instructions in muted hyperlinks or invisible foreign-language text could misrepresent messages and potentially trigger unauthorized actions. The researchers said Google was notified through responsible disclosure and deployed content-classifier updates; the article said there was no evidence of in-the-wild exploitation.

Malicious Notifications Could Trick Google Gemini Users

May 29, 202624d ago

Permiso discloses ChatGPhish prompt injection in ChatGPT page summaries

Permiso published research on 'ChatGPhish,' a browser-based prompt injection technique in which attacker-controlled webpage text influences ChatGPT's page summarization output and causes phishing links, QR codes, spoofed alerts, and remote image fetches to appear inside the trusted ChatGPT interface. The researchers said they reported the issue to OpenAI via Bugcrowd in late April and early May 2026 before publishing their findings.

ChatGPhish: The Page Is the Payload

Mar 24, 20263mo ago

Cato publicly discloses WebPromptTrap indirect prompt injection technique

Cato published details of WebPromptTrap, including a proof of concept showing how a manipulated AI-generated summary could steer a victim into a malicious GitHub authorization flow that yields access tokens and repository access. The researchers warned the same attack pattern could be adapted to other enterprise roles and SaaS platforms such as ERP, HR, payroll, CRM, and service management systems.

BrowserOS team fixes WebPromptTrap in version 0.32.0

After responsible disclosure from Cato, the BrowserOS team remediated the WebPromptTrap issue in BrowserOS 0.32.0. Cato said the vulnerability affected BrowserOS 0.30.0 and earlier.

BrowserOS vulnerable to WebPromptTrap in version 0.29.0

Cato researchers identified an indirect prompt injection issue in BrowserOS while testing version 0.29.0. The flaw abused Agent Chat Mode page summarization by embedding hidden instructions in untrusted webpage content.

Oct 21, 20258mo ago

Brave discloses screenshot-based prompt injections in Comet

Brave published research describing 'unseeable prompt injections in screenshots' affecting its Comet AI browser and possibly other AI browsers. The disclosure introduced a distinct prompt-injection technique using screenshot content rather than hidden webpage text.

Unseeable prompt injections in screenshots: more vulnerabilities in Comet and other AI browsers | Brave

Feb 17, 20251y ago

ChatGPT Operator prompt injection exploits and defenses discussed

A reference published on 2025-02-17 documented prompt injection exploit scenarios and defensive considerations for ChatGPT Operator. This is a distinct earlier development in the broader prompt-injection story, separate from Cato's later BrowserOS WebPromptTrap findings.

ChatGPT Operator: Prompt Injection Exploits & Defenses

Aug 20, 20242y ago

Slack AI data exfiltration via indirect prompt injection disclosed

A reference published on 2024-08-20 documented how indirect prompt injection could be used to exfiltrate data from Slack AI. The disclosure added another concrete example of prompt-injection abuse affecting enterprise AI assistants.

Data Exfiltration from Slack AI via indirect prompt injection

Jun 16, 20242y ago

GitHub Copilot Chat prompt injection data exfiltration disclosed

A June 2024 reference documented how prompt injection against GitHub Copilot Chat could be used to exfiltrate data, marking an earlier concrete example of prompt-injection abuse in AI coding assistants. This predates the ChatGPT Operator and BrowserOS prompt-injection developments already in the timeline.

GitHub Copilot Chat: From Prompt Injection to Data Exfiltration

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

4 LINKEDOpen in app

Affected products

1 linked

Github

Organizations

3 linked

Cato NetworksGitHubBrowserOS

SOURCE COVERAGE

Sources

19 references tracked. Mallory keeps watching after this page renders.

19 SOURCESView all

Cysecurity NewsNews

Jun 9, 2026

Researcher Warns of ‘ChatGPhish’ Vulnerability That Could Turn Web Summaries Into Phishing Attacks - CySecurity News - Latest Information Security and Hacking Incidents

cysecurity.news

Open source

Dark ReadingNews

Jun 3, 2026

Malicious Notifications Could Trick Google Gemini Users

darkreading.com

Open source

Pivot ToNews

Jun 1, 2026

Prompt-inject ChatGPT with any web page - Pivot to AI

pivot-to-ai.com

Open source

Cysecurity NewsNews

May 30, 2026

Researchers Show How ChatGPT Summaries Could Be Used for Phishing Attacks - CySecurity News - Latest Information Security and Hacking Incidents

cysecurity.news

Open source

11 additional sources from 21-10-2025 to 29-05-2026

Simon Willison BlogNews

Feb 17, 2025

ChatGPT Operator: Prompt Injection Exploits & Defenses

simonwillison.net

Open source

Simon Willison BlogNews

Aug 20, 2024

Data Exfiltration from Slack AI via indirect prompt injection

simonwillison.net

Open source

Simon Willison BlogNews

Jun 16, 2024

GitHub Copilot Chat: From Prompt Injection to Data Exfiltration

simonwillison.net

Open source

Simon Willison BlogNews

May 19, 2023

Let ChatGPT visit a website and have your email stolen

simonwillison.net

Open source

The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.

Start free trial

Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.

Indirect Prompt Injection Flaws Expose AI Browsers and Assistants to Data Theft

Get ahead of threats like this

How this story unfolded

SafeBreach discloses prompt injection attack on Google Gemini notifications

Permiso discloses ChatGPhish prompt injection in ChatGPT page summaries

Cato publicly discloses WebPromptTrap indirect prompt injection technique

BrowserOS team fixes WebPromptTrap in version 0.32.0

BrowserOS vulnerable to WebPromptTrap in version 0.29.0

Brave discloses screenshot-based prompt injections in Comet

ChatGPT Operator prompt injection exploits and defenses discussed

Slack AI data exfiltration via indirect prompt injection disclosed

GitHub Copilot Chat prompt injection data exfiltration disclosed

Related entities

Sources

Researcher Warns of ‘ChatGPhish’ Vulnerability That Could Turn Web Summaries Into Phishing Attacks - CySecurity News - Latest Information Security and Hacking Incidents

Malicious Notifications Could Trick Google Gemini Users

Prompt-inject ChatGPT with any web page - Pivot to AI

Researchers Show How ChatGPT Summaries Could Be Used for Phishing Attacks - CySecurity News - Latest Information Security and Hacking Incidents

ChatGPT Operator: Prompt Injection Exploits & Defenses

Data Exfiltration from Slack AI via indirect prompt injection

GitHub Copilot Chat: From Prompt Injection to Data Exfiltration

Let ChatGPT visit a website and have your email stolen

See the full picture, correlated to your attack surface.

Related stories

Prompt Injection Risks in Agentic AI and AI-Powered Browsers

Prompt Injection and Persistent Memory Exploits in AI-Powered Browsers

Prompt Injection Attacks Abuse AI Agent Memory and Link Previews for Manipulation and Data Exfiltration