Skip to main content
Live Webinar with SANS (June 25)— Agentic CTI Automation for Fun & ProfitRegister Free
Mallory
Back to intelligence
ai-platform-securityidentity-authentication-vulnerabilityphishing-campaign-intelligencedata-exfiltration-method

Indirect Prompt Injection Flaws Expose AI Browsers and Assistants to Data Theft

Updated 13d agoFirst seen Mar 24, 202619 sources

Researchers and security outlets reported multiple indirect prompt injection weaknesses affecting AI-driven browsing and assistant features, showing how hidden instructions embedded in untrusted web content can manipulate model behavior and steer users into credential theft or data exposure. Cato Networks disclosed WebPromptTrap in BrowserOS, where malicious webpage content abused Agent Chat Mode summarization to insert a convincing call to action and attacker-selected link; in the proof of concept, victims could be pushed into a GitHub authorization flow that exposed access tokens and repository access. Cato said the issue affected BrowserOS 0.30.0 and earlier, was identified in 0.29.0, and was fixed in 0.32.0 after responsible disclosure.

Separate reporting described similar risks in OpenAI's ChatGPT Atlas browser and in Microsoft Copilot, underscoring that the problem extends beyond a single product. LayerX said Atlas could be fed malicious instructions through web content in a "tainted memories" attack, while Axios, TechSpot, and commentary from OpenAI CISO Dane Stuckey highlighted broader security and privacy concerns around prompt injection in AI browsers. Varonis also detailed Reprompt, a single-click Copilot attack that could silently exfiltrate personal data. Together, the disclosures show that AI systems that summarize pages, retain context, or act on behalf of users can be turned into phishing and data-theft intermediaries unless untrusted content is strictly isolated from model instructions.

Share:
Indirect Prompt Injection Flaws Expose AI Browsers and Assistants to Data Theft
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

9 events from the most recent confirmed update back to the earliest known activity.

9 EVENTS
Jun 3, 202619d ago

SafeBreach discloses prompt injection attack on Google Gemini notifications

SafeBreach disclosed an indirect prompt injection technique affecting Google Gemini's notification summarization and voice assistant features, showing how hidden instructions in muted hyperlinks or invisible foreign-language text could misrepresent messages and potentially trigger unauthorized actions. The researchers said Google was notified through responsible disclosure and deployed content-classifier updates; the article said there was no evidence of in-the-wild exploitation.

Malicious Notifications Could Trick Google Gemini Users
May 29, 202624d ago

Permiso discloses ChatGPhish prompt injection in ChatGPT page summaries

Permiso published research on 'ChatGPhish,' a browser-based prompt injection technique in which attacker-controlled webpage text influences ChatGPT's page summarization output and causes phishing links, QR codes, spoofed alerts, and remote image fetches to appear inside the trusted ChatGPT interface. The researchers said they reported the issue to OpenAI via Bugcrowd in late April and early May 2026 before publishing their findings.

ChatGPhish: The Page Is the Payload
Mar 24, 20263mo ago

Cato publicly discloses WebPromptTrap indirect prompt injection technique

Cato published details of WebPromptTrap, including a proof of concept showing how a manipulated AI-generated summary could steer a victim into a malicious GitHub authorization flow that yields access tokens and repository access. The researchers warned the same attack pattern could be adapted to other enterprise roles and SaaS platforms such as ERP, HR, payroll, CRM, and service management systems.

BrowserOS team fixes WebPromptTrap in version 0.32.0

After responsible disclosure from Cato, the BrowserOS team remediated the WebPromptTrap issue in BrowserOS 0.32.0. Cato said the vulnerability affected BrowserOS 0.30.0 and earlier.

BrowserOS vulnerable to WebPromptTrap in version 0.29.0

Cato researchers identified an indirect prompt injection issue in BrowserOS while testing version 0.29.0. The flaw abused Agent Chat Mode page summarization by embedding hidden instructions in untrusted webpage content.

Oct 21, 20258mo ago

Brave discloses screenshot-based prompt injections in Comet

Brave published research describing 'unseeable prompt injections in screenshots' affecting its Comet AI browser and possibly other AI browsers. The disclosure introduced a distinct prompt-injection technique using screenshot content rather than hidden webpage text.

Unseeable prompt injections in screenshots: more vulnerabilities in Comet and other AI browsers | Brave
Feb 17, 20251y ago

ChatGPT Operator prompt injection exploits and defenses discussed

A reference published on 2025-02-17 documented prompt injection exploit scenarios and defensive considerations for ChatGPT Operator. This is a distinct earlier development in the broader prompt-injection story, separate from Cato's later BrowserOS WebPromptTrap findings.

ChatGPT Operator: Prompt Injection Exploits & Defenses
Aug 20, 20242y ago

Slack AI data exfiltration via indirect prompt injection disclosed

A reference published on 2024-08-20 documented how indirect prompt injection could be used to exfiltrate data from Slack AI. The disclosure added another concrete example of prompt-injection abuse affecting enterprise AI assistants.

Data Exfiltration from Slack AI via indirect prompt injection
Jun 16, 20242y ago

GitHub Copilot Chat prompt injection data exfiltration disclosed

A June 2024 reference documented how prompt injection against GitHub Copilot Chat could be used to exfiltrate data, marking an earlier concrete example of prompt-injection abuse in AI coding assistants. This predates the ChatGPT Operator and BrowserOS prompt-injection developments already in the timeline.

GitHub Copilot Chat: From Prompt Injection to Data Exfiltration
LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

4 LINKEDOpen in app
Affected products
1 linked
Github
Organizations
3 linked
Cato NetworksGitHubBrowserOS
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.