Skip to main content
Live Webinar with SANS (June 25)— Agentic CTI Automation for Fun & ProfitRegister Free
Mallory
Back to intelligence
ai-platform-securityinitial-access-methoddata-exfiltration-method

Google Chrome Gemini AI Agent Enhanced to Counter Prompt Injection Attacks

Updated 3mo agoFirst seen Dec 10, 20253 sources

Google has acknowledged the significant risk of prompt injection attacks targeting its Gemini-powered Chrome browsing agent, which can be manipulated to perform unauthorized actions such as initiating financial transactions or exfiltrating sensitive data. In response, Google has introduced a second AI model, termed the 'user alignment critic,' designed to independently vet the agent's proposed actions before execution. This model operates in isolation from untrusted web content, providing an additional layer of defense against both goal hijacking and data leakage. The move comes as prompt injection has been identified as a leading vulnerability in AI systems, with industry bodies like OWASP and the UK's National Cyber Security Centre highlighting its prevalence and difficulty to mitigate due to the structural limitations of large language models.

The Gemini-powered browsing agent, currently in preview, is capable of navigating websites, clicking buttons, and filling forms while users are logged into sensitive accounts, increasing the potential impact of successful attacks. Security experts and analysts have emphasized the need for robust safeguards, as malicious instructions can be hidden in web pages, iframes, or user-generated content. Google's dual-model approach aims to address these concerns by ensuring that any action not aligned with the user's intent is blocked, thereby reducing the risk of exploitation through prompt injection. The development reflects a broader industry trend of reassessing the security of AI-driven browsers and the need for advanced countermeasures to protect users and organizations from emerging threats.

Share:
Google Chrome Gemini AI Agent Enhanced to Counter Prompt Injection Attacks
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

3 events from the most recent confirmed update back to the earliest known activity.

3 EVENTS
Dec 10, 20256mo ago

Google rolls out additional Chrome AI safeguards and security testing

Alongside the critic model, Google implemented controls such as Agent Origin Sets or origin restrictions, user confirmation for sensitive actions, and automated red-teaming to test the system’s resilience. Google also offered a $20,000 bounty for researchers who successfully bypass these new security boundaries.

Dec 9, 20257mo ago

Google adds User Alignment Critic to vet Chrome AI agent actions

Google introduced a secondary AI model called the User Alignment Critic to review the main Chrome agent’s proposed actions and block those that do not match user intent. The critic is designed to be isolated from untrusted web content as a defense against prompt injection.

Google acknowledges prompt injection risk in Gemini for Chrome

Google recognized indirect prompt injection as a key security risk for its Gemini-powered Chrome browsing agent, including the possibility of goal hijacking and sensitive data exfiltration. The issue was discussed while the browsing agent was still in preview and optional for users.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

8 LINKEDOpen in app
Organizations
8 linked
GartnerGoogleOpen Web Application Security ProjectAppOmniOpenaiServicenowEuropean CommissionNational Cyber Security Centre
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.