Google Chrome Gemini AI Agent Enhanced to Counter Prompt Injection Attacks
Google has acknowledged the significant risk of prompt injection attacks targeting its Gemini-powered Chrome browsing agent, which can be manipulated to perform unauthorized actions such as initiating financial transactions or exfiltrating sensitive data. In response, Google has introduced a second AI model, termed the 'user alignment critic,' designed to independently vet the agent's proposed actions before execution. This model operates in isolation from untrusted web content, providing an additional layer of defense against both goal hijacking and data leakage. The move comes as prompt injection has been identified as a leading vulnerability in AI systems, with industry bodies like OWASP and the UK's National Cyber Security Centre highlighting its prevalence and difficulty to mitigate due to the structural limitations of large language models.
The Gemini-powered browsing agent, currently in preview, is capable of navigating websites, clicking buttons, and filling forms while users are logged into sensitive accounts, increasing the potential impact of successful attacks. Security experts and analysts have emphasized the need for robust safeguards, as malicious instructions can be hidden in web pages, iframes, or user-generated content. Google's dual-model approach aims to address these concerns by ensuring that any action not aligned with the user's intent is blocked, thereby reducing the risk of exploitation through prompt injection. The development reflects a broader industry trend of reassessing the security of AI-driven browsers and the need for advanced countermeasures to protect users and organizations from emerging threats.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
3 events from the most recent confirmed update back to the earliest known activity.
Google rolls out additional Chrome AI safeguards and security testing
Alongside the critic model, Google implemented controls such as Agent Origin Sets or origin restrictions, user confirmation for sensitive actions, and automated red-teaming to test the system’s resilience. Google also offered a $20,000 bounty for researchers who successfully bypass these new security boundaries.
Google adds User Alignment Critic to vet Chrome AI agent actions
Google introduced a secondary AI model called the User Alignment Critic to review the main Chrome agent’s proposed actions and block those that do not match user intent. The critic is designed to be isolated from untrusted web content as a defense against prompt injection.
Google acknowledges prompt injection risk in Gemini for Chrome
Google recognized indirect prompt injection as a key security risk for its Gemini-powered Chrome browsing agent, including the possibility of goal hijacking and sensitive data exfiltration. The issue was discussed while the browsing agent was still in preview and optional for users.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
3 references tracked. Mallory keeps watching after this page renders.
구글, 크롬 제미나이에 추가 감시형 AI 도입···프롬프트 인젝션 대비 강화
cio.com
Open sourceGoogle Chrome’s New AI Security Aims to Stop Hackers Cold
techrepublic.com
Open sourceGemini for Chrome gets a second AI agent to watch over it
csoonline.com
Open sourceSee the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.


