Pwn2Own Berlin Exposes Host-Level Risks in AI Developer Tools and Infrastructure
Pwn2Own Berlin 2026 showed that generative AI has become both an attack target and an offensive aid, as contestants used LLMs and coding agents to accelerate code review, translation, and exploit development while probing AI products including Claude Code, OpenAI Codex, Cursor, Ollama, ChromaDB, LM Studio, Megatron Bridge, and Nvidia Container Toolkit. Researchers reported that AI assistance improved speed but still generated many false positives during bug discovery, reinforcing that human validation remains necessary in vulnerability research.
The competition highlighted recurring weaknesses across AI ecosystems, particularly overprivileged developer tools, unsafe trust relationships between agents and users, and exposed infrastructure that can turn an application flaw into host-level compromise. Internet-facing deployments such as Ollama and ChromaDB drew attention as examples of expanding attack surface, while supply chain risk and AI-assisted "vibe coding" were cited as factors likely to introduce additional security flaws into rapidly deployed AI environments.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
1 event from the most recent confirmed update back to the earliest known activity.
Pwn2Own Berlin 2026 showcases attacks on GenAI products
At Pwn2Own Berlin 2026, contestants targeted AI-related products including Claude Code, OpenAI Codex, Cursor, Ollama, ChromaDB, LM Studio, Megatron Bridge, and Nvidia Container Toolkit. The event also highlighted researchers using LLMs and coding agents to assist vulnerability research and exploit development.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
1 reference tracked. Mallory keeps watching after this page renders.
See the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.

