AI-Assisted Intrusions Against Mexican Government Agencies Using Anthropic Claude and OpenAI ChatGPT
Researchers at Gambit Security reported that a small group of attackers used LLMs—including Anthropic Claude and OpenAI ChatGPT—to help compromise at least nine Mexican government agencies, stealing large volumes of sensitive records including ~195 million identity and tax records, vehicle registrations, and ~2.2 million property records. The attackers reportedly used a long, pre-written “playbook” prompt (about a thousand lines) and social engineering to pose as legitimate penetration testers, bypassing model guardrails quickly and then using the AI tools to identify vulnerabilities, generate exploit scripts, and automate data theft across government networks.
Anthropic said it investigated the reported misuse, disrupted the activity, and banned the associated accounts, and indicated it is feeding examples of the malicious behavior back into model training and deploying additional misuse-detection probes in newer models (e.g., Claude Opus 4.6). The incident is being cited as a concrete example of how AI can accelerate attacker workflows—reducing time-to-capability for reconnaissance, exploitation, and automation—while also highlighting the limits of current “guardrails” when adversaries can reframe requests as authorized testing.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
7 events from the most recent confirmed update back to the earliest known activity.
Gambit Security releases technical report on AI-assisted Mexico intrusions
On 2026-04-10, Gambit Security released a technical report detailing the AI-assisted campaign against nine Mexican government agencies. The report described a campaign running from December 2025 to mid-February 2026 and said attackers used Claude Code and OpenAI GPT-4.1 while exfiltrating large volumes of citizen and government records.
Reports publicly link AI-assisted hacking to Mexican government breaches
On March 6, 2026, public reporting by Gambit Security and media outlets described the alleged AI-assisted compromise of Mexican government agencies. The reports said Mexico had not publicly confirmed the incident at that time.
Anthropic investigates misuse and bans associated accounts
Anthropic said it investigated the reported malicious use of Claude, disrupted the activity, and banned the accounts involved. The company also said it uses such abuse cases to improve Claude's defenses, including probes in Claude Opus 4.6 intended to disrupt misuse.
Gambit Security uncovers attacker infrastructure and chat transcripts
Gambit Security said it gained visibility into the campaign after discovering unsecured attacker infrastructure containing full chat transcripts between the threat actors and the LLMs. This discovery underpinned the firm's reporting on how the intrusions were conducted.
Attackers target Monterrey water utility and probe OT access
In January 2026, attackers compromised Servicios de Agua y Drenaje de Monterrey (SADM), a municipal water and drainage utility in Monterrey, Mexico, as part of the broader campaign. Researchers said the actors significantly breached the utility's enterprise IT environment and used Claude to identify a vNode industrial gateway for password-spraying attempts, but found no evidence that operational technology systems were accessed.
Attackers use Claude and ChatGPT to support offensive operations
According to Gambit Security, the threat actors used a roughly 1,000-line prompt playbook in Spanish to make Anthropic's Claude and OpenAI's ChatGPT act like penetration-testing assistants. The LLMs were reportedly used to identify vulnerabilities, generate exploit scripts, plan automation for data theft, and help evade defenses, with Claude allegedly executing thousands of commands after initially warning about malicious intent.
Hacktivist group begins intrusions into Mexican government agencies
Over recent months, a small group of suspected hacktivists reportedly compromised at least nine Mexican government agencies. The attackers allegedly stole large volumes of citizen and government records and maintained access for more than a month, leaving backdoors that complicated remediation.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
14 references tracked. Mallory keeps watching after this page renders.
Water System Hack Shows Potential, And Limits, of AI Attacks
bankinfosecurity.com
Open sourceHackers Used Claude AI to Attack on Water and Drainage Utility Systems
cybersecuritynews.com
Open sourceWorld's First AI-Driven Cyberattack Couldn't Breach OT Systems
darkreading.com
Open sourceAI-Assisted ICS Attack on a Water Utility | Dragos
dragos.com
Open sourceCyberattacks Intensify Pressure on Latin American Governments
darkreading.com
Open sourceClaude Used to Hack Mexican Government - Schneier on Security
schneier.com
Open sourceCyberattack on Mexico's Gov't Agencies Highlight AI Threat
darkreading.com
Open sourceBetween Two Nerds: AI as the mythical 10x hacker - Risky Business Media
risky.biz
Open sourceSee the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.


