Skip to main content
Live Webinar with SANS (June 25)— Agentic CTI Automation for Fun & ProfitRegister Free
Mallory
Back to intelligence

Attackers Abuse AI Services and Exposed LLM Infrastructure for Intrusions and Compute Theft

Updated 28d agoFirst seen May 24, 20264 sources

Researchers and vendors reported a sharp rise in attacks targeting AI infrastructure, from exposed self-hosted LLM servers to commercial coding assistants used in real intrusions. A Kaspersky honeypot posing as a private AI server with services including Ollama, LM Studio, LangServe, text-generation-webui, OpenAI-compatible APIs, RAG databases, and an MCP server was indexed by Shodan within hours and drew more than 113,000 requests in a month from thousands of IP addresses. Nearly a quarter of the traffic focused on discovering and exploiting AI capabilities, with attackers attempting to consume inference resources, analyze documents, generate content, process vulnerability data, proxy requests to external models, and steal secrets from exposed .env files using tooling such as LLM-Scanner.

The broader threat has moved beyond opportunistic abuse into targeted operations. Anthropic said suspected Chinese state-linked operators misused Claude Code against 30 high-value organizations across technology, finance, chemicals, and government, using the AI assistant to test systems, generate attack code, harvest credentials, identify privileged accounts and backdoors, and support deeper intrusion activity; the company said the tool performed 80% to 90% of the work in some cases. Separately, defenders were urged to rapidly patch internet-exposed management infrastructure after CVE-2026-41940 in cPanel/WHM was exploited in the wild, allowing pre-authentication root access in as few as four HTTP requests and reportedly being used in ransomware activity, underscoring how exposed services and AI-enabled tradecraft are converging into a faster, more scalable attack model.

Share:
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

10 events from the most recent confirmed update back to the earliest known activity.

10 EVENTS
May 12, 20261mo ago

AI honeypot records month-long large-scale scanning and abuse

Over the month following deployment, Kaspersky's honeypot received more than 113,000 requests from thousands of IP addresses, with 23% focused on discovering and exploiting AI capabilities. Observed abuse centered on free use of AI resources, including document analysis, erotic content generation, vulnerability-data processing, and attempts to proxy requests to Anthropic models.

Shodan indexes the AI honeypot within hours of exposure

Kaspersky reported that the honeypot was indexed by Shodan within three hours of being exposed to the internet. Reconnaissance activity began roughly an hour later, showing how quickly exposed AI services are discovered.

Apr 30, 20262mo ago

CISA adds CVE-2026-41940 to KEV catalog

Two days after the patch, CISA added CVE-2026-41940 to its Known Exploited Vulnerabilities catalog and set a 72-hour remediation deadline. The agency noted known ransomware use of the flaw.

Apr 28, 20262mo ago

cPanel releases emergency patch for CVE-2026-41940

On 2026-04-28, cPanel issued an emergency fix for CVE-2026-41940, a critical vulnerability rated CVSS 9.3 that could let attackers gain root access to WHM without credentials in as few as four HTTP requests. The patch changed session sanitization behavior and added protections for malformed cookies used to disable encryption and inject session fields.

Feb 25, 20264mo ago

cPanel flaw CVE-2026-41940 exploited in the wild by late February

watchTowr Labs reported that CVE-2026-41940, a critical cPanel and WHM authentication bypass, had been exploited in the wild since at least late February 2026. The bug allowed pre-authentication root access through flaws in session handling, including CRLF injection and session cache inconsistencies.

Jan 30, 20265mo ago

Researchers disclose 175,000 publicly exposed Ollama AI servers

A January 2026 report said more than 175,000 Ollama AI servers were publicly exposed on the internet worldwide, highlighting broad attack surface risk from unsecured self-hosted AI infrastructure. The disclosure warned administrators to lock down exposed instances to prevent unauthorized access and abuse.

Over 175,000 publicly exposed Ollama AI servers discovered worldwide - so fix now | TechRadar
Sep 15, 20259mo ago

Anthropic blocks accounts and notifies victims after campaign discovery

After detecting the abuse, Anthropic blocked the implicated Claude Code accounts, notified affected organizations, cooperated with authorities, and introduced additional controls to prevent similar misuse. The company also asserted with high confidence that the operators acted under direction from Beijing.

Anthropic detects Claude Code espionage campaign

Anthropic detected the campaign in mid-September 2025 and described it as the first documented case in which an AI agent successfully gained access to confirmed high-value targets for intelligence collection. The company said most intrusion attempts were unsuccessful and that attackers bypassed safeguards by breaking malicious goals into smaller benign-looking tasks.

Chinese-linked actors begin abusing Claude Code against high-value targets

Anthropic said suspected Chinese hackers used its Claude Code AI coding service in a campaign targeting 30 high-value organizations worldwide, including technology firms, financial institutions, chemical manufacturers, and government agencies. The company said the AI was used for vulnerability testing, attack-code generation, credential collection, privilege discovery, and deeper intrusion support.

Kaspersky launches April honeypot posing as a private AI server

In April 2026, Kaspersky ran a honeypot experiment using a Raspberry Pi configured to imitate an exposed local AI server with services such as Ollama, LM Studio, AutoGPT, LangServe, text-gen-webui, OpenAI-compatible APIs, RAG databases, and an MCP server. The goal was to observe reconnaissance and abuse targeting exposed AI infrastructure.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

23 LINKEDOpen in app
Affected products
5 linked
Wp SquaredCpanel & WhmOllamaAutogptNode.Js
Organizations
17 linked
ShodanTrend MicroProjectdiscoveryRapid7International Business MachinesCloudflarePicus SecurityWordpressNameCheapHostPapaCpanelWatchTowrHadrianKnownHostHosting.comInMotionAnthropic
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.