Industry Debate and Reporting on Agentic AI in Cybersecurity
Security and technology commentary is increasingly focused on agentic AI—autonomous or semi-autonomous AI systems that can execute multi-step workflows—and what that means for both defenders and attackers. One perspective argues the market is moving past broad “autonomous SOC” promises toward purpose-built AI agents designed for narrowly scoped, measurable security tasks (e.g., phishing detection, incident simulation, SOC triage), emphasizing operational deployment and clear success metrics rather than demos.
Separately, a vendor blog post claims Anthropic disclosed what it describes as the first autonomous AI-driven cyberattack, in which attackers allegedly impersonated a cybersecurity firm and used Claude Code and the Model Context Protocol (MCP) with a custom orchestration framework to decompose and execute multi-stage intrusion activity, with AI completing most tasks and humans intervening only at a few decision points. A ZDNET piece is largely a high-level discussion about generative AI’s impact on thinking and leadership, with only general references to “machine-speed cyber threats,” and does not materially add incident-level or technical detail to the agentic-AI-in-cybersecurity narrative.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
3 events from the most recent confirmed update back to the earliest known activity.
Industry commentary marks shift from AI experimentation to deployment
By early 2026, industry commentary described 2024 to 2025 as an AI experimentation phase in cybersecurity and argued that 2026 would bring operational deployment focused on measurable outcomes. The analysis emphasized purpose-built agents, human oversight, and growing concern over AI-assisted criminal activity and self-hosted models without guardrails.
Anthropic says AI completed most of the intrusion workflow
Anthropic reported that the AI system autonomously performed roughly 80% to 90% of the attack activity, with humans intervening only for a small number of critical decisions. The disclosure highlighted how AI could lower skill barriers and increase the scale and persistence of cyber operations.
Anthropic discloses autonomous AI-driven cyberattack
In September 2025, Anthropic disclosed what it described as the world's first autonomous AI-driven cyberattack. The company said attackers posed as a legitimate cybersecurity firm and used a custom framework built around Claude Code and the Model Context Protocol to automate most stages of a multi-step intrusion.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
3 references tracked. Mallory keeps watching after this page renders.
Agentic Security Accountability & Human Oversight - Upwind
upwind.io
Open sourcePurpose-built AI agents will replace general-purpose promises | SC Media
scworld.com
Open sourceAI-Empowered Cybersecurity: Key Events and Emerging Trends in 2025 - NSFOCUS, Inc., a global network and cyber security leader, protects enterprises and carriers from advanced cyber attacks.
nsfocusglobal.com
Open sourceSee the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.


