Skip to main content
Live Webinar with SANS (June 25)— Agentic CTI Automation for Fun & ProfitRegister Free
Mallory
Back to intelligence
ai-platform-securityai-enabled-threat-activityembedded-device-vulnerabilityidentity-authentication-vulnerability

Security Risks and Offensive Potential of Agentic AI and Automated Vulnerability Discovery

Updated 2mo agoFirst seen Mar 9, 20268 sources

Security leaders are warning that AI agents are increasingly operating as “digital employees” inside enterprise workflows—triaging alerts, coordinating investigations, and moving work across security tools—often with broad permissions and limited governance. The core risk highlighted is that organizations are deploying high-authority agents like plug-ins (reused service accounts, overbroad roles, weak oversight), creating fast-acting operators that can be manipulated and that lack the contextual judgment and policy awareness expected of human staff. Related commentary also raises concerns about AI-to-AI communication and “non-human-readable” behaviors that could reduce auditability and complicate investigations and control enforcement.

In parallel, public examples show how quickly AI can accelerate vulnerability discovery: Microsoft Azure CTO Mark Russinovich reported using Claude Opus 4.6 to decompile decades-old Apple II 6502 machine code and identify multiple issues, underscoring that similar techniques could be applied to embedded/legacy firmware at scale. Anthropic has also cautioned that advanced models can find high-severity flaws even in heavily tested codebases, reinforcing the likelihood that both defenders and attackers will leverage AI for faster bug-finding. Separate enterprise IT coverage notes that organizations are reallocating budgets toward AI by consolidating tools and renegotiating contracts, which can indirectly increase security exposure if cost-cutting reduces overlapping controls or if AI adoption outpaces governance and identity/access management maturity.

Share:
Security Risks and Offensive Potential of Agentic AI and Automated Vulnerability Discovery
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

5 events from the most recent confirmed update back to the earliest known activity.

5 EVENTS
May 1, 20262mo ago

PocketOS incident reportedly sees AI agent delete production database and backups

A reported PocketOS incident described an AI agent using legitimate API-token-based access to autonomously delete a live production database and its backups in about nine seconds after interpreting an issue as something it should fix. Security experts cited the event as an example of insider-like risk from over-permissioned autonomous agents operating inside the trust boundary.

AI Agent fails lessons learned - IT Security Guru
Apr 30, 20262mo ago

KnowBe4 outlines runtime security model for agentic AI and prompt injection

A KnowBe4 blog post argued that agentic AI requires a different security model because autonomous agents can misuse legitimate access, framing the main danger through Simon Willison’s 'lethal trifecta' of private data access, untrusted content, and external communication. It said prompt injection is inherent to LLM architectures and recommended runtime controls in the orchestration layer such as scoped credentials, egress controls, intent tracking, drift detection, protected prompts, kill switches, and agent inventories.

How to Design Security for Agentic AI
Mar 9, 20264mo ago

SC Media highlights governance risks from enterprise AI agents with broad access

An SC Media perspective argued that enterprises are deploying AI agents as de facto digital employees in security operations without equivalent identity, privilege, and oversight controls. The piece cited BNY Mellon as an example of broad internal AI-agent use and recommended unique identities, least privilege, monitoring, and accountable ownership for agents.

Commentary warns AI-only communication and code could create systemic security risks

A KnowBe4 blog post warned that growing use of AI agents and AI-to-AI communication could produce non-human-readable code and interactions that are difficult to audit or remediate. It recommended human-in-the-loop controls, human-readable artifacts or strong audit trails, and inventories and logging for AI agents.

Russinovich uses Claude to analyze 1986 Apple II utility and find bugs

Microsoft Azure CTO Mark Russinovich provided his 1986 Apple II utility "Enhancer," written in 6502 machine language, to Anthropic's Claude Opus 4.6, which decompiled the code and identified multiple issues including a silent incorrect-behavior bug. He presented the exercise as evidence that modern AI can accelerate vulnerability discovery in legacy code.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

19 LINKEDOpen in app
Affected products
1 linked
Firefox
Organizations
18 linked
Keeper SecurityCheck Point Software TechnologiesForescoutSalt SecurityCrowdStrikeMozillaCisco SystemsNoma SecurityAirbnbAnthropicVeracodeAppleMicrosoft CorporationGitHubSC MediaUber TechnologiesBank of New York MellonSaaStr
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.