Phishing Campaigns Evade Detection by Abusing AI and Trusted Email Security Controls
Security researchers reported multiple phishing evasion techniques designed to defeat modern email and AI-assisted defenses rather than relying only on traditional lure quality. One campaign analyzed by KnowBe4 used graymail-style content padding and extreme whitespace insertion to manipulate NLP-based email security tools, placing benign promotional text, legitimate signatures, and trusted links far below the visible phishing lure so scanners would weigh the message as less malicious. A separate LevelBlue-tracked trend showed attackers abusing enterprise URL rewriting and Safe Links-style protections by sending phishing through compromised accounts, causing security gateways to generate trusted wrapped URLs that could then be reused in campaigns targeting Microsoft 365 users.
The activity reflects a broader shift toward exploiting the gap between what users see and what automated systems inspect. In the URL-rewriting abuse, operators tied to Tycoon2FA and Sneaky2FA built multi-layer redirect chains across several trusted vendor domains to obscure final destinations and steal credentials and MFA session cookies through adversary-in-the-middle infrastructure, enabling account takeover, internal phishing, data theft, and sometimes ransomware follow-on activity. Related research from LayerX showed a different but thematically aligned evasion method in which font rendering and CSS make webpages display malicious commands to users while AI assistants parsing the underlying HTML see only harmless text, underscoring that attackers are increasingly targeting AI and trust-based inspection layers as part of phishing and social-engineering operations.

Get ahead of threats like this
Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.
How this story unfolded
6 events from the most recent confirmed update back to the earliest known activity.
LayerX publicly disclosed font-rendering trick that hides malicious commands from AI
LayerX publicly described a proof-of-concept attack in which custom fonts and CSS make a webpage render dangerous commands to users while AI assistants see only harmless underlying text. The company said the technique could mislead tools such as ChatGPT, Claude, Copilot, and Gemini into falsely assuring users that a command is safe.
LevelBlue said safe-link abuse campaigns remained active in early 2026
LevelBlue reported that phishing campaigns abusing trusted-looking rewritten safe links were still active in early 2026. The campaigns used layered redirects across multiple security vendor domains and prebuilt chains in emails or HTML attachments to conceal final phishing destinations.
KnowBe4 identified phishing obfuscation aimed at NLP-based email defenses
KnowBe4 reported an emerging phishing technique that places malicious social-engineering text at the top of an email and large volumes of benign padding farther down to dilute malicious indicators. In 40 analyzed samples, attackers also used line breaks, graymail-like content, legitimate links, and polymorphic subject lines or attachment names to evade detection.
Microsoft fully addressed LayerX's font-rendering attack scenario
By March 2026, according to LayerX, Microsoft had fully addressed the reported browser-rendering versus DOM-analysis issue after opening an MSRC case. The mitigation followed LayerX's December 2025 report about hidden malicious commands that AI tools could misread as safe.
Safe-link phishing abuse peaked with multi-vendor redirect chains
In January 2026, LevelBlue observed phishing campaigns using three or more enterprise email security URL-rewriting services at peak levels. The activity, tied to Tycoon2FA and Sneaky2FA, targeted Microsoft 365 users with adversary-in-the-middle phishing to steal credentials and MFA session cookies.
LayerX reported font-rendering AI deception issue to vendors
In December 2025, LayerX disclosed to affected vendors a proof-of-concept technique using custom fonts and CSS so webpages could show malicious commands to users while exposing only benign text to AI assistants analyzing the HTML/DOM. Microsoft opened an MSRC case, while most other vendors reportedly considered the issue out of scope because it relied on social engineering.
Related entities
Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.
Sources
3 references tracked. Mallory keeps watching after this page renders.
Phishing emails target AI defenses with unique obfuscation | news | SC Media
scworld.com
Open sourcePhishers Weaponize Safe Links With Multi-Layered URL Rewriting to Evade Detection
cybersecuritynews.com
Open sourceNew font-rendering trick hides malicious commands from AI tools
bleepingcomputer.com
Open sourceSee the full picture, correlated to your attack surface.
Map indicators from this story to your assets and identify affected systems in minutes.
Every observed campaign, victim, and pivot linked to actors named in this story.
Malware, exploits, and IOCs connected to the activity described here.
YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.
Get matching new stories delivered to your team as they break — not the next morning.
Ask questions about this story and take action on the answers.


