Mallory
ai-platform-security, command-and-control-method, data-exfiltration-method, privacy-surveillance-policy

OpenAI Adds ChatGPT Lockdown Mode and Elevated Risk Labels to Reduce Prompt-Injection Exfiltration

Updated 3mo ago | First seen Feb 17, 2026 | 2 sources

OpenAI introduced Lockdown Mode and Elevated Risk labels in ChatGPT to reduce exposure to prompt injection and related data-exfiltration risks when AI features interact with external systems. Lockdown Mode is positioned as an optional, advanced setting for higher-risk users and environments, notably ChatGPT Enterprise, ChatGPT Edu, ChatGPT for Healthcare, and ChatGPT for Teachers. It restricts tool access and limits how ChatGPT can reach outside systems. Reported controls include disabling or constraining capabilities that attackers could abuse through conversations or connected apps, and restricting browsing to cached content so that no live network requests leave OpenAI-controlled infrastructure. Admins can enable the setting through workspace controls and apply additional restrictions through dedicated roles. Elevated Risk labels, by contrast, provide in-product warnings and guidance for features that increase risk when connecting to apps or the web, and they appear across ChatGPT, ChatGPT Atlas, and Codex.
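To make the reported behavior concrete, the following is a minimal, purely illustrative sketch of how a lockdown-style gate could work in principle. It is not OpenAI's implementation. Every name in it (`LockdownPolicy`, `Gateway`, `allowed_tools`, `live_fetch`) is hypothetical, and it assumes only what is summarized above: tool access is restricted, and browsing is served from cached content so that no live request is made.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass(frozen=True)
class LockdownPolicy:
    """Hypothetical policy object; not an OpenAI API."""
    enabled: bool = False
    # Tools an admin has explicitly allowed while locked down (assumption).
    allowed_tools: frozenset = frozenset()


@dataclass
class Gateway:
    """Hypothetical choke point that every tool call and page fetch passes through."""
    policy: LockdownPolicy
    cache: dict = field(default_factory=dict)

    def may_call_tool(self, name: str) -> bool:
        # Outside lockdown, every tool is permitted.
        if not self.policy.enabled:
            return True
        # In lockdown, only explicitly allowed tools are permitted.
        return name in self.policy.allowed_tools

    def browse(self, url: str, live_fetch: Callable[[str], str]) -> Optional[str]:
        if self.policy.enabled:
            # Cached content only: live_fetch is never invoked, so a URL
            # crafted by injected text cannot carry data to an outside server.
            return self.cache.get(url)
        content = live_fetch(url)
        self.cache[url] = content
        return content
```

Under these assumptions, a prompt-injected instruction to fetch an attacker-controlled URL with secrets in the query string would return nothing and trigger no outbound request, because the only data path in lockdown is a cache lookup.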

Separate research showed how AI assistants with web-browsing or URL-fetching features can be abused as stealthy command-and-control (C2) relays. The researchers demonstrated a technique against Microsoft Copilot and xAI Grok that tunnels operator commands and victim data through legitimate AI web interfaces and can work without an API key or registered account. In parallel, the European Parliament reportedly disabled built-in AI tools on lawmakers’ work devices, citing cybersecurity and privacy concerns about uploading sensitive correspondence to third-party cloud AI providers and uncertainty about what data is shared and retained. Other referenced material covered general productivity customization of ChatGPT via “Custom Instructions” rather than a specific security event or disclosure.


EVENT TIMELINE

How this story unfolded

Feb 16, 2026 (4mo ago)

OpenAI introduces ChatGPT Lockdown Mode and Elevated Risk labels

OpenAI launched two new ChatGPT security features—Lockdown Mode and Elevated Risk labels—to reduce prompt injection, data exfiltration, and other advanced threats when AI tools connect to external systems. Lockdown Mode was made available as an optional setting for certain business and regulated offerings, while Elevated Risk labels were added across ChatGPT, ChatGPT Atlas, and Codex to warn users about higher-risk features.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

Affected products (2 linked): ChatGPT, ChatGPT
Organizations (2 linked): OpenAI, Ziff Davis