OpenAI Releases Codex Security Agent and GPT-5.4 With Expanded Safety Controls

EVENT TIMELINE

How this story unfolded

5 events from the most recent confirmed update back to the earliest known activity.

Mar 7, 2026

OpenAI says Codex Security found zero-days and led to 14 CVE assignments

As part of audits of major open-source projects including OpenSSH, GnuTLS, PHP, and Chromium, OpenAI reported that Codex Security discovered zero-day vulnerabilities and contributed to the assignment of 14 CVEs.

OpenAI launches Codex Security research preview

OpenAI announced Codex Security, an application security agent designed to find, validate, and remediate vulnerabilities in enterprise and open-source codebases. The product began rolling out as a research preview via the Codex web interface to ChatGPT Pro, Enterprise, Business, and Edu customers, with a Codex for OSS program for qualifying maintainers.

Mar 6, 2026

OpenAI releases GPT-5.4 with native computer use and updated safeguards

OpenAI released GPT-5.4 across ChatGPT, Codex, and the API, positioning it as a flagship model with improved reasoning, coding, tool use, and agent workflows. The company said it added upgraded safeguards, maintained the same high cyber-risk classification as GPT-5.3-Codex, and published related safety research on reasoning concealment.

Feb 5, 2026

Codex Security beta scans 1.2 million commits and finds thousands of severe issues

During the 30 days preceding its public announcement, OpenAI said Codex Security scanned more than 1.2 million commits across external repositories and identified 792 critical and 10,561 high-severity findings. OpenAI also said the system helped reduce alert noise and false positives in private beta.

Oct 1, 2025

OpenAI unveils Aardvark private beta for AI-driven vulnerability discovery

OpenAI unveiled Aardvark, the private beta precursor to Codex Security, in October 2025 as an effort to detect and help fix software vulnerabilities at scale.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

24 linked

Vulnerabilities

17 linked

Double-free in GnuTLS SAN otherName export logic

Division by Zero DoS in CISA Thorium

Heap buffer over-read in GnuTLS SCT extension parsing

Unauthenticated verification email flooding in CISA Thorium

Code Injection in Claude Code startup trust dialog / MCP consent bypass

GnuTLS certtool template parsing heap-buffer overflow (off-by-one)

Improper TLS Certificate Validation in CISA Thorium Elasticsearch Connection

CISA Thorium password reset token invalidation flaw

Denial of Service in CISA Thorium account verification email handling

Path Traversal in CISA Thorium download_ephemeral and download_children

LDAP injection in CISA Thorium

Claude Code API key exfiltration via pre-trust ANTHROPIC_BASE_URL override

Improper validation of PBMAC1 parameters in OpenSSL PKCS#12 MAC verification

Stack-based buffer overflow in GnuPG tpm2daemon PKDECRYPT

GnuPG gpg-agent PKDECRYPT --kem=CMS Stack Buffer Overflow

Gogs cross-account 2FA recovery code authentication bypass

Unauthenticated file upload in Gogs attachment endpoints (/issues/attachments, /releases/attachments)

Affected products

4 linked

GitHub

ChatGPT

ChatGPT

Chromium

Organizations

3 linked

OpenAI

Anthropic

The Hacker News

SOURCE COVERAGE

Sources

4 references tracked. Mallory keeps watching after this page renders.

Cyberthrone (News)

Mar 8, 2026

Claude Code Security vs. OpenAI Codex Security - AI Arms Race - TheCyberThrone

thecyberthrone.in

The Hacker News (News)

Mar 7, 2026

OpenAI Codex Security Scanned 1.2 Million Commits and Found 10,561 High-Severity Issues

thehackernews.com

Cyber Security News (News)

Mar 7, 2026

OpenAI Launches Codex Security that Discover, Validate and Patch Vulnerabilities

cybersecuritynews.com

Help Net Security (News)

Mar 6, 2026

OpenAI’s GPT-5.4 doubles down on safety as competition heats up - Help Net Security

helpnetsecurity.com

ON THE SAME THREAD

OpenAI GPT-5.2-Codex Release and AI-Driven Vulnerability Detection

OpenAI has released GPT-5.2-Codex, a new AI model designed to enhance agentic coding and cybersecurity tasks, with notable improvements in vulnerability detection and software engineering workflows. The model demonstrates superior performance on benchmarks such as SWE-Bench Pro and Terminal-Bench 2.0, and excels in professional Capture-the-Flag challenges, supporting advanced tasks like fuzzing, attack surface analysis, and test environment setup. OpenAI has implemented stronger safeguards to mitigate dual-use risks and is offering the model to paid ChatGPT Codex users, with an invite-only pilot for vetted cybersecurity professionals focused on defensive applications. The model's capabilities have already contributed to the discovery of several critical vulnerabilities in React Server Components, including CVE-2025-55182 (RCE), CVE-2025-55183 (source code exposure), and CVE-2025-67779 (DoS), prompting urgent patching recommendations. Industry research highlights both the promise and risks of AI-generated code, with studies showing that machine-written pull requests contain significantly more bugs and security vulnerabilities than those authored solely by humans. Issues such as improper password handling, insecure object references, and cross-site scripting are notably more prevalent in AI-generated code, raising concerns about the security implications of rapid AI-driven development. These findings underscore the importance of robust review processes and the need for continued vigilance as AI tools become more deeply integrated into software engineering and cybersecurity operations.

Jun 29, 2026

Enterprise Platforms for Deploying and Governing AI Agents Expand as OpenAI Launches GPT-5.3-Codex and Frontier

OpenAI announced **GPT-5.3-Codex**, positioning it as more than a code-generation model and expanding its availability across a new macOS desktop app plus CLI, IDE extension, and web interface, with API access described as forthcoming. OpenAI and third-party coverage said the model is intended to support broader software-lifecycle work (e.g., debugging, deployments, monitoring, tests, and documentation) and highlighted benchmark gains over prior Codex versions; reporting also pushed back on claims that “Codex built itself,” characterizing OpenAI’s statement as the model being *instrumental* in its own development rather than fully autonomous. In parallel, OpenAI unveiled **Frontier**, an enterprise framework aimed at building, deploying, and managing AI agents, including promised agent security features and an approach modeled on Palantir-style **forward-deployed engineers** working alongside customer teams. Separately, *MintMCP* launched an enterprise governance platform focused on deploying, monitoring, and securing AI agents and MCP servers, emphasizing audit trails, policy enforcement, observability/guardrails, and centralized access controls to reduce risks from privileged agents (e.g., credential exposure and data exfiltration). Other items in the set were not tied to these launches, including a general CIO column on SMB IT “quick fixes” creating long-term risk and a corporate internal AI learning program announcement from Hancom.

Jun 29, 2026

OpenAI Restricts GPT-5.6 Sol Preview Amid Cybersecurity Misuse Concerns

OpenAI has launched a limited preview of its `GPT-5.6` model family—**Sol**, **Terra**, and **Luna**—with access initially restricted to a small group of trusted partners through the API and Codex. The company described **Sol** as its most advanced cybersecurity-focused model and said the rollout follows consultations with the U.S. government while broader national-security risk assessment frameworks for cyber-capable AI are developed. OpenAI said wider availability across ChatGPT, Codex, and API offerings is planned in the coming weeks. OpenAI said `GPT-5.6 Sol` improves vulnerability discovery, patch development, and exploit-related research, and internal as well as third-party testing indicated it can identify security flaws, generate credible memory-safety leads, and in some cases uncover previously unknown vulnerabilities. At the same time, the company said the model is not yet capable of reliably carrying out autonomous end-to-end attacks against hardened targets, and it has added layered safeguards including refusal training, output screening, real-time classifiers, secondary review models, account-level evaluations, misuse monitoring, and large-scale automated red-teaming to limit abuse of sensitive cyber capabilities.

Jun 30, 2026

Related stories

OpenAI GPT-5.2-Codex Release and AI-Driven Vulnerability Detection

Enterprise Platforms for Deploying and Governing AI Agents Expand as OpenAI Launches GPT-5.3-Codex and Frontier

OpenAI Restricts GPT-5.6 Sol Preview Amid Cybersecurity Misuse Concerns