Skip to main content
Meet us at Black Hat USA 2026— Las Vegas, August 1–6Book a Meeting
Mallory
Back to intelligence
ai-platform-securitystandards-framework-update

OpenAI Releases Codex Security Agent and GPT-5.4 With Expanded Safety Controls

Updated 2d agoFirst seen Mar 7, 20264 sources

OpenAI announced Codex Security (formerly Aardvark), an application security agent designed to autonomously discover, validate, and remediate vulnerabilities in enterprise and open-source codebases. The product builds a project-specific threat model (mapping trust boundaries and exposure points), then prioritizes issues by likely real-world impact and attempts to confirm findings by running proof-of-concept exploits in sandboxed environments before generating contextual patches intended to minimize regressions. OpenAI reported beta metrics including an 84% reduction in alert noise, a 90% decrease in over-reported severity, and a >50% drop in false positives, and said the system scanned 1.2 million commits from external repositories over a 30-day period; the tool is rolling out as a research preview via the Codex web interface to ChatGPT Pro, Enterprise, Business, and Edu customers.

Separately, OpenAI released the GPT-5.4 model across ChatGPT, Codex, and the API (gpt-5.4 and gpt-5.4-pro), positioning it as a flagship model with improved reasoning, coding, and agent workflows, plus native computer-use capabilities. OpenAI stated it reduced hallucinations and errors (claiming user-flagged factual errors were 33% less likely than GPT-5.2 on a de-identified prompt set) and said it upgraded safety protections while keeping the same high cyber-risk classification used for GPT-5.3-Codex, indicating continued emphasis on security controls as model capabilities expand.

Share:
OpenAI Releases Codex Security Agent and GPT-5.4 With Expanded Safety Controls
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

5 events from the most recent confirmed update back to the earliest known activity.

5 EVENTS
Mar 7, 20264mo ago

OpenAI says Codex Security found zero-days and led to 14 CVE assignments

As part of audits of major open-source projects including OpenSSH, GnuTLS, PHP, and Chromium, OpenAI reported that Codex Security discovered zero-day vulnerabilities and contributed to the assignment of 14 CVEs.

OpenAI launches Codex Security research preview

OpenAI announced Codex Security, an application security agent designed to find, validate, and remediate vulnerabilities in enterprise and open-source codebases. The product began rolling out as a research preview via the Codex web interface to ChatGPT Pro, Enterprise, Business, and Edu customers, with a Codex for OSS program for qualifying maintainers.

Mar 6, 20264mo ago

OpenAI releases GPT-5.4 with native computer use and updated safeguards

OpenAI released GPT-5.4 across ChatGPT, Codex, and the API, positioning it as a flagship model with improved reasoning, coding, tool use, and agent workflows. The company said it added upgraded safeguards, maintained the same high cyber-risk classification as GPT-5.3-Codex, and published related safety research on reasoning concealment.

Feb 5, 20265mo ago

Codex Security beta scans 1.2 million commits and finds thousands of severe issues

During the 30 days preceding its public announcement, OpenAI said Codex Security scanned more than 1.2 million commits across external repositories and identified 792 critical and 10,561 high-severity findings. OpenAI also said the system helped reduce alert noise and false positives in private beta.

Oct 1, 20259mo ago

OpenAI unveils Aardvark private beta for AI-driven vulnerability discovery

OpenAI unveiled Aardvark, the private beta precursor to Codex Security, in October 2025 as an effort to detect and help fix software vulnerabilities at scale.

The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.