OpenAI Releases Codex Security Agent and GPT-5.4 With Expanded Safety Controls
OpenAI announced Codex Security (formerly Aardvark), an application security agent designed to autonomously discover, validate, and remediate vulnerabilities in enterprise and open-source codebases. The product builds a project-specific threat model that maps trust boundaries and exposure points, prioritizes issues by likely real-world impact, and attempts to confirm findings by running proof-of-concept exploits in sandboxed environments. It then generates contextual patches intended to minimize regressions. OpenAI reported beta metrics including an 84% reduction in alert noise, a 90% decrease in over-reported severity, and a more than 50% drop in false positives, and said the system scanned more than 1.2 million commits from external repositories over a 30-day period. The tool is rolling out as a research preview via the Codex web interface to ChatGPT Pro, Enterprise, Business, and Edu customers.
Separately, OpenAI released the GPT-5.4 model across ChatGPT, Codex, and the API (as gpt-5.4 and gpt-5.4-pro), positioning it as a flagship model with improved reasoning, coding, and agent workflows, plus native computer-use capabilities. OpenAI stated that it reduced hallucinations and errors, claiming that user-flagged factual errors were 33% less likely than with GPT-5.2 on a de-identified prompt set. The company also said it upgraded safety protections while keeping the same high cyber-risk classification used for GPT-5.3-Codex, indicating continued emphasis on security controls as model capabilities expand.

How this story unfolded
5 events from the most recent confirmed update back to the earliest known activity.
OpenAI says Codex Security found zero-days and led to 14 CVE assignments
As part of audits of major open-source projects including OpenSSH, GnuTLS, PHP, and Chromium, OpenAI reported that Codex Security discovered zero-day vulnerabilities and contributed to the assignment of 14 CVEs.
OpenAI launches Codex Security research preview
OpenAI announced Codex Security, an application security agent designed to find, validate, and remediate vulnerabilities in enterprise and open-source codebases. The product began rolling out as a research preview via the Codex web interface to ChatGPT Pro, Enterprise, Business, and Edu customers, with a Codex for OSS program for qualifying maintainers.
OpenAI releases GPT-5.4 with native computer use and updated safeguards
OpenAI released GPT-5.4 across ChatGPT, Codex, and the API, positioning it as a flagship model with improved reasoning, coding, tool use, and agent workflows. The company said it added upgraded safeguards, maintained the same high cyber-risk classification as GPT-5.3-Codex, and published related safety research on reasoning concealment.
Codex Security beta scans 1.2 million commits and finds thousands of severe issues
During the 30 days preceding its public announcement, OpenAI said Codex Security scanned more than 1.2 million commits across external repositories and identified 792 critical and 10,561 high-severity findings. OpenAI also said the system helped reduce alert noise and false positives in private beta.
OpenAI unveils Aardvark private beta for AI-driven vulnerability discovery
OpenAI unveiled Aardvark, the private beta precursor to Codex Security, in October 2025 as an effort to detect and help fix software vulnerabilities at scale.
Sources
Claude Code Security vs. OpenAI Codex Security - AI Arms Race - TheCyberThrone (thecyberthrone.in)
OpenAI Codex Security Scanned 1.2 Million Commits and Found 10,561 High-Severity Issues (thehackernews.com)
OpenAI Launches Codex Security that Discover, Validate and Patch Vulnerabilities (cybersecuritynews.com)
OpenAI’s GPT-5.4 doubles down on safety as competition heats up - Help Net Security (helpnetsecurity.com)


