Claude Mythos Preview System Card: What It Reveals About the Model's Capabilities

Key Points

The system card details cybersecurity capabilities that Anthropic describes as "unprecedented"
Mythos Preview discovered thousands of critical vulnerabilities across major operating systems and browsers
Notable: a 27-year-old OpenBSD vulnerability and Firefox 147 JS engine exploit chain
The model can autonomously move from vulnerability discovery to proof-of-concept exploit development
Anthropic's safety team determined these capabilities require restricted access

What Is a System Card?

A system card is a document published by AI developers that details a model's capabilities, safety evaluations, limitations, and deployment decisions. Think of it as a technical spec sheet combined with a safety assessment. Anthropic has published system cards for previous Claude models, but the Mythos Preview card is notable for its candor about the model's offensive potential.

The Key Revelations

Zero-Day Discovery at Scale

The most striking finding in the system card is the model's ability to discover previously unknown vulnerabilities — zero-days — at a scale that far exceeds what human security researchers can achieve in comparable timeframes.

During internal testing, Mythos Preview identified thousands of critical vulnerabilities across major software platforms. While the exact number has not been publicly disclosed, Anthropic characterized the results as representing a "step-change" in automated vulnerability discovery.

The OpenBSD Discovery

Perhaps the most compelling individual finding was the model's discovery of a 27-year-old vulnerability in OpenBSD, a operating system renowned for its security focus. OpenBSD maintainers have dedicated decades to code auditing, making it one of the most thoroughly reviewed codebases in the world. The fact that an AI model identified a vulnerability that human auditors missed for over a quarter century illustrates the model's capability.

Firefox 147 Exploit Chain

The system card describes how Mythos Preview identified vulnerabilities in the Firefox 147 JavaScript engine and then developed functional proof-of-concept exploits for them. This is significant because it demonstrates the model's ability to not just find vulnerabilities but to operationalize them — moving from "this code has a flaw" to "here is how that flaw can be exploited."

Autonomous Exploit Development

The most concerning capability described in the system card is the model's degree of autonomy in the exploit development pipeline. With minimal human guidance, Mythos Preview can:

Identify a vulnerability in source code or compiled software
Determine the exploitability of that vulnerability
Develop a working proof-of-concept exploit
In some cases, chain multiple vulnerabilities together for deeper access

This end-to-end capability is what ultimately drove Anthropic's decision to restrict access through Project Glasswing rather than releasing the model publicly.

Safety Evaluations

The system card describes Anthropic's evaluation process, which included:

Internal red-teaming: Anthropic's security team tested the model's ability to discover and exploit vulnerabilities across a range of software targets
External review: Select external evaluators were given access to validate the capability claims
Threshold assessment: The model's offensive capabilities exceeded thresholds that Anthropic had established for triggering access restrictions
Comparative analysis: The capabilities were compared against both previous Claude models and human expert teams

Anthropic reports that the model exceeded their pre-established safety thresholds in the cybersecurity domain, which triggered the restricted deployment decision.

Comparison to Previous System Cards

Aspect	Claude Opus 4.6 Card	Mythos Preview Card
Autonomy level	Moderate	High
Vulnerability discovery	Limited	Thousands found
Exploit development	Not demonstrated	Autonomous PoC generation
Access decision	Public release	Restricted (Glasswing)
Safety threshold	Within limits	Exceeded thresholds

What This Means

The Mythos Preview system card is significant not just for what it describes about one model, but for what it signals about the trajectory of AI capabilities. If a model developed primarily for general-purpose AI tasks can achieve this level of cybersecurity proficiency, it suggests that future frontier models from any lab may face similar deployment questions.

For Anthropic specifically, the system card establishes a framework for how the company handles models that exceed certain capability thresholds. This could serve as a template for other AI developers facing similar decisions.

For the latest benchmark comparison, see our benchmarks page. For more on the model's background, read our comprehensive explainer.

Claude Mythos Preview System Card: What It Reveals About the Model's Capabilities

Key Points

What Is a System Card?

The Key Revelations

Zero-Day Discovery at Scale

The OpenBSD Discovery

Firefox 147 Exploit Chain

Autonomous Exploit Development

Safety Evaluations

Comparison to Previous System Cards

What This Means

Sources

Related Articles

Anthropic Announces Claude Mythos Preview — Its Most Powerful AI Model Yet

Claude Mythos Preview Benchmark Breakdown: How It Compares to GPT-5 and Opus 4.6

NSA Uses Mythos Preview as Pentagon Labels Anthropic a 'Supply-Chain Risk'