Executive Summary for Claude Mythos Project Glasswing: June 2026 Verification Status

Download this Executive Summary as PDF –>

A market in which the buyer cannot measure what they bought is no market at all.

Morrell’s flashy claims of a revolutionary coast-to-coast rapid transit machine allegedly sold 250,000 shares of stock in a hotdog-shaped 450 foot gas balloon. It launched in Berkeley, California on May 23, 1908, after San Francisco had banned it. Source: The Jive Bomber

Summary

Defer any strategy, procurement, or risk decision that rests on the Claude Mythos capability claim until July 6, 2026, when Anthropic’s promised report is due for independent review. The claim is that Mythos is too dangerous to release. The record runs the other way.

Of 23,019 vulnerabilities Mythos reported, it showed only 75 fixes. Holding back the fixes slows validation of findings. Independent teams nonetheless reproduced the showcase findings on commodity and open-weight models for a fraction of the price, among them the engineer who wrote the OpenBSD flaw Anthropic featured at launch. The headline accuracy figure covers the 8 percent of findings a human checked, and the data behind the rest stays withheld. Anthropic widened access to roughly 150 organizations and filed to go public, both ahead of the verification the program promised.

Strategic assumption

Through 2026, AI vulnerability-discovery capabilities marketed as frontier-exclusive will remain reproducible on commodity open-weight models, removing the technical basis for premium pricing and restricted-access programs.

The question is whether to pay five times the price for a premium nail gun when commodity nail guns already do the job, while the premium vendor restricts access and justifies the high price by comparing its own tool to a hammer.

Key findings

Of 23,019 vulnerabilities Mythos reported, 1,752 were verified by a human or security firm and fixes have been shown for 75. The 90.6% accuracy rate in press coverage applies to a human doing the work, not the large numbers from a machine alone.
The flagship discovery used to claim novel risks (FreeBSD CVE-2026-4747) is a 2007 fix for shared code that sat with a patch waiting to be applied. The fix was present in the model’s training data, making the result consistent with recovery from the backlog of delayed fixes rather than novel discovery.
Eight of eight open-weight models reproduced the detection capability, one at $0.11 per million tokens. On June 8, 2026, Glasswing launch partner Cisco ran six frontier models across 1.8 billion lines of code and showed results do not depend on Mythos.
No reproduction steps were published with the Anthropic launch blog, the system card, or the Glasswing update, meaning premium claims cannot be independently verified.
Anthropic has meanwhile filed confidentially for an IPO near a one-trillion-dollar valuation and expanded Glasswing to roughly 150 organizations, committing access and capital ahead of verification.

Recommendations

Treat AI-assisted vulnerability discovery as a commodity input and source it competitively. The showcase results are reproducible at low cost on public models. AI vulnerability harness runs should cost cents per million tokens, not tens of dollars or more. An open-source harness on commodity Haiku 4.5 and Sonnet 4.6 produced eight findings in two minutes for $0.75, two of them matching the Mythos showcase, at the discovery layer. The FreeBSD exploit was reproduced separately by Calif.io on the prior Opus 4.6 model in about four hours.
Do not pay Anthropic a premium or restructure operations on the basis of the Mythos security capability claim until an independent verification exists.
Require any AI security vendor to supply reproduction steps and verified, fixed CVEs rather than model-generated finding counts.
Set July 6, 2026 as a validation checkpoint, and reassess with the Glasswing report published and independently reviewed.

The flagship “discovery” was backlog recall

CVE-2026-4747 is a valid stack buffer overflow in FreeBSD. The code is a University of Michigan implementation that was patched by MIT in 2007. FreeBSD imported the unpatched code in 2008 and never applied the fix. This 2007 patch is present in the model’s training data, so the Mythos published exploitation demonstration took an old vulnerable operating system with a known missing patch and pointed at it. The result demonstrates how a known, undefended target can be flagged by AI, rather than discovery of anything unknown.

The danger warnings are much thinner than advertised. Mythos did send an email out of its sandbox to flag a bug, but only after being instructed to try, it showed no sign of altering its own weights, and prior models such as Opus 4.6 find these same flaws.

Discovery is reproducible at commodity cost

The CVE explanation should help clarify why independent parties have repeatedly reproduced the showcase findings on very inexpensive public models. AISLE confirmed the FreeBSD detection with eight of eight open-weight models, showing $0.11 per million tokens was a sufficient cost model. Vidoc reproduced it on the public Opus 4.6 model and on GPT-5.4. Cisco’s June 8 assessment across six frontier models showed the outcome is model-independent. The curl maintainers reported no change to their workflow, and Mozilla’s headline of 271 Firefox vulnerabilities reconciles to roughly three against the advisory record. Discovery at this level carries a published, commodity cost.

Niels Provos, who committed the 1998 flaw in BSD that Mythos used for their “discovery”, reproduced that finding and autonomously surfaced new zero-days using Opus 4.6, Sonnet 4.6, and the open-weight GLM 5.1 on his own open-source IronCurtain harness, concluding that discovery is an orchestration problem rather than a frontier-model one. Then clearbluejar ran the same class of pipeline on two open-weight models on a single consumer GPU and recovered CVE-2026-4747, finding that their scaffolding, again not the model, did the hard work.

The premium is unjustifiable as presented

Anthropic prices Mythos at roughly five times its public Opus model, from $25 to $125 per million input and output tokens, on the strength of exploit development rather than discovery. No replayable exploit with reproduction steps accompanies the launch blog, their very large and inefficient 244-page system card, or the late-May Glasswing update. A buyer cannot confirm the capability they are paying for, and the available reproductions indicate the defensible cost is a fraction of the quoted price.

Results are self-assessed, data is withheld

Anthropic’s interim Glasswing update reports results in stages that have undermined their own headlines.

Stage	Figure	What it represents
Total findings	23,019	The model’s ungraded output
Estimated high or critical	6,202	The model’s own estimate
Checked by a human or firm	1,752	28% of the high-critical pile, about 8% of the total
True positives among those checked	90.6%	A statement about the 1,752, not the 23,019
Fixes shown	75	Out of 23,019

The 90.6% accuracy figure is from humans. The rest is just the model assessing its own output. Anthropic has also withheld the fixes used to derive the findings, the artifacts that would allow independent re-derivation. A result that can be validated only against the system that produced it, does not rise to the level of independent confirmation of its capability.

Extractive disclosure structure

The disclosure architecture inverts established norms, and economics are the reason why. Anthropic commits up to one hundred million dollars in model credits to a consortium of about a dozen large firms. The consortium attests to the capability that justifies restricting the model to the consortium, and the same firms sell the products and services that follow from that attestation. A rushed “emergency” memo about Mythos risks crediting 250 CISOs was apparently curated by security vendors who would capitalize on myths about machine risks. The most consequential findings instead have come from humans during the Glasswing period: the Palo Alto vulnerability that triggered a federal mandate was attributed to attackers operating in production. It was excluded from the company’s AI-credited count. Findings are directed to Anthropic while fixes fall to volunteer maintainers, even as the patch-generation step that a model can automate already runs in production for paying customers. Anthropic’s Claude Security product patched more than 2,100 vulnerabilities in three weeks behind a paywall, while the open-source projects apparently only received the vulnerabilities.

Market motivations

On June 1, 2026, Anthropic filed confidentially for an initial public offering following a funding round near a one-trillion-dollar valuation. On June 2, it expanded Glasswing to roughly 150 organizations across more than fifteen countries, covering power, water, healthcare, and communications. Access widened and capital was committed before any independent validation of the capability, and before the report Anthropic itself promised.

Several firms now trialling Mythos, including Google, Nvidia, and Cisco, are Anthropic investors, and Goldman Sachs, Morgan Stanley, and JPMorgan are reported to be in talks to underwrite the offering. The parties certifying the capability are the parties whose returns depend on it. Private equity firms are being courted to invest in Anthropic promises, as a means of pressuring their own portfolio companies to choose future Anthropic products.

Outlook

Anthropic committed to a public report within ninety days of the April 7 launch, due around July 6, 2026. However, the question of novelty has been repeatedly answered. With each reveal, Mythos has failed to prove its initial claims. A report containing a verified CVE list with reproduction steps would substantiate the capability claim and the program’s premise. A report that restates model-graded headline figures without independent verification would confirm the pattern described here.

The prudent posture is to treat their unproven capability as unproven.

Morrell’s airship rose about 300 feet and then ripped apart and crashed, shortly after it’s first launch on May 23, 1908. Source: The Jive Bomber

References: flyingpenguin series

The Boy That Cried Mythos: Verification is Collapsing Trust in Anthropic, April 13, 2026.
America Prepares as Anthropic Mythos is 100X More Deadly Than Martian Death Ray, April 13, 2026.
FreeBSD CVE-2026-4747 Log Suggests Mythos is a Marketing Trick, April 14, 2026.
Cartel or Not? Anthropic Mythos is a Curious Case, April 15, 2026.
Ox Security Report: Anthropic MCP is Execute First, Validate Never, April 15, 2026.
How SANS Mythos Marketing Disappoints Defenders, April 16, 2026.
Mythos Mystery in Mozilla Numbers: How 22 Vulns Became 271 or Maybe 3 in April, April 22, 2026.
Alisa Esage Throws Mythos Under Zero Day Bus, April 24, 2026.
Anthropic Mythos as Valuable as a Firehose in a Blizzard, May 2, 2026.
Seventy-Five Cents Gets You an Anthropic Mythos Killer, May 4, 2026.
cURL Toe to Toe With Mythos: Big Nothingburger Leaves Bad Taste, May 12, 2026.
Palo Alto Defender’s Guide Refutes Mythos Claim, May 13, 2026.
I’m on Mythos, May 25, 2026.
Mythos Grading Mythos: Got Patches Yet?, May 26, 2026.
Cisco’s Mythos Post Throws Anthropic Under the Bus, June 8, 2026.

References: Anthropic program materials

Project Glasswing (program page), Anthropic.
Project Glasswing: An initial update, Anthropic, late May 2026. Source of the 23,019 / 6,202 / 1,752 / 90.6% / 75 figures and the 90-day disclosure convention.

References: independent reproduction and refutation

AISLE reproduction: eight of eight open-weight models detect CVE-2026-4747, one at $0.11 per million tokens. Documented in references 1 and 10.
Vidoc reproduction on public Opus 4.6 and GPT-5.4. Documented in reference 10.
Nicholas Carlini’s personal confirmation that he found CVE-2026-4747 using Mythos Preview, placing it outside his February 5 paper. Documented in references 3 and 10.
Cisco frontier-model assessment, six models across 1.8 billion lines of code. Documented in reference 15.
Palo Alto Networks May 2026 Defender’s Guide and the CVE-2026-0300 advisory, with the federal-mandate CVE attributed to attackers in production and excluded from the AI-credited count. Documented in reference 12.
Mozilla Foundation Security Advisory 2026-30 (Firefox 150) and Bobby Holley, “The zero-days are numbered,” Mozilla blog, April 21, 2026. Documented in reference 7.
Claude Mythos Preview system card (244 pages), Anthropic. Documented in reference 1.
Finding Zero-Days with Any Model, Niels Provos, April 29, 2026. Reproduced the OpenBSD SACK finding and surfaced new zero-days with commercial and open-weight models on the open-source IronCurtain framework.
System Over Model, Tested: Reproducing Mythos’s FreeBSD Find on Local Open-Weight Models, clearbluejar, June 4, 2026.
System Over Model: Zero-Day Discovery at the Jagged Frontier, Stanislav Fort, AISLE, April 2026. The nano-analyzer pipeline reproduced CVE-2026-4747 for under $100.

References: press on the June expansion and IPO filing

Anthropic scales Claude Mythos to critical infrastructure in 15+ countries, TechCrunch, June 2, 2026.
Anthropic expanding access to Project Glasswing, CyberScoop, June 2026. Source for Claude Security patching 2,100+ vulnerabilities in three weeks.
Anthropic expands Mythos to 150 additional organizations in more than 15 countries, CNBC, June 2, 2026.
Anthropic expands Project Glasswing to 150 organizations in more than 15 countries, Help Net Security, June 3, 2026.
Experts: Anthropic’s move to expand Project Glasswing will end in Mythos public release, Cybernews, June 2026.
From Anthropic’s Mythos to the Birkin bag, scarcity sells, John Foley, Lex, Financial Times, April 23, 2026.

flyingpenguin

Executive Summary for Claude Mythos Project Glasswing: June 2026 Verification Status

Leave a Reply Cancel reply

a blog about the poetry of information security, since 1995