Skip to content
Briefings are running a touch slower this week while we rebuild the foundations.See roadmap
Iran Conflict 2026
1APR

Anthropic withholds Mythos from public release

2 min read
12:41UTC

Anthropic's most capable model scored 83.1% on vulnerability reproduction but will not be released publicly, going instead to twelve partners through a $100 million restricted programme.

ConflictDeveloping
Key takeaway

Anthropic set the precedent for withholding a frontier model from public release over systemic risk.

Anthropic released Claude Mythos Preview exclusively to twelve partner organisations through Project Glasswing on 8 April 2026, backed by $100 million in model usage credits 1. The model autonomously identified thousands of zero-day vulnerabilities across every major operating system and browser, including a 27-year-old OpenBSD flaw that had survived five million automated tests. On the CyberGym benchmark it scored 83.1% on vulnerability reproduction, compared with 66.6% for Anthropic's previous top model.

Anthropic has explicitly stated it will not release Mythos to the public. The twelve Glasswing partners include AWS, Apple, Google, Microsoft, CrowdStrike, Palo Alto Networks, and JPMorgan. Goldman Sachs, another partner, published displacement research the same week showing AI substitutes 25,000 jobs per month , placing these institutions on both sides of the AI capability and labour displacement story.

A Tom's Hardware review challenged the marketing: the "thousands" claim rested on only 198 manual reviews, and many flagged flaws were in outdated software 2. The Bessent-Powell emergency meeting suggests federal regulators took the risk seriously regardless.

Deep Analysis

In plain English

Every major AI model so far has been made available to the public, either free or via subscription. Anthropic has broken that pattern. Claude Mythos Preview is restricted to twelve partner organisations, including tech giants and financial institutions. The model can automatically find previously unknown security flaws in software at a scale that has never been seen from an AI system. Anthropic's position is that the capability is powerful enough that releasing it widely would create unacceptable risk, the same software that makes it useful to defenders could be used by attackers. So it is being distributed under a controlled programme called Project Glasswing, with $100 million in subsidised usage credits. A technical review by Tom's Hardware found that some of the specific claims were overstated, but the underlying capability gap the model represents is real.

Deep Analysis
Root Causes

Anthropic's capability assessment rests on the CyberGym benchmark jump from 66.6% to 83.1%, a 16.5-percentage-point improvement in autonomous vulnerability reproduction. Anthropic's own research on 'observed exposure' shows computer programmers face 75% task coverage from Claude-class models; Mythos's security capabilities represent the first instance where the coverage figure has operational implications beyond individual productivity.

The restriction decision reflects Anthropic's founding premise that AI safety and capability development must remain linked. Project Glasswing's $100 million credit allocation is structured as a subsidy for defensive deployment, not a commercial launch, which is itself novel for a frontier model.

What could happen next?
  • Precedent

    The first frontier AI model explicitly withheld from public release establishes capability-gating as a legitimate deployment option for safety-constrained AI systems.

    Medium term · 0.85
  • Risk

    The twelve Glasswing partners, which include both defensive (CrowdStrike, Palo Alto) and dual-use (AWS, Google, Microsoft) organisations, may deploy Mythos capabilities in ways beyond Anthropic's stated defensive intent.

    Short term · 0.62
  • Opportunity

    Security professionals who can interpret and direct AI-identified vulnerability data at scale represent a new premium-tier role the model creates even as it automates routine scanning.

    Medium term · 0.71
First Reported In

Update #5 · The model they won't release

Anthropic· 10 Apr 2026
Read original
Different Perspectives
Markets
Markets
Brent crude rose 2.2 per cent to $96.34 on 10 June, reversing a 7 per cent weekly decline built on deal optimism, as the overnight exchange repriced the Strait of Hormuz risk premium in a single session. The move reflects transit-risk repricing rather than supply shock: Iran's exports had already collapsed to below 300,000 barrels per day.
Pakistan
Pakistan
Pakistan's Naqvi channel, the only mediation track carrying both civilian and military buy-in, was stress-tested by live ordnance within 48 hours of the 6-7 June Tehran visit. Whether Washington informed Islamabad of the imminent strike plan while Naqvi was in Tehran remains undisclosed, putting the channel's neutrality under scrutiny.
Kuwait
Kuwait
Kuwait hosted the third Iranian strike on its soil since the 3 June airport drone attack, with Ali Al Salem airbase targeted in the three-country salvo. Its recent $1.98 billion Anduril Anvil counter-drone purchase signals it is rearming rather than reconsidering its hosting posture.
Bahrain
Bahrain
Bahrain absorbed the IRGC barrage via PAC-3 intercepts with its magazine already at 87 per cent depletion and no resupply before 2027. Sounding air-raid sirens over Manama, it faced the intercept burden with the thinnest defensive stack in the Gulf coalition.
Jordan
Jordan
Jordan reported all five incoming missiles intercepted with no injuries and no damage, a clean defensive performance that strengthens Amman's case for staying in the Western coalition without escalating its own posture. It now sits on Iran's target list for the first time despite not being a party to the Abraham Accords confrontation.
Iran / IRGC
Iran / IRGC
Foreign Minister Araghchi posted on X that US forces should 'leave our region if you want to be safe' and framed the exchange as a US defeat, while the IRGC claimed 21 targets hit and an F-35 hangar destroyed. The claims serve a domestic and Arab-audience framing rather than a verified battle-damage assessment.