Skip to content
Briefings are running a touch slower this week while we rebuild the foundations.See roadmap
AI: Jobs, Power & Money
8JUN

Tom's Hardware challenges Mythos zero-day claims

2 min read
11:04UTC

A technical review found Anthropic's marketing relied on 198 manual reviews to support claims of thousands of severe vulnerabilities.

EconomicDeveloping
Key takeaway

Only 198 manual reviews support Anthropic's claim of thousands of zero-day discoveries.

Tom's Hardware published a critical review of Anthropic's Mythos claims on 9 April, noting that the "thousands of zero-days" assertion rested on only 198 manual reviews 1. Many of the flagged vulnerabilities were in outdated software no longer in active use. The gap between Anthropic's marketing language and the verified sample is wide enough to warrant caution.

The Bessent-Powell emergency meeting at Treasury headquarters proceeded regardless of this scrutiny. Challenger data confirmed AI-attributed cuts crossed 107,094 the same month , suggesting federal regulators assessed the systemic risk of AI broadly, beyond Mythos's specific claims. Whether Mythos found hundreds or thousands of exploitable flaws, the CyberGym benchmark score of 83.1% versus 66.6% for its predecessor represents a measurable capability jump that the twelve Glasswing partners will deploy in production environments.

Deep Analysis

In plain English

When Anthropic announced that Claude Mythos had found 'thousands' of serious security flaws in software, it was a dramatic claim. Tom's Hardware, a technology publication, looked at how Anthropic had actually counted those flaws. The answer was: 198 human reviewers manually checked the model's outputs. Many of the flaws it identified were in old software that organisations had already stopped using. The gap between 'thousands of vulnerabilities' and 198 verified reviews is significant. The US Treasury and Federal Reserve held their emergency meeting with bank CEOs regardless of this critique, which suggests the regulators assessed the risk from the model's overall capability trajectory, not just the specific zero-day count.

First Reported In

Update #5 · The model they won't release

Tom's Hardware· 10 Apr 2026
Read original
Causes and effects
This Event
Tom's Hardware challenges Mythos zero-day claims
Independent scrutiny of Mythos's capability claims introduces uncertainty about the model's actual security impact, even as regulators acted on the headline numbers.
Different Perspectives
European workers and regulators
European workers and regulators
NBER working paper w34995 found European workers use generative AI at 32% versus 43% of US workers, a gap driven by management practice rather than regulation. The EU AI Act's high-risk employment deadline stays at December 2027, leaving European workers facing the same displacement curve two to four years behind the US.
AI industry (Leading the Future PAC, OpenAI, Andreessen Horowitz)
AI industry (Leading the Future PAC, OpenAI, Andreessen Horowitz)
Leading the Future committed over $100 million to the 2026 midterms and targeted regulation-minded candidates in the 2 June primaries; its counter-fund Public First formed at $50 million. The PAC runs advertising on healthcare and jobs without naming AI, mirroring the 1994 insurance industry campaign that defeated the Clinton health plan.
UK youth entering the labour market
UK youth entering the labour market
UK youth unemployment reached 14.7% in January-March 2026, the highest since 2014, with 22.7% of young jobseekers out of work more than a year. The ONS publishes no AI-exposure breakdown, so policy is being set blind to the channel doing the damage.
US displaced workers (tech and finance)
US displaced workers (tech and finance)
Tech workers face median reemployment times of 4.7 months, up 47% from 2024, with a hiring pool contracting faster than AI-specialist openings can absorb them. Finance operations workers are the next cohort: 52% of their employers now run agentic AI in the exact functions where most of them work.
TSMC and Taiwan chip supply chain
TSMC and Taiwan chip supply chain
Nvidia's 17% headcount growth to 42,000 on $81.6 billion in quarterly revenue depends on TSMC's CoWoS advanced packaging capacity constraining H100 and B200 supply, sustaining margins above 70%. The AI build-out's sole headcount-growth story runs through a Taiwan supply chain that has no parallel in downstream software.
Displaced tech workers globally
Displaced tech workers globally
CrowdStrike's SEC disclosure puts AI attribution on a material regulatory record for the first time, but Oracle's Massachusetts WARN clock expired unfiled after up to 14 workers were logged as remote despite office proximity. The legal apparatus cannot enforce what it cannot see: hybrid reclassification, GCC transfers, and hires never made.