The Lyceum: AI Daily — May 09, 2026
Photo: lyceumnews.com
Saturday, May 9, 2026
The Big Picture
The compute crunch keeps writing the headlines. Anthropic just rented its rival's entire supercomputer because demand grew 80x when it planned for 10x; OpenAI shipped voice models that finally collapse the transcription-reasoning-synthesis stack into one loop; and DeepSeek — the lab that built its identity on independence — is taking state money for the first time. Underneath all of it, Timothy Gowers, a Fields Medalist, published a blog post yesterday quietly revising his priors on whether large language models can do real mathematics.
What Just Shipped
- GPT-Realtime-2 (OpenAI): Speech-to-speech model with GPT-5-class reasoning, 128K context, parallel tool calls, and adjustable reasoning effort.
- GPT-Realtime-Translate (OpenAI): Live streaming speech translation across 70+ input languages into 13 output languages.
- Kimi K2.6 (Moonshot AI): 256K context window, now hosted across DeepInfra, Together, Phala, and BaseTen at $0.75/$3.50 per million in/out tokens.
- Qwen3.6 35B A3B (Alibaba/Qwen): Open-source model with 262K context, available via Together.
- Trinity Large Preview (Arcee AI): 131K context, $0.15/$0.45 per million tokens, optimized for specialized enterprise tasks.
- Grok Voice Think Fast 1.0 (xAI): Voice-mode reasoning model now listed on inference trackers.
Today's Stories
Anthropic Rented Its Rival's Entire Supercomputer
Overnight, Elon Musk became its landlord.
Anthropic announced a deal to take over compute capacity at SpaceX's Colossus 1 — the supercomputer xAI built — gaining what one circulating estimate pegs at 300 megawatts and 220,000 NVIDIA GPUs coming online within the month. The catalyst, per CEO Dario Amodei at Anthropic's developer conference: usage and revenue grew 80x in the first quarter of 2026 when the company had planned for 10x. Claude Code rate limits doubled for paid plans overnight, peak-hour throttling on Pro and Max is gone, and Opus API caps jumped sharply.
According to Fortune, citing New Street Research analyst Antoine Chkaiban, the deal will generate $3 to $4 billion in annual revenue for SpaceX with more than $2.5 billion in cash profit — conveniently arriving before SpaceX's expected IPO roadshow next month. If this works, frontier labs become tenants in each other's data centers and the "vertically integrated lab" thesis dies. If it strains, we'll know by whether Anthropic's reported $30 billion ARR run rate translates into pricing power, or just a bigger infrastructure bill payable to a man who, until recently, was suing them by association.
OpenAI's Voice API Just Grew Up
If you've ever tried to build a voice assistant, you know the drill: bolt together transcription, a reasoning model, and text-to-speech, then pray latency doesn't make users hang up. OpenAI just collapsed that stack.
GPT-Realtime-2 handles audio in and audio out with reasoning happening inside the audio loop — no awkward silence while the model "thinks." Context jumps from 32K to 128K tokens. Two companion models ship alongside: GPT-Realtime-Translate (live speech translation, 70+ input languages to 13 output) and GPT-Realtime-Whisper (streaming transcription as the speaker talks). Instruction retention increased from 36.7% to 70.8% on Scale AI's Audio MultiChallenge benchmark. Per Glean's internal evaluations, the model delivered a 42.9% relative increase in helpfulness in those evaluations over the prior version. Per Genspark's evaluations, its Call for Me Agent saw a 26% lift in effective conversation rate in those tests.
The signal that tells you which way it's going: ChatGPT's consumer voice mode hasn't been upgraded yet. When Sam Altman flips that switch, voice AI graduates from API-only to the mass market.
If voice agents finally feel native rather than stilted, customer support, accessibility, and hands-free workflows become the next interface battleground.
DeepSeek Takes State Money — and Stops Being a Hedge Fund Project
For a year, DeepSeek was the strangest company in AI: a hedge-fund-backed Chinese lab that published everything, took no outside money, and somehow kept pace with labs spending ten times more. That era is ending.
According to the Financial Times, China's state-backed semiconductor investment vehicle — known as the Big Fund — is in talks to lead DeepSeek's first external fundraising at roughly $45 billion, up from a $20 billion estimate weeks ago. Tencent is also reportedly in discussions to participate. Founder Liang Wenfeng historically financed operations through his quantitative fund, High-Flyer, which is what gave DeepSeek the freedom to behave like a research lab rather than a startup.
State capital changes that calculus, and the geopolitical framing around DeepSeek's open-source releases. DeepSeek is already optimizing models for Huawei's Ascend chips, reducing dependence on US silicon. If the round closes, the era of self-funded scrappy frontier research in China is over — and Beijing has formally bet on a national champion. Watch the publishing cadence afterward: if open-weight releases slow or get scoped, you'll know whose strings are attached.
Anthropic's Project Glasswing Is Hunting Zero-Days at Production Scale
Within the past 24 hours, Anthropic published Project Glasswing — a controlled program giving select organizations including AWS, Apple, Cisco, Google, JPMorgan Chase, and Microsoft access to Claude Mythos Preview to find vulnerabilities before attackers do. The headline claim from Anthropic's own page: over the past few weeks, Mythos Preview identified thousands of zero-day vulnerabilities, many critical, across every major operating system and web browser — including a 27-year-old flaw in OpenBSD, an OS famous for its security focus. Anthropic says the model identified nearly all of them and developed many exploits autonomously, without human steering.
The UK's AI Safety Institute corroborated the underlying capability jump: in controlled evaluations, Mythos Preview is the first model to solve AISI's "The Last Ones" — a 32-step corporate network attack simulation — completing it 3 out of 10 attempts.
The model isn't being released publicly. Glasswing is Anthropic's bet that you can deploy offensive cyber capability defensively — but only if you control distribution. If proliferation outpaces patching, the "thousands of zero-days" number stops being a defensive marketing line and becomes a public-safety problem. The signal to watch: independent confirmation from the named partner companies that Glasswing-flagged patches actually shipped.
A Fields Medalist Quietly Raised His Priors on AI Math
Timothy Gowers posted to his personal blog Friday afternoon. The post reached the top of r/singularity within hours. The substance: Gowers wrote that he has "just made a fairly large revision" to his assessment of large language models' mathematical capabilities after using ChatGPT 5.5 Pro on active research. He describes the model producing a proof for an open problem whose structure closely paralleled what he'd independently derived — suggesting it wasn't just pattern-matching to a known solution.
The context that makes this more than a single blog post: a November 2025 arXiv paper co-authored by Gowers and OpenAI researchers documented four new mathematical results human-verified as genuine contributions. Yesterday's post is Gowers signaling that GPT-5.5 Pro is qualitatively beyond what he saw in that earlier work. OpenAI separately says a customized version of GPT-5.5 helped researchers find a new proof related to Ramsey numbers.
The signal isn't "AI solved math." It's that a careful, skeptical expert is updating his priors in public, in real time. If more domain experts at Gowers's level start writing posts like this, the conversation about scientific automation moves from "interesting demos" to "credible witnesses." If they don't — if this remains a single anecdote — it's a fascinating data point about one model's strongest day.
⚡ What Most People Missed
- China's "Four AI Dragons" cleared one trillion yuan in combined valuation: Adding DeepSeek's $45 billion to Zhipu, MiniMax, and Kimi, the combined valuation of China's four leading model labs has crossed one trillion yuan. The market-cap story hasn't landed in Western business press yet, and it reframes the competitive landscape entirely. [Source: 36Kr — Chinese]
- Moonshot AI's $2 billion round is closing: Per the South China Morning Post, Kimi's parent company has crossed a $20 billion valuation, with Meituan and China Mobile among investors and ARR exceeding $200 million in April. This isn't a research lab anymore; it's a commercial rollout.
- Anthropic's next round is reportedly taking shape near $900 billion: The Financial Times is reporting Anthropic aims to raise up to $50 billion at roughly $900 billion — which would make it the largest private fundraise in history by a wide margin. The round is "taking shape," not signed; watch whether sovereign wealth or strategic corporates anchor it rather than traditional VCs.
- OpenAI quietly republished its Preparedness Framework: A new PDF appeared on OpenAI's CDN with no accompanying blog post. Policy often precedes product changes, so refreshed governance documents are operational signals even without a press release.
- Mozilla is building "Stack Overflow for agents": Mozilla AI's cq project, started in March, frames itself as cross-vendor, machine-readable implementation knowledge for coding agents — infrastructure for the layer below the agent, not another agent.
- A Uruguayan offender used ChatGPT to groom 30+ children: Luis Carvajal, with prior convictions, used ChatGPT to craft tailored grooming messages for minors he met through online games, and was sentenced in October 2025 to nearly 10 years. Generative AI weaponized at scale isn't theoretical anymore — it's in courtrooms.
📅 What to Watch
- If SpaceX's S-1 prominently features Anthropic compute revenue, Musk has successfully repositioned SpaceX as an AI infrastructure company before its IPO — not a rocket company that happens to rent GPUs.
- If sovereign wealth funds anchor Anthropic's reported $900B round rather than traditional VCs, it signals investors now view frontier labs as critical infrastructure, not startups — and prices them accordingly.
- If DeepSeek's open-source publishing cadence slows after the Big Fund round closes, state capital came with conditions, and China's open-weight era is ending.
- If named Glasswing partners publicly confirm patches shipped from Mythos-discovered zero-days, autonomous offensive cyber capability has crossed from research demo to deployed tool — and the disclosure timeline becomes a regulatory question.
- If ChatGPT consumer voice mode gets the Realtime-2 upgrade within the next two weeks, expect a wave of voice-first product pivots that have been waiting for exactly this latency profile.
The Closer
A safety lab paying its loudest critic $4 billion to host its model on a supercomputer named Colossus; a Fields Medalist updating his priors via WordPress on a Friday evening; a hedge-fund founder in Hangzhou about to take a check from Beijing's chip czar. The compute is so tight that "evil" is now a pricing tier, and somewhere in Uruguay a courtroom is figuring out what generative AI means when the user is a predator with prior convictions.
Talk soon.
If you know someone still calling this a hype cycle, forward them this one.