AI Daily — Apr 28, 2026
Photo: lyceumnews.com
Tuesday, April 28, 2026
The Big Picture
The two stories that defined AI's last seven years — Microsoft's exclusive grip on OpenAI, and the all-you-can-eat AI coding subscription — both ended on the same day. Meanwhile, Xiaomi quietly shipped a 1-trillion-parameter open-source model that does in four hours what a CS major does in three weeks, for a dollar per million tokens. The plumbing is what changed today. The models were almost beside the point.
What Just Shipped
- MiMo-V2.5-Pro (Xiaomi): MIT-licensed 1T-parameter MoE with 42B active and a 1M-token context window, tuned for agentic coding.
- MiMo-V2.5 (Xiaomi): Smaller sibling at roughly 310B total / 15B active, positioned as a native omni-modal agent under the same MIT license.
- Symphony (OpenAI): Open-source orchestration spec that turns issue trackers like Linear and Jira into the control plane for Codex agents.
- Talkie 13B (Talkie): A 13B foundation model trained exclusively on pre-1931 public domain text — a deliberate copyright-immunity play.
Today's Stories
The Microsoft–OpenAI Divorce Is Official, and Both Sides Are Calling It a Win
The deal that defined the AI industry since 2019 just got rewritten, and the most important word in the new version is non-exclusive.
Microsoft and OpenAI announced Monday that ChatGPT and other OpenAI products can now run on competing clouds, including Amazon Web Services and Google Cloud. Microsoft remains the primary cloud partner — products still launch first on Azure unless technical limits intervene — but Microsoft's license to OpenAI's intellectual property, which runs through 2032, is no longer exclusive, according to Microsoft's announcement and corroborating reporting from Bloomberg and iClarified.
Two clauses were removed from the old deal. Microsoft's obligation to share revenue with OpenAI is gone. And the AGI clause — the provision that would have let an independent panel declare OpenAI had built artificial general intelligence and unwound Microsoft's rights — has been struck. Financial and licensing terms now follow a fixed timeline rather than a milestone nobody could agree how to measure.
Reporters pointed to a practical trigger: OpenAI's up-to-$50-billion deal with Amazon had created a legal mess, with Microsoft's old terms blocking OpenAI from giving AWS exclusive rights to Frontier, its enterprise agent platform, as TechCrunch reported. AWS CEO Andy Jassy posted on X that OpenAI models are coming to Bedrock in the coming weeks.
If this works, OpenAI becomes a genuinely cloud-neutral product company and Microsoft trades exclusivity for a more durable financial relationship. If it doesn't, watch for Azure AI growth to slow visibly in the next two earnings cycles — Microsoft reports Q3 results Wednesday — and for the OpenAI revenue backlog question that's hung over the stock since February to become a clearer investor narrative.
Xiaomi Just Shipped the Open-Source Agent Model Nobody Saw Coming
Xiaomi may have just released the most capable open-source coding agent in the world. That sentence would have been absurd eighteen months ago.
Xiaomi released MiMo-V2.5 and MiMo-V2.5-Pro under the MIT License — meaning enterprises can use them in production without the usage restrictions Meta's Llama license imposes. The Pro model is a Mixture-of-Experts architecture (1 trillion total parameters, 42 billion active during inference), supports a 1 million-token context window, and handles agent workflows of more than 1,000 tool calls. It scores 57.2% on SWE-bench Pro, as reported by VentureBeat on April 27, 2026.
The number that matters isn't the benchmark. It's the token efficiency. Xiaomi reports the Pro model leads the open-source field at 63.8% on ClawEval (as reported on April 27, 2026) while consuming roughly 70,000 tokens per trajectory — 40% to 60% fewer tokens than Anthropic's Claude Opus 4.6, Google's Gemini 3.1 Pro, or OpenAI's GPT-5.4 need to reach comparable results. In a world where GitHub just announced usage-based billing, that gap is a direct cost advantage.
The vendor demo is striking — Xiaomi says the Pro model implemented a complete SysY compiler in Rust in 4.3 hours across 672 tool calls, scoring 233/233 on Peking University's hidden test suite for a project that takes CS majors several weeks. Independent verification is pending. But Build Fast With AI's review notes Xiaomi already held 21.1% of OpenRouter traffic by early April, roughly three times OpenAI's 7.5% share.
If enterprises start routing agent workloads to MiMo as Copilot's June 1 billing change lands, this is the moment Chinese open-source displaces Western commercial tooling in production. If the benchmarks don't replicate independently, it's another paper-tiger release. The signal to watch: Hugging Face download velocity over the next two weeks, and whether vLLM/SGLang deployment guides start appearing in enterprise procurement docs.
GitHub Just Ended the Flat-Rate Era of AI Coding
The $10/month AI coding subscription is about to mean something very different.
GitHub announced Monday that all Copilot plans will transition to usage-based billing on June 1, 2026. The base subscription prices aren't changing, but every plan will now include a monthly allotment of GitHub AI Credits, calculated on token consumption — input, output, and cached — at listed API rates.
GitHub's CPO Mario Rodriguez said the quiet part out loud: "Today, a quick chat question and a multi-hour autonomous coding session can cost the user the same amount. GitHub has absorbed much of the escalating inference cost behind that usage, but the current premium request model is no longer sustainable," per The Register.
There's a buried second shoe. GitHub's changelog confirms Copilot code review on private repos will start consuming GitHub Actions minutes on the same day — meaning AI consumption is now tied directly to existing DevOps budgets. Developer reaction in Visual Studio Magazine's coverage was blunt: "you will get less, but pay the same price."
If Copilot's preview bill experience launches in early May and shows projected costs significantly above current spend, expect a wave of teams to evaluate MiMo-V2.5-Pro, Kimi K2.6, and other open-source alternatives. If users adapt quietly, GitHub has successfully repriced autonomous software labor as a metered utility. The signal is straightforward: enterprise procurement RFPs hitting open-source coding models in May.
OpenAI Open-Sources Symphony, Turning Issue Trackers Into Agent Control Planes
OpenAI published Symphony — an open-source orchestration spec that treats Linear, Jira, or GitHub Projects as the control plane for long-running Codex agents. OpenAI said internal teams using the pattern saw a 500% increase in landed pull requests in internal tests.
The plain version: instead of one AI assistant waiting for prompts, every issue ticket gets a worker that runs, reports, and hands off to humans for review. Agents become supervised background staff rather than ephemeral chatbots.
This matters more than it reads on first glance. Orchestration specs are how runaway inference costs get controlled — Symphony explicitly maps spend to specific tickets, which is exactly the dashboard engineering leaders need now that GitHub is metering Copilot. If Symphony-style patterns get copied into mainstream issue trackers in May, agentic coding graduates from novelty to managed labor system. If it stays an OpenAI internal flex, it's just a clever blog post.
China Blocks Meta's Manus Acquisition, and Agent M&A Becomes Geopolitical
China's National Development and Reform Commission ordered the parties to withdraw from Meta's acquisition of Manus, a Singapore-registered agent startup with Chinese founders, per CNBC.
The lesson is that buying the next hot agent startup is now a national-security review on both sides of the Pacific. If this becomes pattern rather than precedent, Western buyers stop targeting Singapore-based agent firms with mainland connections — which is most of them. Watch the next two cross-border agent deals announced; if either gets pulled or restructured around founder visas, the Manus block was the new normal.
Physical AI's Real Bottleneck Is the Operating System, Not the Model
Applied Intuition founders Qasar Younis and Peter Ludwig laid out their full platform thesis on Latent Space this week. The $15 billion company runs driverless L4 trucks in Japan today, and their core argument cuts against the dominant narrative: physical AI's bottleneck isn't model intelligence — it's deployment onto constrained hardware with millisecond latency, low power, small memory, and fail-safes for when a cosmic ray flips a bit in a processor.
Their pitch is "Android for moving machines." The vehicle software stack today is fragmented across dozens of operating systems, much like phones before iOS and Android. Applied wants to consolidate the platform layer so AI applications can actually run on cars, trucks, mining equipment, and defense systems — across chipsets, with reliable updates that don't brick a vehicle.
If the thesis holds, Applied becomes the platform tax on every moving machine. If specialized stacks win per-vertical, it ends up as one vendor among many. The signal: watch their next major OEM or defense contract win in Q2 earnings season, and whether Tesla-style end-to-end models can be distilled small enough to run onboard without Applied's middleware.
OpenAI's Smartphone Plans Sharpen, Per Ming-Chi Kuo
Supply-chain analyst Ming-Chi Kuo of TF International Securities reported that OpenAI is working with MediaTek and Qualcomm on mobile processors, with Luxshare Precision winning the system co-design and manufacturing contract. Final chip specs are expected by late 2026 or early 2027; mass production is targeted for 2028.
The architecture implication is what matters. Current mobile agents are throttled by app sandboxing — even a powerful assistant on iOS still has to navigate permissions, payment APIs, and inter-app barriers. Owning the OS lets an agent orchestrate calendar, payments, and messaging as a unified workflow. China is already running a faster, more aggressive version through ByteDance and ZTE's Doubao phone, which simulates GUI interactions to bypass APIs — and which WeChat, Alipay, and major banking apps began blocking for security reasons.
If OpenAI ships a device in 2028 with OS-level agent control, Apple and Google's privacy-sandbox approach becomes the throttle on agentic UX rather than its protector. If the device ships and nobody buys it, OpenAI joins the long list of companies that learned consumer hardware is brutal. Watch which Chinese OEMs — Honor, Vivo, and others — actually integrate Doubao at scale before OpenAI ships anything.
⚡ What Most People Missed
Honor is adapting DeepSeek V4 for Guangdong-made compute chips. Chinese-language reporting Monday confirmed Honor is adapting DeepSeek V4 for Guangdong-made compute chips, accelerating commercial deployment. Combined with V4's launch on Huawei silicon last week, the gap between NVIDIA and Ascend is closing on the software side, not just the hardware. [Source: 21 Finance — Chinese]
Wujie Dynamics says it crossed $200M in cumulative angel financing. Per 36Kr, the embodied-AI startup completed an Angel++ round Monday with Envision Group and the Beijing AI Industry Investment Fund participating. Angel rounds aren't supposed to look like late-stage war chests — investors are prepaying for physical AI as infrastructure. Single-source for now; treat as a watchable signal.
Talkie 13B is a copyright-immunity play disguised as a model release. The 13B model was trained exclusively on pre-1931 public domain text. It will be blind to anything post-1930 by design. It's an explicit legal-ballast strategy against the lawsuits paralyzing the industry — and a hint that "data-orthogonal" architectures are becoming a strategy.
Volcengine's Doubao 2.0 is shipping in cars. SAIC Roewe introduced what it calls the first "AI-native" car series at the Beijing Auto Show, per 36Kr, with Volkswagen outlining a roadmap for in-car agents shipping in new models in the second half of 2026. The car is being recast as an agent runtime with wheels.
Microsoft TRELLIS.2 is image-to-3D at 1536³ resolution. A 4B-parameter open-source model from Microsoft generates textured PBR 3D assets from a single image — the r/LocalLLaMA post is climbing without any mainstream coverage. Game studios and robotics simulation teams just got a free replacement for a meaningful chunk of their asset pipeline.
📅 What to Watch
- If Microsoft Q3 earnings Wednesday show Azure AI growth holding or accelerating despite the OpenAI exclusivity unwind, the market reads the restructuring as strategy rather than concession — and Satya Nadella has cover to diversify aggressively.
- If Copilot's preview bill in early May shows projected costs significantly above current subscriptions, expect mid-May to bring the first credible enterprise migration test for Chinese open-source coding models.
- If OpenAI's Frontier ships on AWS Bedrock before it ships on Azure, "Azure first" is already a formality and the Microsoft partnership is a financial instrument, not a technical one.
- If Honor or Vivo officially integrates ByteDance's Doubao agent OS at scale, China bypasses Android's app-sandbox limits before the US — and the OpenAI phone becomes a defensive product, not an offensive one.
- If a second cross-border agent acquisition gets pulled after Manus, the Singapore-with-Chinese-founders structure is officially dead as an M&A vehicle.
The Closer
The AGI clause was removed in a press release, a phone company in Shenzhen built a Rust compiler before lunch, and developers learned that their $10 coding assistant has been quietly running up a tab the whole time. Somewhere in a vault in Beijing, a 13-billion-parameter model trained entirely on Edwardian novels is patiently explaining why it has no opinion on the iPhone. That's the briefing.
Forward this to the engineering manager who's about to find out what their Copilot bill actually looks like in June.