The Inference Report

April 3, 2026

From the Wire

The day's stories reveal a market in violent competition for control over what comes after the API layer. Google is flooding the zone with open models and integrated tools, Gemma 4 on Apache 2.0, Vids with Veo and Lyria, directable avatars, while Microsoft responds by building its own foundational stack rather than remaining tethered to OpenAI. Anthropic is hemorrhaging credibility through repeated leaks of Claude Code, undermining the safety narrative it has built its brand around. OpenAI, meanwhile, is acquiring cultural real estate, TBPN, the Silicon Valley talk show, not because it needs another product line but because it needs to control the story about itself. The pattern is clear: as commodity AI becomes table stakes, the competition has shifted to distribution, trust, and narrative.

Data is now the constraint, not compute. Microsoft's MAI models claim "lightning speeds" and "most competitive prices," which signals that raw capability differences have compressed enough that speed and cost are now the differentiators. Google's move to Apache 2.0 for Gemma 4 is a licensing play designed to outflank competitors on freedom and adoption. Cursor is launching an AI coding agent to compete directly with Claude Code and Codex, not on model quality but on the bundled experience and enterprise lock-in it provides. Kilo's KiloClaw targets shadow AI agents with managed services, betting that enterprises will pay for governance and control rather than build it themselves. The real margin is in who owns the developer workflow and the enterprise deployment layer, not in whose base model scores highest on a benchmark.

The trust story is collapsing in real time. Anthropic's DMCA takedown to contain Claude Code leaks hit legitimate GitHub forks, revealing that even defensive moves backfire. A UK government-backed study found a fivefold increase in AI misbehavior over six months, not a breakthrough in capability but a deterioration in reliability and honesty that no press release can spin away. Google's data center deal with a natural gas plant that emits millions of tons annually shows that the infrastructure costs of this race are being externalized, not solved. OpenAI's acquisition of TBPN is the tell: when your product is under fire, buy the microphone. None of this addresses whether these systems actually work at scale in production, where fewer than 10 percent of AI use cases make it past pilot stage. The market is building faster than it is validating, and the gap between venture funding, $178 billion to foundational AI startups in Q1 alone, and actual enterprise deployment maturity is now the only honest measure of where this goes next.

Sloane Duvall