The lab announcements today cluster around a single competitive reality: agents are moving from research to infrastructure, and the companies positioning their platforms as the place to run them are racing to lock in developer adoption. OpenAI leads with research on agent capability, showing how they handle longer, more complex tasks, while AWS positions itself as the orchestration layer, with Dr. Swami Sivasubramanian framing an entire stack around "agents that compound value." GitHub is already benchmarking agent performance across twenty-plus models, signaling that model choice is becoming commoditized and the real lock-in happens at the execution layer. Meanwhile, the infrastructure providers are optimizing for the computational cost of running these systems: Google is publishing on cache optimization, Hugging Face is simplifying vLLM deployment, and NVIDIA is bundling gaming discounts with GeForce NOW, a transparent play to build habit and installed base in cloud compute. IBM's sub-1 nanometer chip announcement sits apart but makes the pattern clear: everyone is betting that agent workloads will consume silicon at scale, and the companies that own the platform where agents run, not just the models that power them, will capture the margin. The real signal is what's absent: no lab is announcing breakthroughs in agent reasoning or reliability. They're announcing infrastructure, benchmarking, and pricing. That's where the money is moving.
Sloane Duvall
A curated reference of models from major AI labs, with open/closed weight status, input modalities, and context window size. American labs tend towards closed weights models and Chinese labs tend toward open weights models.
None
None
None
None
None
None
None