The Inference Report

April 27, 2026

The infrastructure for AI agents is solidifying around two distinct needs: control and observation. On the control side, projects like CUA and openpilot are building the sandboxes and SDKs that let agents actually do things. CUA explicitly targets desktop automation across macOS, Linux, and Windows, while openpilot demonstrates the pattern at scale by supporting 300+ car models. Smaller runtimes like KrillClaw (49KB in Zig) show the opposite impulse, proving that agent execution doesn't require framework bloat. On the observation side, TensorZero and the MCP Inspector are addressing a real gap in the stack. You can build an agent now, but running it in production without visibility into what it's doing or how well it's working is still painful. TensorZero unifies the gateway, observability, and experimentation in one place, doing for LLM pipelines roughly what PostHog does for product analytics. The MCP Inspector takes a narrower angle, focusing on debugging the protocol layer itself.

Skills and memory are becoming first-class concerns. GitNexus solves a specific problem: drop a codebase into your browser and get a queryable knowledge graph without spinning up infrastructure. Beads does something similar for agent context; it is explicitly framed as a memory upgrade for coding agents. Skills repos like the Composio curated list and mattpocock's skills collection treat agent capabilities as composable, versioned artifacts rather than one-off integrations. This mirrors how package ecosystems matured: once the infrastructure works, the question becomes how you organize what the infrastructure can do. The surge in agent-specific tooling suggests the bottleneck has shifted from "can we build agents" to "can we make them reliable, observable, and useful at scale."

Jack Ridley

Trending
mattpocock/skills
25966

My personal directory of skills, straight from my .claude directory.

Alishahryar1/free-claude-code
14822

Use claude-code for free in the terminal, the VSCode extension, or via Discord, like openclaw

Z4nzu/hackingtool
66193

ALL IN ONE Hacking Tool For Hackers

abhigyanpatwari/GitNexus
30567

GitNexus: The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive knowledge graph with a built-in Graph RAG Agent. Perfect for code exploration

PostHog/posthog
34020

🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.

microsoft/typescript-go
25214

Staging repo for development of native port of TypeScript

trycua/cua
14595

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

gastownhall/beads
21884

Beads - A memory upgrade for your coding agent

curl/curl
41667

A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, MQTTS, POP3, POP3S, RTSP, SCP, SFTP, SMB, SMBS, SMTP, SMTPS, TELNET, TFTP, WS and WSS. libcurl offers a myriad of powerful features

home-assistant/core
86729

🏡 Open source home automation that puts local control and privacy first.