The infrastructure for AI agents is solidifying around two distinct needs: control and observation. On the control side, projects like CUA and openpilot are building the sandboxes and SDKs that let agents actually do things, CUA explicitly targets desktop automation across operating systems, while openpilot demonstrates the pattern at scale by controlling 300+ cars. Smaller runtimes like KrillClaw (49KB in Zig) show the opposite impulse: proving that agent execution doesn't require framework bloat. On the observation side, TensorZero and the MCP Inspector are addressing a real gap in the stack. You can build an agent now, but running it in production without visibility into what it's doing or how well it's working is still painful. TensorZero unifies the gateway, observability, and experimentation in one place, solving the problem that PostHog solves for product analytics but specifically for LLM pipelines. The MCP Inspector takes a narrower angle, focusing on debugging the protocol layer itself.
Skills and memory are becoming first-class concerns. GitNexus solves a specific problem: drop a codebase in your browser and get a queryable knowledge graph without spinning up infrastructure. Beads does something similar for agent context, it's explicitly framed as memory augmentation for coding agents. Skills repos like the Composio curated list and mattpocock's Skills collection are treating agent capabilities as composable, versioned artifacts rather than one-off integrations. This mirrors how package ecosystems matured: the infrastructure works, now the question is how you organize what the infrastructure can do. The surge in agent-specific tooling suggests the bottleneck has shifted from "can we build agents" to "can we make them reliable, observable, and useful at scale."
Jack Ridley
My personal directory of skills, straight from my .claude directory.
Use claude-code for free in the terminal, VSCode extension or via discord like openclaw
ALL IN ONE Hacking Tool For Hackers
GitNexus: The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive knowledge graph wit a built in Graph RAG Agent. Perfect for code exploration
🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.
Staging repo for development of native port of TypeScript
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
Beads - A memory upgrade for your coding agent
A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, MQTTS, POP3, POP3S, RTSP, SCP, SFTP, SMB, SMBS, SMTP, SMTPS, TELNET, TFTP, WS and WSS. libcurl offers a myriad of powerful features
🏡 Open source home automation that puts local control and privacy first.
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
The world's smallest AI agent runtime. 49KB. Written in Zig. Zero dependencies.
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Test, Debug, and Evaluate MCP servers, ChatGPT apps, and MCP Apps (ext-apps)
RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings
Low-code framework for building custom LLMs, neural networks, and other AI models
Selene is a desktop app that runs AI agents on your machine. Connect them to your WhatsApp, Telegram, Slack, or Discord. Write code, generate images, build personal assistants. All from one place. Your data stays on your device.
Benchmarking synthetic data generation methods.
The world's most comprehensive notification reader for Android devices.
Awesome Building and Implementing AI-Driven Solution
Curated list of AI tools, platforms, and resources for developers, creators, and businesses.
A comprehensive list of top best AI agents directories available online
🤖 A curated list of AI personal assistants, autonomous agents, and related tools
A curated list of resources tailored towards AI Engineers
Collection of UIs for ComfyUI
🌟 450+ Expert AI Skills | CEO, Doctor, Engineer, Scientist & more | Multi-platform: Claude, Cursor, Codex, Kimi, OpenClaw | Transform AI into any professional
A curated list of awesome tools, frameworks, libraries, and resources for running AI models on edge devices, including smartphones, IoT devices, embedded systems, and hardware accelerators. Edge AI focuses on processing data locally on the device, reducing latency and enhancing privacy.
Useful resources for developing with the RK3588. :rocket:
A curated list of the best AI tools for generating personalized or artistic photos using artificial intelligence.