The Inference Report

March 11, 2026

Agentic systems have moved from research curiosity to infrastructure problem. The trending repos show developers building two distinct layers: frameworks for running multi-agent workflows (agency-agents, deer-flow, superpowers) and testing infrastructure to validate them (promptfoo). The scale of this adoption is real: the top agentic repos have accumulated hundreds of thousands of stars. But the category splits cleanly between repos solving the "how do I orchestrate this" problem and repos solving the "how do I know it works" problem. Promptfoo's traction reflects a genuine pain point: as teams deploy agents with different LLM backends, comparing performance and detecting failure modes requires systematic testing, not guesswork. The discovery repos suggest the field is also maturing sideways into specialization: reinforcement learning for agent reasoning (AReaL), speech integration for multimodal agents (sherpa-onnx), and domain-specific tooling like forensic analysis (IPED) that happens to use agentic patterns.

What's notable is what's not trending. The repos gaining traction solve operational problems (testing, orchestration, evaluation) rather than proposing new model architectures or claiming performance breakthroughs. The marketing language in some descriptions ("whimsy injectors," "the lobster way") suggests the field still has room for clarity, but the underlying work is pragmatic: build agents that can be compared, debugged, and deployed. The presence of domain-specific tools like page-agent (web automation via natural language) and tgo (customer service agent teams) indicates this isn't just LLM researchers playing with prompts anymore. These are tools for shipping production systems. That shift from novelty to necessity is the real story in this week's activity.

Jack Ridley

Trending
volcengine/OpenViking
11197

OpenViking is an open-source context database designed specifically for AI Agents (such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolving.

anthropics/claude-plugins-official
11518

Official, Anthropic-managed directory of high quality Claude Code Plugins.

dimensionalOS/dimos
981

Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seamlessly with physical input (cameras, lidar, actuators).

p-e-w/heretic
14120

Fully automatic censorship removal for language models

langflow-ai/openrag
2855

OpenRAG is a comprehensive, single package Retrieval-Augmented Generation platform built on Langflow, Docling, and Opensearch.

lightpanda-io/browser
17501

Lightpanda: the headless browser designed for AI and automation

msitarzewski/agency-agents
44775

A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes, and proven deliverables.

fishaudio/fish-speech
27360

SOTA Open Source TTS

InsForge/InsForge
4329

Give agents everything they need to ship fullstack apps. The backend built for agentic development.

obra/superpowers
84068

An agentic skills framework & software development methodology that works.

Daily discovery
Firmament-Autopilot/FMT-Firmware (Robotics)
695

Firmament Autopilot Embedded System

wanshuiyin/Auto-claude-code-research-in-sleep (MCP)
1068

ARIS ⚔️ (Auto-Research-In-Sleep) — Claude Code skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation via Codex MCP

OliBomby/Mapperatorinator (Generative AI)
431

An AI framework for generating and modding osu! beatmaps for all gamemodes from spectrogram inputs.

unslothai/unsloth (Fine-tuning)
53998

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

LMD0311/Awesome-World-Model (Computer Vision)
1890

Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.

leoncuhk/awesome-quant-ai (AI Agents)
181

A curated list of awesome resources for quantitative investment and trading strategies focusing on artificial intelligence and machine learning applications in finance.

camel-ai/camel (Deep Learning)
16350

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

argilla-io/argilla (RLHF)
4895

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

openml/automlbenchmark (AutoML)
453

OpenML AutoML Benchmarking Framework

hyperspaceai/agi (Autonomous Agents)
602

The first distributed AGI system. Thousands of autonomous AI agents collaboratively train models, share experiments via P2P gossip, and push breakthroughs here. Fully peer-to-peer. Join from your browser or CLI.