The Inference Report

March 14, 2026

The infrastructure around AI agents and language models continues to consolidate around practical deployment concerns rather than model capability itself. Microsoft's BitNet framework for 1-bit LLM inference and Google's LiteRT for on-device ML represent a shift in where the actual engineering work happens: not in training larger models, but in running existing ones efficiently at scale and on constrained hardware. This mirrors a broader pattern in the trending repos, where the real traction goes to tools that solve the integration problem, the testing problem, or the operational problem that comes after you have a model.

Agent frameworks dominate the discovery and trending lists, but they're solving different layers of the same problem. Langflow's OpenRAG packages retrieval-augmented generation as a single platform combining document processing, search, and generation. Alibaba's page-agent and the various agent infrastructure repos like InsForge and Hindsight attack the coordination problem: how do you give an agent the right tools, memory, and decision-making capability to actually accomplish something without human intervention at every step. Promptfoo addresses the earlier problem in the pipeline, providing structured testing and comparison across models and prompts before deployment. What separates the tools gaining real adoption from the noise is specificity: they solve a concrete bottleneck, not a vague aspiration. Dolt's approach to versioning and diffing data like code does something similar for data infrastructure, treating a real operational pain point as a first-class problem worth building around.

Jack Ridley

Trending
volcengine/OpenViking
11197

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file system paradigm, enabling hierarchical context delivery and self-evolving.

anthropics/claude-plugins-official
11518

Official, Anthropic-managed directory of high quality Claude Code Plugins.

dimensionalOS/dimos
981

Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seamlessly with physical input (cameras, lidar, actuators).

p-e-w/heretic
14120

Fully automatic censorship removal for language models

langflow-ai/openrag
2855

OpenRAG is a comprehensive, single package Retrieval-Augmented Generation platform built on Langflow, Docling, and Opensearch.

lightpanda-io/browser
17501

Lightpanda: the headless browser designed for AI and automation

msitarzewski/agency-agents
44775

A complete AI agency at your fingertips** - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes, and proven deliverables.

fishaudio/fish-speech
27360

SOTA Open Source TTS

InsForge/InsForge
4329

Give agents everything they need to ship fullstack apps. The backend built for agentic development.

obra/superpowers
84068

An agentic skills framework & software development methodology that works.

Daily discovery
Firmament-Autopilot/FMT-FirmwareRobotics
695

Firmament Autopilot Embedded System

wanshuiyin/Auto-claude-code-research-in-sleepMCP
1068

ARIS ⚔️ (Auto-Research-In-Sleep) — Claude Code skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation via Codex MCP

OliBomby/MapperatorinatorGenerative AI
431

An AI framework for generating and modding osu! beatmaps for all gamemodes from spectrogram inputs.

unslothai/unslothFine-tuning
53998

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

LMD0311/Awesome-World-ModelComputer Vision
1890

Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.

leoncuhk/awesome-quant-aiAI Agents
181

A curated list of awesome resources for quantitative investment and trading strategies focusing on artificial intelligence and machine learning applications in finance.

camel-ai/camelDeep Learning
16350

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

argilla-io/argillaRLHF
4895

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

openml/automlbenchmarkAutoML
453

OpenML AutoML Benchmarking Framework

hyperspaceai/agiAutonomous Agents
602

The first distributed AGI system. Thousands of autonomous AI agents collaboratively train models, share experiments via P2P gossip, and push breakthroughs here. Fully peer-to-peer. Join from your browser or CLI.