The Inference Report

June 2, 2026

The trending set reveals two distinct developer priorities emerging in parallel. First, there's aggressive infrastructure for AI agents: trading frameworks, memory engines, web scrapers, and terminal coding agents are all climbing the charts. These aren't toys. TradingAgents sits at 82k stars because it solves a concrete problem, coordinating multiple LLM instances to make financial decisions, and the fact that stefan-jansen's Machine Learning for Algorithmic Trading book repo maintains 18k stars suggests this isn't new money chasing hype but practitioners building on established foundations. The agent tooling trend extends to design and HTML generation, where pbakaus/impeccable and nexu-io/html-anything both treat AI output as a material to be shaped rather than consumed raw. What's notable is that these repos aren't abstracting away the agent; they're making the agent's constraints visible and workable, hash-anchored edits, sandboxed previews, specialized skill definitions.

The second pattern is conversion and data movement. microsoft/markitdown's 139k stars sits at the top because file format conversion is a solved problem that never stops being needed, especially now that every document pipeline touches an LLM. D4Vinci/Scrapling's web scraping framework and dmtrKovalenko/fff's file search toolkit follow the same logic: they're optimizing the plumbing between data sources and models. Meanwhile, the discovery set shows infrastructure consolidating around specific constraints: SkyPilot abstracts cloud compute heterogeneity, FireRedASR2S packages production-grade speech recognition with language identification and punctuation restoration as a single system, and DeepEcho generates synthetic time series. These aren't flashy. They solve the unglamorous work of getting data into shape before the model ever sees it. That's where the actual friction lives, and that's where serious projects are investing.

Jack Ridley

Trending
Daily discovery