The trending set reveals two distinct developer priorities emerging in parallel. First, there's aggressive infrastructure for AI agents: trading frameworks, memory engines, web scrapers, and terminal coding agents are all climbing the charts. These aren't toys. TradingAgents sits at 82k stars because it solves a concrete problem, coordinating multiple LLM instances to make financial decisions, and the fact that stefan-jansen's Machine Learning for Algorithmic Trading book repo maintains 18k stars suggests this isn't new money chasing hype but practitioners building on established foundations. The agent tooling trend extends to design and HTML generation, where pbakaus/impeccable and nexu-io/html-anything both treat AI output as a material to be shaped rather than consumed raw. What's notable is that these repos aren't abstracting away the agent; they're making the agent's constraints visible and workable, hash-anchored edits, sandboxed previews, specialized skill definitions.
The second pattern is conversion and data movement. microsoft/markitdown's 139k stars sits at the top because file format conversion is a solved problem that never stops being needed, especially now that every document pipeline touches an LLM. D4Vinci/Scrapling's web scraping framework and dmtrKovalenko/fff's file search toolkit follow the same logic: they're optimizing the plumbing between data sources and models. Meanwhile, the discovery set shows infrastructure consolidating around specific constraints: SkyPilot abstracts cloud compute heterogeneity, FireRedASR2S packages production-grade speech recognition with language identification and punctuation restoration as a single system, and DeepEcho generates synthetic time series. These aren't flashy. They solve the unglamorous work of getting data into shape before the model ever sees it. That's where the actual friction lives, and that's where serious projects are investing.
Jack Ridley
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Python tool for converting files and office documents to Markdown.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
Hermes WebUI: The best way to use Hermes Agent from the web or from your phone!
A modern platform for visual, flexible, and extensible graph-based investigations. For cybersecurity analysts and investigators.
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
Code for Machine Learning for Algorithmic Trading, 2nd edition.
Memory engine and app that is extremely fast, scalable. The Memory API for the AI era.
Simplifying reinforcement learning for complex game environments
A minimal quadrotor autonomy framework in Rust (Mac, Linux, Windows)
Fast and Accurate ML in 3 Lines of Code
Build resilient language agents as graphs.
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Ultralytics YOLO 🚀
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Open source agentic operating system
CodeWiki is a knowledge platform that analyzes repositories into AST graphs, builds GraphRAG indexes, and generates source-grounded developer wikis with FastAPI, React, and LiteLLM.
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.
😎 Awesome lists about generative AI use cases
A curated list of resources tailored towards AI Engineers
A curated list of awesome things about Bittensor.
A curated list of AI tools, courses, books, and resources for anyone interested in exploring artificial intelligence, machine learning, and deep learning.
🚀 Explore a curated collection of top Model Context Protocol (MCP) servers for seamless connectivity and enhanced experiences in your projects.
Perfect for creators, devs & AI lovers. Always updated.
⚡Delightful WebNN resources, curated list of awesome things around WebNN ecosystem.😎
Awesome-next-devtools
A collection of MCP servers.
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.