The Inference Report

April 6, 2026

The GitHub trending set reveals a sharp split between what developers are actually building versus what's generating viral attention. On one side sit purpose-built tools addressing concrete problems: MLX-VLM packages inference and fine-tuning for vision language models specifically on Apple Silicon, addressing the practical reality that not every AI workload runs on cloud GPUs. Sklearn-genetic-opt and FL-bench tackle narrower but real problems, hyperparameter tuning via evolutionary algorithms and federated learning benchmarking, where the solution matters more than the hype. On the other side, repos like openscreen (a Screen Studio alternative) and Telegram Desktop climb the charts through utility and polish rather than novelty, suggesting developers reward tools that simply work well at their stated job.

The agent and platform layer is consolidating around two patterns. Goose, pi-mono, and onyx all position themselves as extensible foundations for AI agents and LLM applications, each making different bets on abstraction: Goose as a code-executing agent, pi-mono as a unified LLM API with multiple interfaces, onyx as a chat platform. Rather than compete on features, they're competing on which abstractions let builders move fastest. Pixeltable and Burn represent a quieter but significant trend, infrastructure that doesn't pretend to be general purpose. Pixeltable specifically targets multimodal AI workloads with a declarative, incremental model. Burn positions itself as a tensor library that doesn't sacrifice flexibility for performance, a direct challenge to existing frameworks. Google's gallery and LiteRT-LM suggest the company is betting that on-device inference and local model exploration will be the distribution channel for edge ML, which means the real work is making it frictionless to try and deploy. The repos gaining traction solve for speed, specificity, or honest trade-offs, not for doing everything at once.

Jack Ridley

Trending

google-ai-edge/gallery

19109 ★

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

google-ai-edge/LiteRT-LM

2743 ★

NVIDIA/personaplex

8164 ★

PersonaPlex code.

abhigyanpatwari/GitNexus

24911 ★

GitNexus: The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive knowledge graph wit a built in Graph RAG Agent. Perfect for code exploration

tobi/qmd

19836 ★

mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local

forrestchang/andrej-karpathy-skills

8323 ★

elebumm/RedditVideoMakerBot

10217 ★

Create Reddit Videos with just✨ one command ✨

HKUDS/DeepTutor

12664 ★

"DeepTutor: AI-Powered Personalized Learning Assistant"

TheCraigHewitt/seomachine

4129 ★

A specialized Claude Code workspace for creating long-form, SEO-optimized blog content for any business. This system helps you research, write, analyze, and optimize content that ranks well and serves your target audience.

aaif-goose/goose

38508 ★

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Daily discovery

inboxpraveen/LLM-Minutes-of-MeetingNLP

164 ★

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where we'll be open for contributions to enable real-time meeting transcription! 🚀

GreenmaskIO/greenmaskSynthetic Data

1650 ★

Database anonymization, synthetic data generation and logical dump

trustgraph-ai/trustgraphKnowledge Graph

1956 ★

The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.

crate/crateVector Database

4380 ★

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

commaai/openpilotRobotics

60542 ★

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.

huggingface/diffusersImage Generation

33282 ★

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

miurla/morphicGenerative AI

8744 ★

An AI-powered search engine with a generative UI

qijianpeng/awesome-edge-computingEdge AI

501 ★

A curated list of awesome edge computing, including Frameworks, Simulators, Tools, etc.

felladrin/MiniSearchRAG

554 ★

Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space

graphnet-team/graphnetNeural Network

110 ★

A Deep learning library for neutrino telescopes

Awesome AI

CodeLabClub/awesome-STEM-AI

4 ★

awesome STEM AI

maverickg59/awesome_ai_resources

2 ★

A curated list of resources tailored towards AI Engineers

zenml-io/awesome-open-data-annotation

683 ★

Open Source Data Annotation & Labeling Tools

WagnerAgent/awesome-mcp-servers-devops

92 ★

A curated, DevOps-focused list of Model Context Protocol (MCP) servers—covering source control, IaC, Kubernetes, CI/CD, cloud, observability, security, and collaboration—with a bias toward maintained, production-ready integrations.

samouraiworld/awesome-mistral

36 ★

A curated list of awesome resources, tools, libraries, and projects for the Mistral AI ecosystem.

rafska/awesome-local-llm

1240 ★

A curated list of awesome platforms, tools, practices and resources that helps run LLMs locally

yayxs/awesome-x-claude-code

24 ★

Tweets about Claude Code on Twitter / X

dqj5182/awesome-nature-aiml-papers

0 ★

AI, ML papers published at Nature