AI Explorer

Infrastructure

LLM inference, model serving, and AI infrastructure projects.

Data source

Category lists combine GitHub search queries, repository topics, descriptions, and sync snapshots.

Ranking logic

Projects are filtered for category relevance, then ordered by stars and quality signals.

Best for

Use category pages when you already know the AI workflow or tool type you want to evaluate.

27 projects

#1
llama.cppggml-org/llama.cpp47

LLM inference in C/C++

Infra
Stars
118,502
Growth
-
Language
C++
Created
2023-03-10
#2
gpt4allnomic-ai/gpt4all42

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

Infra
Stars
77,379
Growth
-
Language
C++
Created
2023-03-27
#3
whisper.cppggml-org/whisper.cpp44

Port of OpenAI's Whisper model in C/C++

SpeechInfra
Stars
51,118
Growth
-
Language
C++
Created
2022-09-25
#4
rayray-project/ray43

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Infra
Stars
43,044
Growth
-
Language
Python
Created
2016-10-25
#5
ChatTTS2noise/ChatTTS36

A generative speech model for daily dialogue.

SpeechInfra
Stars
39,520
Growth
-
Language
Python
Created
2024-05-27
#6
mediapipegoogle-ai-edge/mediapipe45

Cross-platform, customizable ML solutions for live and streaming media.

VideoInfra
Stars
35,867
Growth
-
Language
C++
Created
2019-06-13
#7
qdrantqdrant/qdrant41

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

SearchRAGInfra
Stars
32,748
Growth
-
Language
Rust
Created
2020-05-30
#8
gitleaksgitleaks/gitleaks41

Find secrets with Gitleaks 🔑

Infra
Stars
27,918
Growth
-
Language
Go
Created
2018-01-27
#9
llm-actionliguodongiot/llm-action43

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

Infra
Stars
24,612
Growth
-
Language
HTML
Created
2023-05-23
#10
free-llm-api-resourcescheahjs/free-llm-api-resources37

A list of free LLM inference resources accessible via API.

Infra
Stars
24,349
Growth
-
Language
Python
Created
2024-07-04
#11
faster-whisperSYSTRAN/faster-whisper35

Faster Whisper transcription with CTranslate2

SpeechInfra
Stars
23,905
Growth
-
Language
Python
Created
2023-02-11
#12
agents-towards-productionNirDiamant/agents-towards-production36

End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.

AgentsRAGInfra
Stars
20,873
Growth
-
Language
Jupyter Notebook
Created
2025-06-16
#13
web-llmmlc-ai/web-llm39

High-performance In-browser LLM Inference Engine

Infra
Stars
18,280
Growth
-
Language
TypeScript
Created
2023-04-13
#14
agent-lightningmicrosoft/agent-lightning35

The absolute trainer to light up AI agents.

AgentsInfra
Stars
17,355
Growth
-
Language
Python
Created
2025-06-18
#15
ktransformerskvcache-ai/ktransformers39

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Infra
Stars
17,344
Growth
-
Language
Python
Created
2024-07-26
#16
omlxjundot/omlx39

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Infra
Stars
17,216
Growth
-
Language
Python
Created
2026-02-13
#17
weaviateweaviate/weaviate39

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

SearchRAGInfra
Stars
16,454
Growth
-
Language
Go
Created
2016-03-30
#18
dagsterdagster-io/dagster36

An orchestration platform for the development, production, and observation of data assets.

AutomationInfra
Stars
15,763
Growth
-
Language
Python
Created
2018-04-30
#19
litgptLightning-AI/litgpt40

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Infra
Stars
13,447
Growth
-
Language
Python
Created
2023-05-04
#20
OpenLLMbentoml/OpenLLM40

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Infra
Stars
12,378
Growth
-
Language
Python
Created
2023-04-19
#21
mistral-inferencemistralai/mistral-inference40

Official inference library for Mistral models

Infra
Stars
10,824
Growth
-
Language
Jupyter Notebook
Created
2023-09-27
#22
openvinoopenvinotoolkit/openvino38

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

ImageInfraSpeech
Stars
10,447
Growth
-
Language
C++
Created
2018-10-15
#23
LMCacheLMCache/LMCache61

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

Infra
Stars
9,667
Growth
-
Language
Python
Created
2024-05-28
#24
whichllmAndyyyy64/whichllm84

Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly.

Infra
Stars
4,313
Growth
-
Language
Python
Created
2026-03-04
#25
DreamServerLight-Heart-Labs/DreamServer62

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

AgentsRAGImage
Stars
1,262
Growth
-
Language
Python
Created
2026-02-09
#26
minimax-m3-desktop-app-free-apizyn26/minimax-m3-desktop-app-free-api39

minimax m3 free model ai model free api large language model llm 1m context window sparse attention msa architecture native multimodality computer use computer control autonomous coding assistant agentic ai framework swe bench pro software engineering workflow automation multi agent hugging face api access local llm inference

AgentsCodingInfra
Stars
131
Growth
-
Language
Go
Created
2026-06-19
#27
Awesome-Datasets-Hubahammadmejbah/Awesome-Datasets-Hub36

A curated collection of datasets for Large Language Models (LLMs), covering medical AI, NLP, multimodal learning, instruction tuning, reasoning, code generation, and evaluation benchmarks.

CodingInfra
Stars
122
Growth
-
Language
Unknown
Created
2026-05-15
All projects loaded