AI Explorer
Infrastructure
LLM inference, model serving, and AI infrastructure projects.
Data source
Category lists combine GitHub search queries, repository topics, descriptions, and sync snapshots.
Ranking logic
Projects are filtered for category relevance, then ordered by stars and quality signals.
Best for
Use category pages when you already know the AI workflow or tool type you want to evaluate.
27 projects
LLM inference in C/C++
- Stars
- 118,502
- Growth
- -
- Language
- C++
- Created
- 2023-03-10
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
- Stars
- 77,379
- Growth
- -
- Language
- C++
- Created
- 2023-03-27
Port of OpenAI's Whisper model in C/C++
- Stars
- 51,118
- Growth
- -
- Language
- C++
- Created
- 2022-09-25
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
- Stars
- 43,044
- Growth
- -
- Language
- Python
- Created
- 2016-10-25
A generative speech model for daily dialogue.
- Stars
- 39,520
- Growth
- -
- Language
- Python
- Created
- 2024-05-27
Cross-platform, customizable ML solutions for live and streaming media.
- Stars
- 35,867
- Growth
- -
- Language
- C++
- Created
- 2019-06-13
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
- Stars
- 32,748
- Growth
- -
- Language
- Rust
- Created
- 2020-05-30
Find secrets with Gitleaks 🔑
- Stars
- 27,918
- Growth
- -
- Language
- Go
- Created
- 2018-01-27
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
- Stars
- 24,612
- Growth
- -
- Language
- HTML
- Created
- 2023-05-23
A list of free LLM inference resources accessible via API.
- Stars
- 24,349
- Growth
- -
- Language
- Python
- Created
- 2024-07-04
Faster Whisper transcription with CTranslate2
- Stars
- 23,905
- Growth
- -
- Language
- Python
- Created
- 2023-02-11
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
- Stars
- 20,873
- Growth
- -
- Language
- Jupyter Notebook
- Created
- 2025-06-16
High-performance In-browser LLM Inference Engine
- Stars
- 18,280
- Growth
- -
- Language
- TypeScript
- Created
- 2023-04-13
The absolute trainer to light up AI agents.
- Stars
- 17,355
- Growth
- -
- Language
- Python
- Created
- 2025-06-18
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
- Stars
- 17,344
- Growth
- -
- Language
- Python
- Created
- 2024-07-26
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
- Stars
- 17,216
- Growth
- -
- Language
- Python
- Created
- 2026-02-13
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
- Stars
- 16,454
- Growth
- -
- Language
- Go
- Created
- 2016-03-30
An orchestration platform for the development, production, and observation of data assets.
- Stars
- 15,763
- Growth
- -
- Language
- Python
- Created
- 2018-04-30
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
- Stars
- 13,447
- Growth
- -
- Language
- Python
- Created
- 2023-05-04
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
- Stars
- 12,378
- Growth
- -
- Language
- Python
- Created
- 2023-04-19
Official inference library for Mistral models
- Stars
- 10,824
- Growth
- -
- Language
- Jupyter Notebook
- Created
- 2023-09-27
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
- Stars
- 10,447
- Growth
- -
- Language
- C++
- Created
- 2018-10-15
LMCache: Supercharge Your LLM with the Fastest KV Cache Layer
- Stars
- 9,667
- Growth
- -
- Language
- Python
- Created
- 2024-05-28
Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly.
- Stars
- 4,313
- Growth
- -
- Language
- Python
- Created
- 2026-03-04
Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
- Stars
- 1,262
- Growth
- -
- Language
- Python
- Created
- 2026-02-09
minimax m3 free model ai model free api large language model llm 1m context window sparse attention msa architecture native multimodality computer use computer control autonomous coding assistant agentic ai framework swe bench pro software engineering workflow automation multi agent hugging face api access local llm inference
- Stars
- 131
- Growth
- -
- Language
- Go
- Created
- 2026-06-19
A curated collection of datasets for Large Language Models (LLMs), covering medical AI, NLP, multimodal learning, instruction tuning, reasoning, code generation, and evaluation benchmarks.
- Stars
- 122
- Growth
- -
- Language
- Unknown
- Created
- 2026-05-15