💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minecraft, Factorio playing. Web / macOS / Windows supported.

Voice

Stars: 40,427
Growth: -
Language: TypeScript
Created: 2024-12-01

ChatTTS 2noise/ChatTTS 36

A generative speech model for daily dialogue.

Voice, Infrastructure

Stars: 39,520
Growth: -
Language: Python
Created: 2024-05-27

#10

MockingBird babysor/MockingBird 33

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Voice

Stars: 36,908
Growth: -
Language: Python
Created: 2021-08-07

#11

OpenVoice myshell-ai/OpenVoice 38

Instant voice cloning by MIT and MyShell. Audio foundation model.

Voice

Stars: 36,804
Growth: -
Language: Python
Created: 2023-11-29

#12

khoj khoj-ai/khoj 39

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Image, RAG, Search

Stars: 35,380
Growth: -
Language: Python
Created: 2021-08-16

#13

VoxCPM OpenBMB/VoxCPM 42

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Voice

Stars: 32,024
Growth: -
Language: Python
Created: 2025-09-16

#14

free-claude-code Alishahryar1/free-claude-code 82

Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

Coding, Voice

Stars: 30,078
Growth: -
Language: Python
Created: 2026-01-28

#15

faster-whisper SYSTRAN/faster-whisper 35

Faster Whisper transcription with CTranslate2

Voice, Infrastructure

Stars: 23,905
Growth: -
Language: Python
Created: 2023-02-11

#16

Pixelle-Video AIDC-AI/Pixelle-Video 42

🚀 AI Fully Automated Short Video Engine

Image, Video, Voice

Stars: 23,760
Growth: -
Language: Python
Created: 2025-11-07

#17

whisperX m-bain/whisperX 42

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Voice

Stars: 22,761
Growth: -
Language: Python
Created: 2022-12-09

#18

CosyVoice FunAudioLLM/CosyVoice 36

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Voice

Stars: 21,876
Growth: -
Language: Python
Created: 2024-07-03

#19

index-tts index-tts/index-tts 35

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Voice

Stars: 21,480
Growth: -
Language: Python
Created: 2025-02-06

#20

buzz chidiwilliams/buzz 42

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Voice

Stars: 19,871
Growth: -
Language: Python
Created: 2022-09-24

#21

dia nari-labs/dia 36

A TTS model capable of generating ultra-realistic dialogue in one pass.

Voice

Stars: 19,326
Growth: -
Language: Python
Created: 2025-04-19

#22

FunASR modelscope/FunASR 43

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Voice, MCP

Stars: 18,673
Growth: -
Language: Python
Created: 2022-11-24

#23

pyvideotrans jianchang512/pyvideotrans 40

Translate the video from one language to another and embed dubbing & subtitles.

Voice

Stars: 18,131
Growth: -
Language: Python
Created: 2023-10-02

#24

Speech NVIDIA-NeMo/Speech 43

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Voice

Stars: 17,634
Growth: -
Language: Python
Created: 2019-08-05

#25

NeMo NVIDIA-NeMo/NeMo 43

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Voice

Stars: 17,560
Growth: -
Language: Python
Created: 2019-08-05

#26

leon leon-ai/leon 41

🧠 Leon is your open-source personal assistant.

Agent, Voice, Automation

Stars: 17,344
Growth: -
Language: TypeScript
Created: 2019-02-10

#27

vosk-api alphacep/vosk-api 36

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Voice

Stars: 14,887
Growth: -
Language: Jupyter Notebook
Created: 2019-09-03

#28

sherpa-onnx k2-fsa/sherpa-onnx 38

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

Voice

Stars: 13,237
Growth: -
Language: C++
Created: 2022-09-01

#29

meetily Zackriya-Solutions/meetily 38

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.

Voice

Stars: 12,943
Growth: -
Language: Rust
Created: 2024-12-26

#30

supertonic supertone-inc/supertonic 37

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

Voice

Stars: 12,789
Growth: -
Language: Swift
Created: 2025-11-18
