All Voice AI Tools

8,165 tools ranked by quality score

Showing 1–100 of 8,165

#	Tool	Score	Tier	Category	Stars	Language
1	k2-fsa/sherpa-onnx Speech-to-text, text-to-speech, speaker diarization, speech enhancement,...	88	Verified	vosk-asr-implementations	10,885	C++
2	Uberi/speech_recognition Speech recognition module for Python, supporting several engines and APIs,...	85	Verified	automatic-speech-recognition	8,959	Python
3	TalAter/annyang 💬 Speech recognition for your site	84	Verified	web-speech-api-libraries	6,667	TypeScript
4	espnet/espnet End-to-End Speech Processing Toolkit	83	Verified	speaker-diarization-embedding	9,768	Python
5	Blaizzy/mlx-audio A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)...	80	Verified	text-to-speech-tts	6,227	Python
6	m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)	80	Verified	whisper-diarization	20,758	Python
7	elevenlabs/elevenlabs-python The official Python SDK for the ElevenLabs API.	79	Verified	ai-workflow-automation	2,887	Python
8	rapidaai/voice-ai Rapida is an open-source, end-to-end voice AI orchestration platform for...	76	Verified	voice-agent-applications	686	Go
9	DrewThomasson/ebook2audiobook Generate audiobooks from e-books, voice cloning & 1158+ languages!	76	Verified	ebook-to-audiobook-conversion	18,503	Python
10	OpenBMB/VoxCPM VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and...	75	Verified	voice-cloning-tools	6,143	Python
11	PaddlePaddle/PaddleSpeech Easy-to-use Speech Toolkit including Self-Supervised Learning model,...	74	Verified	funasr-speech-recognition	12,556	Python
12	jdepoix/youtube-transcript-api This is a python API which allows you to get the transcript/subtitles for a...	73	Verified	video-transcription-extraction	7,078	Python
13	salute-developers/GigaAM Foundational Model for Speech Recognition Tasks	73	Verified	speech-emotion-recognition	504	Python
14	espeak-ng/espeak-ng eSpeak NG is an open source speech synthesizer that supports more than...	73	Verified	espeak-ng-ecosystem	6,250	C
15	met4citizen/TalkingHead Talking Head (3D): A JavaScript class for real-time lip-sync using full-body...	73	Verified	ai-avatar-platforms	1,101	JavaScript
16	ggml-org/whisper.cpp Port of OpenAI's Whisper model in C/C++	72	Verified	whisper-framework-ports	47,665	C++
17	jianchang512/pyvideotrans Translate the video from one language to another and embed dubbing & subtitles.	72	Verified	video-dubbing-tools	16,496	Python
18	nateshmbhat/pyttsx3 Offline Text To Speech synthesis for python	72	Verified	lightweight-tts-libraries	2,493	Python
19	KoljaB/RealtimeTTS Converts text to speech in realtime	71	Verified	lightweight-tts-libraries	3,800	Python
20	cmusphinx/pocketsphinx A small speech recognizer	71	Verified	automatic-speech-recognition	4,278	C
21	alphacep/vosk-api Offline speech recognition API for Android, iOS, Raspberry Pi and servers...	71	Verified	text-to-speech-conversion	14,377	Jupyter Notebook
22	FluidInference/FluidAudio Frontier CoreML audio models in your apps — text-to-speech, speech-to-text,...	71	Verified	ios-speech-frameworks	1,689	Swift
23	devnen/Chatterbox-TTS-Server Self-host the powerful Chatterbox TTS model. This server offers a...	70	Verified	self-hosted-tts-servers	1,101	Python
24	pnnbao97/VieNeu-TTS Vietnamese TTS with instant voice cloning • On-device • Real-time CPU...	70	Verified	voice-cloning-synthesis	894	Python
25	descriptinc/descript-audio-codec State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz,...	69	Established	audio-noise-reduction	1,732	Python
26	mozilla-ai/document-to-podcast Blueprint by Mozilla.ai for generating podcasts from documents using local AI	69	Established	content-to-podcast-converters	173	Python
27	lucidrains/HS-TasNet Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation...	69	Established	audio-source-separation	86	Python
28	readest/readest Readest is a modern, feature-rich ebook reader designed for avid readers...	69	Established	ai-powered-ereaders	18,791	TypeScript
29	livekit/livekit End-to-end realtime stack for connecting humans and AI	69	Established	ai-avatar-platforms	17,671	Go
30	EDCD/EDDI Companion application for Elite Dangerous	69	Established	voice-controlled-robotics	520	C#
31	k2-fsa/sherpa Speech-to-text server framework with next-gen Kaldi	69	Established	funasr-speech-recognition	896	C++
32	IAHispano/Applio A simple, high-quality voice conversion tool focused on ease of use and performance.	69	Established	voice-cloning-tools	3,070	Python
33	pndurette/gTTS Python library and CLI tool to interface with Google Translate's text-to-speech API	68	Established	lightweight-tts-libraries	2,594	Python
34	Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning	68	Established	funasr-speech-recognition	661	Python
35	diodiogod/TTS-Audio-Suite A ComfyUI custom node integration for multi-engine multi-language...	68	Established	comfyui-tts-nodes	774	Python
36	collabora/WhisperLive A nearly-live implementation of OpenAI's Whisper.	68	Established	speech-to-text-converters	3,894	Python
37	EDDiscovery/EDDiscovery Captains log and 3d star map for Elite Dangerous	68	Established	voice-controlled-robotics	880	C#
38	kxxt/aspeak A simple text-to-speech client for Azure TTS API.	68	Established	openai-tts-applications	500	Rust
39	Picovoice/rhino On-device Speech-to-Intent engine powered by deep learning	67	Established	speech-ai-coursework	698	Python
40	Vonage/vonage-php-sdk-core Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech,...	67	Established	sms-voice-integrations	928	PHP
41	meizhong986/WhisperJAV ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD....	67	Established	audio-transcription-tools	1,216	HTML
42	Kieirra/murmure Fully local, private and cross platform Speech-to-Text with LLM Post-processing	67	Established	speech-to-text-converters	585	TypeScript
43	thewh1teagle/kokoro-onnx TTS with kokoro and onnx runtime	67	Established	kokoro-tts-ecosystem	2,419	Python
44	cboard-org/cboard Augmentative and Alternative Communication (AAC) system with text-to-speech...	67	Established	react-native-voice-libraries	732	JavaScript
45	jamiepine/voicebox The open-source voice synthesis studio	67	Established	self-hosted-tts-servers	13,404	TypeScript
46	huggingface/speech-to-speech Build local voice agents with open-source models	67	Established	text-to-speech-conversion	4,541	Python
47	Picovoice/porcupine On-device wake word detection powered by deep learning	67	Established	wake-word-detection	4,743	Python
48	rany2/edge-tts Use Microsoft Edge's online text-to-speech service from Python WITHOUT...	66	Established	edge-tts-implementations	10,304	Python
49	mbailey/voicemode Natural (2-way) voice conversations with Claude Code	66	Established	text-to-speech-mcp	885	Python
50	speechmatics/speechmatics-python Python library and CLI for Speechmatics	66	Established	speech-recognition-apis	75	Python
51	thewh1teagle/sherpa-rs Rust bindings to https://github.com/k2-fsa/sherpa-onnx	66	Established	text-embedding-runtimes	302	Rust
52	lenML/Speech-AI-Forge 🍦 Speech-AI-Forge is a project developed around TTS generation model,...	65	Established	text-to-speech-tts	1,386	Python
53	SYSTRAN/faster-whisper Faster Whisper transcription with CTranslate2	65	Established	whisper-transcription-apps	21,444	Python
54	RHVoice/RHVoice a free and open source speech synthesizer for Russian and other languages	65	Established	espeak-ng-ecosystem	1,771	C++
55	foyoux/pygtrans 谷歌翻译, 支持 APIKEY 一口气翻译十万条	65	Established	speech-translation-apps	246	Python
56	software-mansion/react-native-executorch Declarative way to run AI models in React Native on device, powered by ExecuTorch.	65	Established	react-native-voice-libraries	1,284	C++
57	travisvn/chatterbox-tts-api Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling...	65	Established	voice-assistant-devices	554	Python
58	Softcatala/whisper-ctranslate2 Whisper command line client compatible with original OpenAI client based on...	64	Established	speech-to-text-converters	1,255	Python
59	Vonage/vonage-node-sdk Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech,...	64	Established	sms-voice-integrations	396	TypeScript
60	shibing624/parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine....	64	Established	parakeet-asr-implementations	526	Python
61	FunAudioLLM/CosyVoice Multi-lingual large voice generation model, providing inference, training...	64	Established	voice-assistant-devices	19,991	Python
62	pion/mediadevices Go implementation of the MediaDevices API.	64	Established	mediapipe-implementations	633	Go
63	jatinkrmalik/vocalinux Free, open-source, 100% offline voice dictation for Linux. Speak and type...	64	Established	voice-dictation-typing	188	Python
64	compulim/web-speech-cognitive-services Polyfill Web Speech API with Cognitive Services for both speech-to-text and...	64	Established	dotnet-tts-libraries	70	JavaScript
65	vilassn/whisper_android Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android	64	Established	whisper-framework-ports	630	C++
66	index-tts/index-tts An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System	63	Established	zero-shot-voice-synthesis	19,454	Python
67	yeyupiaoling/MASR Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2...	63	Established	text-to-speech-frameworks	724	Python
68	herimor/voxtream VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and...	63	Established	coqui-tts-applications	210	Python
69	rsxdalv/TTS-WebUI A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio,...	63	Established	text-to-speech	3,017	TypeScript
70	yeyupiaoling/PPASR 基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Confor...	63	Established	speaker-diarization-embedding	875	Python
71	khanld/chunkformer ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription	63	Established	conformer-asr-implementations	78	Python
72	santinic/audiblez Generate audiobooks from e-books	63	Established	ebook-to-audiobook-conversion	5,920	Python
73	ccoreilly/vosk-browser A speech recognition library running in the browser thanks to a WebAssembly...	63	Established	vosk-asr-implementations	507	JavaScript
74	denizsafak/abogen Generate audiobooks from EPUBs, PDFs and text with synchronized captions.	62	Established	ai-podcast-generation	4,194	Python
75	thewh1teagle/phonikud Hebrew grapheme to phoneme (G2P)	62	Established	grapheme-to-phoneme-conversion	91	Python
76	jamsch/expo-speech-recognition Speech Recognition for React Native Expo projects	62	Established	react-native-voice-libraries	566	TypeScript
77	tsmdt/whisply 💬 Fast, cross-platform CLI and GUI for batch transcription, translation,...	62	Established	whisper-diarization	108	Python
78	TensorSpeech/TensorFlowASR :zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in...	62	Established	end-to-end-asr-frameworks	1,005	Python
79	k2-fsa/sherpa-ncnn Real-time speech recognition and voice activity detection (VAD) using...	62	Established	ios-speech-frameworks	1,648	C++
80	supertone-inc/supertonic Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.	62	Established	lightweight-tts-runtimes	2,734	C++
81	Rei-x/discord-speech-recognition Speech to text extension for discord.js	62	Established	discord-tts-bots	62	TypeScript
82	tensorflow/lingvo Lingvo	62	Established	automatic-speech-recognition	2,857	Python
83	playht/pyht PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API	62	Established	text-to-speech	220	Python
84	kahne/fastwer A PyPI package for fast word/character error rate (WER/CER) calculation	62	Established	asr-evaluation-metrics	70	Python
85	FelippeChemello/podcast-maker Fully automated video maker using motion graphics and text-to-speech...	62	Established	ai-video-generation	672	TypeScript
86	fishaudio/fish-speech SOTA Open Source TTS	62	Established	text-to-speech-tts	26,613	Python
87	amicalhq/amical 🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no...	62	Established	local-voice-dictation	1,014	TypeScript
88	modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA...	62	Established	automatic-speech-recognition	15,283	Python
89	ieasybooks/tafrigh تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.	61	Established	whisper-subtitle-generation	141	Python
90	githubharald/CTCDecoder Connectionist Temporal Classification (CTC) decoding algorithms: best path,...	61	Established	ctc-asr-implementations	835	Python
91	Azure-Samples/Cognitive-Speech-TTS Microsoft Text-to-Speech API sample code in several languages, part of...	61	Established	dotnet-tts-libraries	1,004	C#
92	gunthercox/chatterbot-voice A example of verbal communication using ChatterBot	61	Established	voice-chatbot-applications	112	—
93	gradio-app/fastrtc The python library for real-time communication	61	Established	ai-assistant-platforms	4,547	JavaScript
94	pavelzbornik/whisperX-FastAPI FastAPI service on top of WhisperX	61	Established	speech-to-text-converters	174	Python
95	travisvn/edge-tts-universal Use Microsoft Edge's online text-to-speech service in Node.js, browsers, or...	61	Established	edge-tts-implementations	59	TypeScript
96	analyticsinmotion/werpy 🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error...	61	Established	asr-evaluation-metrics	23	Python
97	speechbrain/speechbrain A PyTorch-based Speech Toolkit	61	Established	wav2vec2-speech-recognition	11,311	Python
98	dangvansam/viet-asr VietASR - Vietnamese Automatic Speech Recognition	61	Established	end-to-end-asr-frameworks	165	Python
99	janvarev/Irene-Voice-Assistant Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы...	61	Established	general-purpose-voice-assistants	1,113	Python
100	fgnt/meeteval MeetEval - A meeting transcription evaluation toolkit	61	Established	asr-evaluation-metrics	149	Python

1 2 3 … … 80 81 82 Next »