All Voice AI Tools

8,165 tools ranked by quality score · Page 2 of 82

Showing 101–200 of 8,165

#	Tool	Score	Tier	Category	Stars	Language
101	daanzu/kaldi-active-grammar Python Kaldi speech recognition with grammars that can be set...	61	Established	kaldi-asr-ecosystem	347	Python
102	roryeckel/wyoming_openai OpenAI-Compatible Proxy Middleware for the Wyoming Protocol	61	Established	lightweight-tts-runtimes	150	Python
103	kishanrajput23/Jarvis-Desktop-Voice-Assistant A python based desktop voice assistant capable of executing system-level...	61	Established	python-voice-assistants	589	Python
104	sandrohanea/whisper.net Whisper.net. Speech to text made simple using Whisper Models	61	Established	whisper-framework-ports	894	C#
105	ChetanXpro/nodejs-whisper NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as...	61	Established	whisper-framework-ports	201	TypeScript
106	royshil/obs-localvocal OBS plugin for local speech recognition and captioning using AI	61	Established	speech-to-text-converters	1,412	C++
107	NVIDIA-AI-Blueprints/pdf-to-podcast Transform PDFs into AI podcasts for engaging on-the-go audio content.	61	Established	pdf-to-audio-conversion	803	Python
108	nazdridoy/kokoro-tts A CLI text-to-speech tool using the Kokoro model, supporting multiple...	61	Established	kokoro-tts-ecosystem	1,296	Python
109	PyThaiNLP/PyThaiTTS Open Source Thai Text-to-speech library in Python	61	Established	lightweight-tts-runtimes	58	Jupyter Notebook
110	zuoban/tts TTS service	61	Established	system-tts-wrappers	602	TypeScript
111	githubharald/CTCWordBeamSearch Connectionist Temporal Classification (CTC) decoder with dictionary and...	61	Established	ctc-asr-implementations	577	C++
112	charleprr/redditube A video generator from Reddit posts and comments	61	Established	ai-video-generation	62	JavaScript
113	Picovoice/web-voice-processor A library for real-time voice processing in web browsers	60	Established	web-speech-api-libraries	239	TypeScript
114	snakers4/silero-models Silero Models: pre-trained text-to-speech models made embarrassingly simple	60	Established	gradio-tts-webuis	5,822	Jupyter Notebook
115	deepgram/deepgram-python-sdk Official Python SDK for Deepgram.	60	Established	voice-ai-sdks	406	Python
116	Wikidepia/g2p-id Indonesian Grapheme-to-Phoneme (IPA notation)	60	Established	grapheme-to-phoneme-conversion	43	Python
117	sdkcarlos/artyom.js A voice control - voice commands - speech recognition and speech synthesis...	60	Established	web-speech-api-libraries	1,268	JavaScript
118	JamesBrill/react-speech-recognition 💬Speech recognition for your React app	60	Established	react-speech-recognition	835	JavaScript
119	lugia19/elevenlabslib Full python wrapper for the elevenlabs API.	60	Established	elevenlabs-integrations	158	Python
120	OpenVoiceOS/ovos-tts-server simple flask server to host OpenVoiceOS tts plugins as a service	60	Established	espeak-ng-ecosystem	15	Python
121	yandexdataschool/speech_course YSDA course in Speech Processing.	60	Established	speech-ai-coursework	319	Jupyter Notebook
122	mkiol/dsnote Speech Note Linux app. Note taking, reading and translating with offline...	60	Established	voice-dictation-typing	1,395	C++
123	morganney/tts-react Convert text to speech using React.	60	Established	aws-polly-tts	67	TypeScript
124	Vonage/vonage-ruby-sdk Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech,...	60	Established	sms-voice-integrations	220	Ruby
125	PyThaiNLP/pythaiasr Python Thai Automatic Speech Recognition	60	Established	automatic-speech-recognition	77	Python
126	daswer123/xtts-api-server A simple FastAPI Server to run XTTSv2	60	Established	self-hosted-tts-servers	577	Python
127	revdotcom/revai-node-sdk Node.js SDK for the Rev AI API	60	Established	google-tts-libraries	21	TypeScript
128	TensorSpeech/TensorFlowTTS 😝 TensorFlowTTS: Real-Time State-of-the-art...	60	Established	fastspeech-tts-models	3,995	Python
129	istupakov/onnx-asr A lightweight Python package for Automatic Speech Recognition using ONNX models	60	Established	automatic-speech-recognition	281	Python
130	MycroftAI/mycroft-precise A lightweight, simple-to-use, RNN wake word listener	60	Established	wake-word-detection	959	Python
131	Spr-Aachen/Easy-Voice-Toolkit A user-friendly audio toolkit for voice recognition, voice transcription,...	60	Established	voice-ai-learning-collections	875	Python
132	itsmevictor/clean-transcribe A simple CLI to transcribe Youtube videos or local audio/video files and...	59	Established	audio-transcription-tools	23	Python
133	OpenVoiceOS/ovos-tts-plugin-espeakNG espeakNG plugin	59	Established	espeak-ng-ecosystem	2	Python
134	n1teshy/yapper-tts offline text to speech and free SOTA LLM APIs to let your programs speak to you	59	Established	lightweight-tts-libraries	46	Python
135	Ailln/cn2an 📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)	59	Established	lightweight-tts-runtimes	758	Python
136	shivammehta25/Matcha-TTS [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching	59	Established	text-to-speech-frameworks	1,259	Jupyter Notebook
137	mdiller/MangoByte A discord bot that provides the ability to play dota hero response clips, do...	59	Established	discord-tts-bots	93	Python
138	CorentinJ/Real-Time-Voice-Cloning Clone a voice in 5 seconds to generate arbitrary speech in real-time	59	Established	voice-cloning-synthesis	59,518	Python
139	deepgram/deepgram-js-sdk Official JavaScript SDK for Deepgram.	59	Established	deepgram-starter-projects	248	TypeScript
140	ken107/read-aloud An awesome browser extension that reads aloud webpage content with one click	59	Established	browser-tts-extensions	1,639	JavaScript
141	phuc-nt/my-translator Real-time speech translation — macOS & Windows, free TTS, no server, your...	59	Established	ios-speech-frameworks	308	JavaScript
142	mybigday/whisper.rn React Native binding of whisper.cpp.	59	Established	whisper-framework-ports	749	C++
143	kstonekuan/tambourine-voice Your personal voice interface for any app. Speak naturally and your words...	59	Established	local-voice-dictation	313	Rust
144	pilot51/voicenotify Android app that speaks notifications	59	Established	android-speech-apps	218	Kotlin
145	linto-ai/WebVoiceSDK Building blocks for voice-enabled applications in the browser	59	Established	text-to-speech-conversion	38	JavaScript
146	p0n1/epub_to_audiobook EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included	59	Established	ai-podcast-generation	1,921	Python
147	coqui-ai/TTS 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research...	59	Established	text-to-speech-frameworks	44,801	Python
148	Enemyx-net/VibeVoice-ComfyUI A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech...	58	Established	comfyui-tts-nodes	1,391	Python
149	aichaos/rivescript-python A RiveScript interpreter for Python. RiveScript is a scripting language for...	58	Established	discord-ai-chatbots	157	Python
150	tabahi/bournemouth-forced-aligner Extract phoneme-level timestamps from speech audio.	58	Established	asr-evaluation-metrics	121	Python
151	linto-ai/whisper-timestamped Multilingual Automatic Speech Recognition with word-level timestamps and confidence	58	Established	whisper-speech-transcription	2,778	Python
152	thevickypedia/Jarvis Fully Functional Voice Based Natural Language UI	58	Established	python-voice-assistants	232	Python
153	babysor/MockingBird 🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time	58	Established	voice-cloning-synthesis	36,874	Python
154	vivekuppal/transcribe Transcribe is a real time transcription, conversation, Language learning...	58	Established	audio-transcription-tools	250	Python
155	DigitalPhonetics/IMS-Toucan Controllable and fast Text-to-Speech for over 7000 languages!	58	Established	text-to-speech-frameworks	2,190	Python
156	gooofy/py-kaldi-asr Some simple wrappers around kaldi-asr intended to make using kaldi's...	58	Established	kaldi-asr-ecosystem	170	C++
157	gabrielmittag/NISQA NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment	58	Established	text-to-speech-frameworks	917	Python
158	davidacm/NVDA-IBMTTS-Driver This project is aimed at developing and maintaining the NVDA IBMTTS driver....	58	Established	piper-tts-ecosystem	71	Python
159	richardr1126/openreader An open-source read-along document reader server with high-quality TTS...	58	Established	ai-powered-ereaders	292	TypeScript
160	dictation-toolbox/dragonfly Speech recognition framework allowing powerful Python-based scripting and...	58	Established	automatic-speech-recognition	411	Python
161	altunenes/parakeet-rs very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA...	58	Established	parakeet-asr-implementations	227	Rust
162	alphacep/vosk VOSK Speech Recognition Toolkit	58	Established	vosk-asr-implementations	493	C
163	moonstar-x/discord-tts-bot A Text-to-Speech bot for Discord.	58	Established	discord-tts-bots	102	JavaScript
164	argmaxinc/WhisperKit On-device Speech Recognition for Apple Silicon	58	Established	whisper-speech-transcription	5,775	Swift
165	fishaudio/fish-audio-python The official Python library for the Fish Audio API.	58	Established	openai-tts-applications	151	Python
166	r9y9/nnmnkwii Library to build speech synthesis systems designed for easy and fast prototyping.	58	Established	voice-cloning-synthesis	399	Python
167	fishaudio/Bert-VITS2 vits2 backbone with multilingual-bert	58	Established	voice-assistant-devices	8,707	Python
168	MainRo/deepspeech-server A testing server for a speech to text service based on coqui.ai	58	Established	parakeet-asr-implementations	219	Python
169	ManimCommunity/manim-voiceover Manim plugin for all things voiceover	58	Established	ai-video-generation	280	Python
170	wenet-e2e/wenet Production First and Production Ready End-to-End Speech Recognition Toolkit	57	Established	end-to-end-asr-frameworks	5,056	Python
171	kurianbenoy/whisper_normalizer A python package for whisper normalizer	57	Established	speech-to-text-converters	76	Jupyter Notebook
172	capacitor-community/text-to-speech ⚡️ Capacitor plugin for synthesizing speech from text.	57	Established	web-speech-api-libraries	123	Java
173	FirezTheGreat/1SHOT All my works - https://github.com/FirezTheGreat (latest music commands/djs...	57	Established	discord-tts-bots	84	JavaScript
174	kalliope-project/kalliope Kalliope is a framework that will help you to create your own personal assistant.	57	Established	python-voice-assistants	1,754	Python
175	jim60105/docker-whisperX Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level...	57	Established	whisper-diarization	422	Dockerfile
176	dectalk/dectalk Modern builds for the 90s/00s DECtalk text-to-speech application.	57	Established	dotnet-tts-libraries	418	PostScript
177	Picovoice/speech-to-text-benchmark speech to text benchmark framework	57	Established	text-to-speech-conversion	683	Python
178	nttcslab-sp/kaldiio A pure python module for reading and writing kaldi ark files	57	Established	kaldi-asr-ecosystem	268	Python
179	i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7 Taiwanese language tools	57	Established	lightweight-tts-runtimes	144	Python
180	dlutton/flutter_tts Flutter Text to Speech package	57	Established	educational-voice-apps	732	Dart
181	petercunha/tts 📝 🔉 A simple text-to-speech tool. Converts your text to speech...	57	Established	aws-polly-tts	171	JavaScript
182	alphacep/vosk-android-demo Offline speech recognition for Android with Vosk library.	57	Established	java-tts-libraries	1,023	Java
183	pnlpal/dictionariez 📚 A customizable dictionary extension that supports double-click lookups in...	57	Established	ai-powered-ereaders	635	JavaScript
184	ai-ng/swift Fast voice assistant powered by Groq, Cartesia, and Vercel.	57	Established	conversational-chatbot-applications	590	TypeScript
185	wq2012/SimpleDER A lightweight library to compute Diarization Error Rate (DER).	57	Established	asr-evaluation-metrics	62	Python
186	asterics/Asterics-AAC Free, easy-to-use AAC app with offline support, flexible input options,...	57	Established	android-voice-assistants	106	JavaScript
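187	openctp/openctp openctp provides CTP stock options, Zhongtai Securities XTP, Huaxin Securities Qidian TORA, Orient Securities OST, East Money Securities EMT, Interactive Brokers TWS, Esunny TAP, Liangtou QDP, and other channels...	57	Established	system-tts-wrappers	2,715	C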
188	sfortis/openai_tts Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine...	57	Established	voice-assistant-devices	181	Python
189	BryceWG/BiBi-Keyboard BiBi Keyboard (说点啥): a Kotlin-based LLM and ASR voice input keyboard app for Android. An LLM ASR...	57	Established	audio-transcription-tools	535	Kotlin
190	R3gm/SoniTranslate Synchronized Translation for Videos. Video dubbing	57	Established	video-dubbing-tools	1,341	Python
191	midas-research/audino Open source audio annotation tool for humans	57	Established	data-annotation-tools	1,131	TypeScript
192	hkchengrex/MMAudio [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality...	57	Established	vision-language-models	2,115	Python
193	OpenMOSS/MOSS-TTSD MOSS-TTSD is a spoken dialogue generation model designed for expressive...	57	Established	voice-assistant-devices	1,202	Python
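194	yeyupiaoling/PaddlePaddle-DeepSpeech Speech recognition implemented with PaddlePaddle, targeting Chinese. The project is mature and recognizes speech well. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.	57	Established	speaker-diarization-embedding	758	Python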
195	pykaldi/pykaldi A Python wrapper for Kaldi	57	Established	kaldi-asr-ecosystem	1,030	Python
196	sindresorhus/awesome-whisper 🔊 Awesome list for Whisper — an open-source AI-powered speech recognition...	56	Established	audio-transcription-tools	2,219	—
197	sidharthrajaram/StyleTTS2 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and...	56	Established	text-to-speech-tts	161	Python
198	agentvoiceresponse/avr-infra The AVR Infrastructure project is designed to launch the Agent Voice...	56	Established	deepgram-starter-projects	83	—
199	pot-app/pot-desktop 🌈 A cross-platform software for text translation and recognition (select-to-translate and OCR).	56	Established	ios-speech-frameworks	17,383	JavaScript
200	yeyupiaoling/Whisper-Finetune Fine-tune the Whisper speech recognition model to support training without...	56	Established	whisper-speech-transcription	1,200	C
