All Voice AI Tools

8,165 tools ranked by quality score · Page 4 of 82

Showing 301–400 of 8,165

#	Tool	Score	Tier	Category	Stars	Language
301	taigrr/elevenlabs ElevenLabs Artificial Voice Synthesis Client	53	Established	elevenlabs-integrations	64	Go
302	kaldi-asr/kaldi kaldi-asr/kaldi is the official location of the Kaldi project.	53	Established	kaldi-asr-ecosystem	15,346	Shell
303	deepgram-starters/node-transcription Get started using Deepgram's Transcription with this Node demo app	53	Established	deepgram-starter-projects	33	JavaScript
304	Agents365-ai/video-podcast-maker AI-powered video podcast creation skill for coding agents. Supports Bilibili...	53	Established	tts	350	TypeScript
305	EveryVoiceTTS/EveryVoice The EveryVoice TTS Toolkit - Text To Speech for your language	53	Established	coqui-tts-applications	43	Python
306	aedocw/epub2tts Turn an epub or text file into an audiobook	53	Established	text-to-speech	903	Python
307	BolajiAyodeji/chat-with-siri 🤖 A text-to-speech chatbot built using Nextjs, OpenAI, and ElevenLabs.	53	Established	voice-command-assistants	25	TypeScript
308	BoltzmannEntropy/MimikaStudio MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic...	53	Established	qwen3-tts-applications	357	Dart
309	deepgram-starters/node-voice-agent Get started using Deepgram's Voice Agent with this Node demo app	53	Established	deepgram-starter-projects	31	JavaScript
310	yanorei32/discord-tts TTS Discord Bot [VOICEROID, VOICEVOX, AivisSpeech, kttsproject, WinRT, and...	53	Established	discord-tts-bots	16	Rust
311	nl8590687/ASRT_SpeechRecognition A Deep-Learning-Based Chinese Speech Recognition System	53	Established	ctc-asr-implementations	8,359	Python
312	PaciStardust/HOSCY Companion for OSC and Communication	53	Established	dotnet-tts-libraries	37	C#
313	unilight/seq2seq-vc A sequence-to-sequence voice conversion toolkit.	53	Established	zero-shot-voice-synthesis	108	Jupyter Notebook
314	Macoron/whisper.unity Running speech to text model (whisper.cpp) in Unity3d on your local machine.	53	Established	whisper-framework-ports	704	C#
315	echogarden-project/echogarden Cross-platform speech toolset, used from the command-line or as a Node.js...	53	Established	google-tts-libraries	439	TypeScript
316	ciffelia/koe Discord read-aloud bot	53	Established	discord-tts-bots	43	Rust
317	primepake/wav2lip_288x288 Wav2Lip version 288 and pipeline to train	53	Established	lip-reading-synthesis	642	Python
318	Weilbyte/tiktok-tts Generate TikTok Text-to-Speech voices in your browser	52	Established	telegram-voice-transcription	419	JavaScript
319	abus-aikorea/voice-pro Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...	52	Established	gradio-tts-webuis	6,366	Python
320	TuananhCR/Dia-Finetuning-Vietnamese TTS Dia finetuning for Vietnamese	52	Established	tts-model-finetuning	125	Python
321	adrianlyjak/obsidian-aloud-tts Obsidian TTS Plugin	52	Established	edge-tts-implementations	80	TypeScript
322	deepgram-devs/nextjs-text-to-speech Get started using Deepgram's Text-to-Speech with this Next.js demo app	52	Established	deepgram-starter-projects	24	TypeScript
323	PrzemyslawSwiderski/python-gradle-plugin Gradle plugin to run Python projects.	52	Established	voice-ai-learning-collections	22	Kotlin
324	jonatasgrosman/huggingsound HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools	52	Established	wav2vec2-speech-recognition	470	Python
325	FENRlR/MB-iSTFT-VITS2 Application of MB-iSTFT-VITS components to vits2_pytorch	52	Established	vits-tts-implementations	134	Python
326	mathigatti/midi2voice Singing synthesis from MIDI file	52	Established	espeak-ng-ecosystem	284	Python
327	HeyWillow/willow Open source, local, and self-hosted Amazon Echo/Google Home competitive...	52	Established	voice-assistant-applications	2,987	C
328	robdmac/talkito TalkiTo lets developers interact with AI systems through speech across...	52	Established	text-to-speech-mcp	54	Python
329	scarletcho/KoLM Korean text normalization and language preparation package for LM in...	52	Established	kaldi-asr-ecosystem	63	Python
330	misyaguziya/VRCT VRCT(VRChat Chatbox Translator & Transcription)	52	Established	dotnet-tts-libraries	340	Python
331	reazon-research/ReazonSpeech Massive open Japanese speech corpus	52	Established	speech-corpora-datasets	373	Python
332	yeyupiaoling/YeAudio Audio tools for Python	52	Established	funasr-speech-recognition	16	Python
333	mlalma/KokoroTestApp Test application for Kokoro TTS model	52	Established	text-to-speech-tts	35	Swift
334	OpenVoiceOS/ovos-tts-plugin-cotovia Galician TTS plugin for OVOS	52	Established	espeak-ng-ecosystem	3	Python
335	soniqo/speech-swift AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and...	52	Established	ios-speech-frameworks	417	Swift
336	Thiagohgl/ai-pronunciation-trainer This tool uses AI to evaluate your pronunciation.	52	Established	ai-tutoring-platforms	452	Python
337	zaigie/FunSpeech Out-of-the-box speech service for local, private deployment; quickly set up FunASR and CosyVoice2/3 backends	52	Established	funasr-speech-recognition	111	Python
338	saharmor/whisper-playground Build real time speech2text web apps using OpenAI's Whisper...	52	Established	whisper-transcription-apps	833	Python
339	ArdaGnsrn/elevenlabs-laravel This is an Open Source PHP Laravel package for ElevenLabs Text to Speech API.	52	Established	elevenlabs-integrations	21	PHP
340	asiff00/On-Device-Speech-to-Speech-Conversational-AI This is an on-CPU real-time conversational system for two-way speech...	52	Established	local-voice-assistants	242	Python
341	alphacep/awesome-russian-speech Russian speech technology links	52	Established	voice-ai-learning-collections	370	—
342	h5p/h5p-speak-the-words Create questions answered through speech	52	Established	web-speech-api-libraries	9	JavaScript
343	lucasnewman/nanospeech A simple, hackable text-to-speech system in PyTorch and MLX	52	Established	fastspeech-tts-models	186	Python
344	thorstenMueller/Thorsten-Voice Thorsten-Voice: A free to use, offline working, high quality German TTS...	52	Established	coqui-tts-applications	705	Python
345	pszemraj/vid2cleantxt Python API & command-line tool to easily transcribe speech-based video files...	52	Established	video-transcription-extraction	220	Jupyter Notebook
346	stefantaubert/pinyin-to-ipa Command-line interface and Python library to transcribe pinyin to IPA. The...	52	Established	grapheme-to-phoneme-conversion	53	Python
347	JSchmie/ScrAIbe-WebUI WebUI for ScrAIbe	52	Established	gradio-tts-webuis	52	Python
348	manyeyes/ManySpeech AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...	52	Established	funasr-speech-recognition	71	C#
349	voicegain/platform Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)	52	Established	audio-transcription-apps	32	HTML
350	mgonzs13/audio_common A PortAudio based audio_common with text to speech for ROS 2	52	Established	lightweight-tts-libraries	32	C++
351	FunAudioLLM/SenseVoice Multilingual Voice Understanding Model	52	Established	voice-assistant-devices	7,691	Python
352	react-native-voice/voice 🎤 React Native Voice Recognition library for iOS and Android...	51	Established	react-native-voice-libraries	2,153	TypeScript
353	shhossain/BanglaSpeech2Text BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla...	51	Established	whisper-transcription-apps	121	Python
354	readium/speech 💬 A TypeScript library for implementing read aloud on the Web	51	Established	web-speech-api-tts	12	TypeScript
355	Sharrnah/whispering-ui Native UI for the Whispering Tiger project -...	51	Established	speech-to-text-converters	315	Go
356	canopyai/Orpheus-TTS Towards Human-Sounding Speech	51	Established	multimodal-vision-language	6,000	Python
357	pannous/tensorflow-speech-recognition 🎙Speech recognition using the tensorflow deep learning framework,...	51	Established	speaker-diarization-embedding	2,176	Python
358	dangvansam/viet-tts VietTTS: An Open-Source Vietnamese Text to Speech	51	Established	tts-model-finetuning	83	Python
359	RageAgainstThePixel/com.rest.elevenlabs A non-official Eleven Labs voice synthesis client for Unity (UPM)	51	Established	elevenlabs-integrations	105	C#
360	MasuRii/opencode-smart-voice-notify 🔊 Smart voice notification plugin for OpenCode with multiple TTS engines...	51	Established	edge-tts-implementations	43	TypeScript
361	athena-team/athena An open-source implementation of a sequence-to-sequence based speech processing engine	51	Established	ctc-asr-implementations	970	C++
362	Kyubyong/dc_tts A TensorFlow Implementation of DC-TTS: yet another text-to-speech model	51	Established	tacotron-tts-models	1,159	Python
363	pnnbao97/Kani-TTS-Vie Fast Vietnamese TTS. 370M params, 3-second inference.	51	Established	voice-cloning-synthesis	64	Jupyter Notebook
364	bambocher/pocketsphinx-python Python interface to CMU Sphinxbase and Pocketsphinx libraries	51	Established	automatic-speech-recognition	373	Python
365	HA6Bots/Automatic-Youtube-Reddit-Text-To-Speech-Video-Generator-and-Uploader A series of 3 programs that will automatically receive scripts from Reddit,...	51	Established	ai-video-generation	656	Python
366	google/uis-rnn This is the library for the Unbounded Interleaved-State Recurrent Neural...	51	Established	speaker-diarization-embedding	1,589	Python
367	alexa-pi/AlexaPi Alexa client for all your devices! # No active development. PRs welcome #...	51	Established	voice-assistant-applications	1,331	Python
368	vannu07/jarvis 🤖 Jarvis - AI Voice Assistant with Face Recognition | Hacktoberfest 2025...	51	Established	voice-assistant-projects	32	Python
369	spring-media/TransformerTTS 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...	51	Established	text-to-speech-frameworks	1,161	Python
370	TheStageAI/TheWhisper Optimized Whisper models for streaming and on-device use	51	Established	whisper-framework-ports	821	Python
371	WhiteMagic2014/tts-edge-java Java SDK for Edge Read Aloud	51	Established	edge-tts-implementations	76	Java
372	whitphx/streamlit-stt-app Real time web based Speech-to-Text app with Streamlit	51	Established	streamlit-tts-apps	253	Python
373	transcriptionstream/transcriptionstream Turnkey self-hosted offline transcription and diarization service with LLM summary	51	Established	audio-transcription-tools	920	Python
374	yuvraj108c/ComfyUI-Whisper Transcribe audio and add subtitles to videos using Whisper in ComfyUI	51	Established	comfyui-extensions	218	Python
375	mallorbc/whisper_mic Project that allows one to use a microphone with OpenAI whisper.	51	Established	speech-to-text-converters	785	Python
376	codeforequity-at/botium-speech-processing Botium Speech Processing	51	Established	web-speech-api-tts	944	JavaScript
377	keithito/tacotron A TensorFlow implementation of Google's Tacotron speech synthesis with...	51	Established	text-to-speech-frameworks	2,988	Python
378	zai-org/GLM-ASR GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters	51	Established	llm-scaling-architecture	759	Python
379	xiangyuecn/Recorder HTML5 JS audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats; supports PC and some Android and iOS browsers, Hybrid...	51	Established	web-speech-api-libraries	5,577	JavaScript
380	ekwek1/soprano Soprano: Instant, Ultra-Realistic Text-to-Speech	51	Established	lightweight-tts-libraries	1,203	Python
381	BolisettySujith/J.A.R.V.I.S A voice assistant 🗣️ which can be used to interact with your computer 💻 and...	51	Established	python-voice-assistants	341	Python
382	ArkanDash/Multi-Model-RVC-Inference RVC Inference with multiple model and huggingface support	51	Established	voice-cloning-tools	112	Python
383	XDcobra/react-native-sherpa-onnx React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing...	51	Established	react-native-voice-libraries	9	TypeScript
384	MycroftAI/adapt Adapt Intent Parser	51	Established	speech-ai-coursework	722	Python
385	at16k/at16k Trained models for automatic speech recognition (ASR). A library to quickly...	51	Established	automatic-speech-recognition	130	Python
386	kan-bayashi/ParallelWaveGAN Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN &...	51	Established	neural-vocoder-implementations	1,637	Jupyter Notebook
387	ftyers/commonvoice-utils Linguistic processing for Common Voice	51	Established	voice-ai-learning-collections	58	Python
388	soobinseo/Transformer-TTS A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"	51	Established	text-to-speech-frameworks	690	Python
389	drethage/speech-denoising-wavenet A neural network for end-to-end speech denoising	51	Established	audio-noise-reduction	708	Python
390	gooofy/py-nltools A collection of basic python modules for spoken natural language processing	51	Established	speech-recognition-apis	55	Python
391	marytts/marytts MARY TTS -- an open-source, multilingual text-to-speech synthesis system...	51	Established	java-tts-libraries	2,573	Java
392	NVIDIA/OpenSeq2Seq Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP	51	Established	neural-machine-translation	1,560	Python
393	srvk/eesen The official repository of the Eesen project	51	Established	end-to-end-asr-frameworks	834	C++
394	doctoroyy/edge-tts-as-a-service This is a simple HTTP service that uses the Edge-TTS library to generate...	51	Established	edge-tts-implementations	33	Python
395	pierreaubert/spinorama A library to display and compare spinorama (speakers measurements) graphs.	51	Established	automatic-speech-recognition	151	Python
396	jaywalnut310/glow-tts A Generative Flow for Text-to-Speech via Monotonic Alignment Search	51	Established	text-to-speech-frameworks	704	Python
397	totalvoice/totalvoice-node Node.js client for the Totalvoice API	51	Established	sms-voice-integrations	61	JavaScript
398	AdolfVonKleist/Phonetisaurus Phonetisaurus G2P	51	Established	grapheme-to-phoneme-conversion	510	Shell
399	AI4Bharat/Chitralekha Chitralekha - A video transcreation platform for Indic languages, supporting...	51	Established	video-dubbing-tools	113	—
400	julius-speech/julius Open-Source Large Vocabulary Continuous Speech Recognition Engine	51	Established	keyword-speech-recognition	1,930	C
