All Voice AI Tools

8,165 tools ranked by quality score · Page 15 of 82

Showing 1401–1500 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1401	tihu-nlp/tihu Persian Text-To-Speech	41	Emerging	persian-speech-ai	85	C++
1402	markokosticdev/cloud_text_to_speech_flutter Single interface to Google, Microsoft, and Amazon Text-To-Speech.	41	Emerging	educational-voice-apps	8	Dart
1403	orange2ai/youtube-subtitle-translator 🌐 Real-time YouTube subtitle translator browser extension. Translate...	41	Emerging	live-meeting-translation	26	JavaScript
1404	rudrankriyam/Glosik Sample project for F5-TTS using MLX Swift	41	Emerging	ios-speech-frameworks	50	Swift
1405	lucko515/speech-recognition-neural-network This is the end-to-end Speech Recognition neural network, deployed in Keras....	41	Emerging	speaker-diarization-embedding	190	HTML
1406	cameronking4/VapiBlocks Vapi Blocks is a library of components & api snips to copy and paste into...	41	Emerging	voice-command-assistants	83	TypeScript
1407	Lunarien/Lunariens-Mental-Math-Trainer Mental math trainer made in C#.	41	Emerging	dotnet-tts-libraries	10	C#
1408	holm-aune-bachelor2018/ctc Speech recognition with CTC in Keras with Tensorflow backend	41	Emerging	ctc-asr-implementations	31	Python
1409	AryanVBW/AiVoiceClonerPRO Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into...	41	Emerging	voice-cloning-synthesis	72	Python
1410	Emotional-Text-to-Speech/hmm-for-emo-tts :computer: A repository with comprehensive instructions for using the...	41	Emerging	zero-shot-voice-synthesis	50	CSS
1411	declare-lab/speech-adapters Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient...	41	Emerging	end-to-end-asr-frameworks	42	Python
1412	modelscope/FunCodec FunCodec is a research-oriented toolkit for audio quantization and...	41	Emerging	neural-vocoder-implementations	442	Python
1413	Kini218/speech-to-text Speech to text script on python	41	Emerging	speech-recognition-apis	35	Python
1414	alias454/YATSEE YATSEE - Yet Another Tool for Speech Extraction & Enrichment	41	Emerging	personal-assistant-rag	31	Python
1415	MHaggis/ASRGEN ASR Configurator, Essentials and Atomic Testing	41	Emerging	automatic-speech-recognition	104	Python
1416	nl8590687/ASRT_SDK_Python3 ASRT语音识别系统的Python版SDK	41	Emerging	voice-ai-sdks	54	Python
1417	1038lab/ComfyUI-SparkTTS ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an...	41	Emerging	comfyui-tts-nodes	124	Python
1418	Dostoyewski/django_voice_bot Package for django onpage support bot with speech recognition and voice commands	41	Emerging	voice-chatbot-applications	4	Python
1419	iBrammm/qwen-asr 🎙️ Implement fast, dependency-free C inference for Qwen3-ASR speech-to-text...	41	Emerging	qwen3-tts-applications	1	C
1420	yl4579/HiFTNet HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...	41	Emerging	text-to-speech-frameworks	247	Python
1421	titilambert/pynuance Wrapper for Nuance Communications services	41	Emerging	lightweight-tts-libraries	3	Python
1422	Andrewcpu/elevenlabs-api 🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs...	41	Emerging	elevenlabs-integrations	38	Java
1423	Frikallo/parakeet.cpp Ultra fast and portable Parakeet implementation for on-device inference in...	41	Emerging	parakeet-asr-implementations	244	C++
1424	tktcorporation/discord-tts-bot A discord bot to use tts in your voice channel.	41	Emerging	discord-tts-bots	4	Rust
1425	janewu77/ela-extension English Learner Assistant	41	Emerging	browser-tts-extensions	4	JavaScript
1426	1neReality/MITSUHA World's First Multilingual Inexpensive Therapeutic Sophisticated...	41	Emerging	gemini-api-applications	272	Python
1427	bhattbhavesh91/wav2vec2-huggingface-demo Speech to Text with self-supervised learning based on wav2vec 2.0 framework...	41	Emerging	wav2vec2-asr-models	29	Jupyter Notebook
1428	kokimame/joytan Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel	41	Emerging	ebook-to-audiobook-conversion	139	Python
1429	serpapps/ai-voice-cloner AI Voice Cloning Desktop Application that runs locally on your computer and...	41	Emerging	voice-cloning-tools	55	—
1430	ssssssilver/sherpa-ncnn-unity 在Unity环境下，借助sherpa-ncnn框架，实现实时并准确的中英双语语音识别功能。	41	Emerging	dotnet-tts-libraries	77	C#
1431	Kaljurand/Arvutaja An Android app for voice actions in Estonian and English	41	Emerging	android-speech-apps	30	Java
1432	quangvu3/coqui-xtts Coqui XTTS model with Vietnamese added	41	Emerging	tts-model-finetuning	4	Python
1433	yzfly/awesome-voice-agents A curated list of voice AI agent frameworks, tools, resources, and best practices	41	Emerging	voice-agent-applications	20	—
1434	zhangzijie-pro/Speaker-Verification Dual-model speech AI toolkit for speaker verification and speaker-aware...	41	Emerging	funasr-speech-recognition	8	Python
1435	pika-online/AESRC2020 a deep accent recognition network	41	Emerging	end-to-end-asr-frameworks	50	Python
1436	zeropointnine/tts-audiobook-tool Audiobook creation tool with support for multiple TTS models (Qwen3-TTS,...	41	Emerging	ebook-to-audiobook-conversion	81	Python
1437	Edw590/VISOR---A-Voice-Assistant V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!	41	Emerging	voice-assistant-projects	36	Go
1438	CodeBySonu95/VoxSherpa-TTS 🎙️ VoxSherpa TTS Offline Neural Text-to-Speech Engine for Android ⚡...	41	Emerging	kokoro-tts-ecosystem	23	Java
1439	renorari/VoiceJP-Discord A discord-app can text-to-speech and speech-to-text	41	Emerging	discord-tts-bots	4	TypeScript
1440	TETYYS/SAPI4 Web interface for Microsoft Sam & friends	41	Emerging	dotnet-tts-libraries	131	C++
1441	mattmireles/kokoro-coreml PyTorch → CoreML conversion pipeline for Kokoro TTS. Unlocks fast on-device...	41	Emerging	kokoro-tts-ecosystem	32	Python
1442	mapluisch/OpenAI-Realtime-API-for-Unity Implementation of OpenAI's Realtime API in Unity. Easily integrate...	41	Emerging	ai-avatar-platforms	31	ShaderLab
1443	shenbengit/TTSTool 科大讯飞离线语音，Text to Speech，TTS	41	Emerging	android-speech-apps	36	Kotlin
1444	aditya-an1l/RILearn Reinventing Reading with a touch of Interactivity aided Learning	41	Emerging	ai-powered-ereaders	4	HTML
1445	leprosus/golang-tts Text-to-Speach golang package based in Amazon Polly service	41	Emerging	go-tts-libraries	26	Go
1446	cherts/mspeech Program for speech recognition using the Google Speech API, voice commands,...	41	Emerging	dotnet-tts-libraries	38	Pascal
1447	nithincvpoyyil/voice-listener An reusable angular component for voice based input using web speech API	41	Emerging	web-speech-api-libraries	2	CSS
1448	aboda-dirbas/whisperclip 🎤 Enhance your voice-to-text transcriptions with WhisperClip, prioritizing...	41	Emerging	local-voice-dictation	1	Swift
1449	Renovamen/Speech-and-Text Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech...	41	Emerging	speech-recognition-apis	341	Python
1450	antifield/vmt Discord App for Transcribing & Translating Voice Messages	41	Emerging	discord-tts-bots	14	Python
1451	smaranjitghose/AIAudioTranscriber A minimalistic web app to generate transciption for audio built using Python	41	Emerging	real-time-voice-translation	31	Python
1452	N6UDP/SteamDiscordTTSBot A steam chat to Discord TTS bridge	41	Emerging	discord-tts-bots	3	C#
1453	deepgram-starters/php-transcription Get started using Deepgram's speech-to-text with this PHP demo app	41	Emerging	deepgram-starter-projects	3	PHP
1454	doveg/whisper-real-time A real time offline transcriber with gui, based on OpenAI whisper	41	Emerging	speech-to-text-converters	16	Python
1455	rishikksh20/gmvae_tacotron Gaussian Mixture VAE Tacotron	41	Emerging	tacotron-tts-models	54	Python
1456	EndlessReform/fish-speech.rs A Fish Speech implementation in Rust, with Candle.rs	40	Emerging	rust-tts-libraries	110	Rust
1457	gillesdemey/google-speech-v2 :speech_balloon: Reverse Engineering Google's Speech To Text API (v2)	40	Emerging	php-tts-libraries	470	—
1458	mramshaw/Speech-Recognition Speech recognition with Python	40	Emerging	automatic-speech-recognition	18	Python
1459	yapit-tts/yapit Listen to anything. TTS for documents, papers, and web pages.	40	Emerging	openai-tts-applications	4	Python
1460	PhilippeRo/IBus-Speech-To-Text A speech to text IBus engine using VOSK	40	Emerging	vosk-asr-implementations	36	Python
1461	rishikksh20/Avocodo-pytorch Avocodo: Generative Adversarial Network for Artifact-free Vocoder	40	Emerging	neural-vocoder-implementations	122	Python
1462	Alex-Tremayne/LaTeXt Python package for converting LaTeX to text which can be read by text to...	40	Emerging	lightweight-tts-libraries	4	Python
1463	Harshit-shrivastav/TikTok-TTS-Bot A python TikTok Text to speech generator telegram bot.	40	Emerging	telegram-voice-transcription	15	Python
1464	jing332/tts-server-android 这是一个Android系统TTS应用，内置微软演示接口，可自定义HTTP请求，可导入其他本地TTS引擎，以及根据中文双引号的简单旁白/对话识别朗读...	40	Emerging	java-tts-libraries	4,315	Kotlin
1465	saurabhdaware/bol Slightly more consistent Text-to-speech for Web and a wrapper around speechSynthesis	40	Emerging	web-speech-api-tts	3	JavaScript
1466	danielclough/vibevoice-rs Rust implementation of VibeVoice text-to-speech with voice cloning and...	40	Emerging	rust-tts-libraries	61	Rust
1467	ehtisham91/Django-Speech-to-text-Chat This App allows users to convert their speech into text and send that text...	40	Emerging	web-based-tts-apps	20	HTML
1468	0xPD33/sonori Sonori is a fully local STT app for Linux (Wayland).	40	Emerging	rust-speech-recognition	17	Rust
1469	gheyret/UQSpeechDataset Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット	40	Emerging	multilingual-speech-datasets	34	—
1470	izwi-ai/izwi On-device AI engine for transcription, TTS, and voice workflows.	40	Emerging	rust-speech-recognition	181	Rust
1471	Nighthawk42/mOrpheus Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.	40	Emerging	audio-transcription-tools	84	Python
1472	aws-samples/sample-voicebot-nova-sonic A sample implementation of real-time voice assistant using Amazon Nova 2...	40	Emerging	voice-assistant-frameworks	3	JavaScript
1473	dsi-icl/do-voice-interaction The goal of this project is to provide a voice assistant to the Data...	40	Emerging	general-purpose-voice-assistants	6	HTML
1474	kaituoxu/Listen-Attend-Spell A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End...	40	Emerging	conformer-asr-implementations	207	Python
1475	bgArray/ZhiYin 知音 - AI音频听觉功能集成软件。提供声乐技术识别分析、伴奏分离等伴奏多种工具。	40	Emerging	funasr-speech-recognition	8	Python
1476	Labmem-Zhouyx/CDFSE_FastSpeech2 The Official Implementation of “Content-Dependent Fine-Grained Speaker...	40	Emerging	fastspeech-tts-models	87	Python
1477	speechly/speechly Client libraries, examples and demos of Speechly API for the Web.	40	Emerging	web-speech-api-libraries	185	TypeScript
1478	domesticatedviking/TextyMcSpeechy Easily create Piper text-to-speech models in any voice. Make a...	40	Emerging	piper-tts-ecosystem	631	Shell
1479	thinh-vu/ur_audio_sub Generate text captions for audio files & youtube video using OpenAI Whisper...	40	Emerging	video-transcription-extraction	16	Jupyter Notebook
1480	lucascamillomd/anki-tts A free, open-source app for Anki text-to-speech in MacOS.	40	Emerging	anki-tts-integration	2	Python
1481	tugstugi/mongolian-speech-recognition Mongolian speech recognition with PyTorch	40	Emerging	end-to-end-asr-frameworks	138	Python
1482	loretoparisi/wave2vec-recognize-docker Wave2vec 2.0 Recognize pipeline	40	Emerging	wav2vec2-asr-models	33	Python
1483	Baidu-AIP/speech-tts-cors 百度语音语音合成跨域demo以及支持库	40	Emerging	web-speech-api-tts	109	JavaScript
1484	HeyHeyChicken/NOVA-Python NOVA is a customizable voice assistant made with Python.	40	Emerging	voice-assistant-applications	17	Python
1485	mmpneo/curses Speech to Text and KB input captions for OBS, VRChat, Twitch chat and Discord	40	Emerging	live-caption-generation	695	TypeScript
1486	Umbaji/NMTMD Official repository for the Opensource Textdataset for NMT for local langues...	40	Emerging	speech-corpora-datasets	26	—
1487	ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent A lightweight voice companion, optimized for macOS.	40	Emerging	ios-speech-frameworks	9	Python
1488	p1an-lin-jung/teochew-g2p 这是一个潮州话文本端的处理工具和正字标准，主要为潮州方言的语音合成服务	40	Emerging	grapheme-to-phoneme-conversion	49	Python
1489	FR33TR1ST/VoiceAssistant A VoiceAsistant with WhisperAI speech recognition	40	Emerging	local-voice-assistants	32	Python
1490	wwdok/faster-whisper-webui-cn Cloned from https://huggingface.co/spaces/aadnk/faster-whisper-webui, and...	40	Emerging	speech-to-text-converters	28	Python
1491	tsensei/OpenReels Open-source AI pipeline that turns any topic into a fully rendered...	40	Emerging	tts	41	TypeScript
1492	yui-mhcp/text_to_speech (Multi Speaker) Text-To-Speech (TTS) project	40	Emerging	fastspeech-tts-models	10	Python
1493	ritazh/EchoML 🔉 A web app to play, visualize, and annotate your audio files for machine learning	40	Emerging	audio-music-learning	120	JavaScript
1494	ahaocd/davinci-voice-clone DaVinci Subtitle Alignment + Voice Clone + AI Emotion Optimization \| CosyVoice2 TTS	40	Emerging	voice-cloning-tools	4	Python
1495	eellak/gsoc2021-audio-annotation-tool Creation of a multi user audio first annotation tool - GSoC 2021	40	Emerging	data-annotation-tools	29	HTML
1496	small-cactus/Jarvis-ChatGPT-VoiceAssistant Jarvis powered by GPT-3.5/GPT-4	40	Emerging	python-voice-assistants	27	Python
1497	ibm-self-serve-assets/Watson-Speech This collection demonstrates how to help you to quickly embed Watson Speech...	40	Emerging	audio-transcription-apps	17	Jupyter Notebook
1498	maum-ai/wavegrad2 Unofficial Pytorch Implementation of WaveGrad2	40	Emerging	audio-noise-reduction	112	Jupyter Notebook
1499	carleeno/elevenlabs_tts Custom TTS Integration using ElevenLabs API	40	Emerging	elevenlabs-integrations	99	Python
1500	awslabs/speech-representations Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)	40	Emerging	end-to-end-asr-frameworks	104	Python

« Prev 1 2 3 … 13 14 15 16 17 … 80 81 82 Next »