All Voice AI Tools

8,165 tools ranked by quality score · Page 14 of 82

Showing 1301–1400 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1301	rafaballerini/AssistentePessoal Assistente pessoal virtual desenvolvida com Python 🤖	42	Emerging	general-purpose-voice-assistants	412	Python
1302	repodiac/german_transliterate Python module to clean and transliterate (i.e. normalize) German text...	42	Emerging	text-normalization-engines	36	Python
1303	lancejames221b/jarvis-voice OpenJarvis — Real-time AI voice assistant for Discord. Talk to the same...	42	Emerging	python-voice-assistants	5	JavaScript
1304	ranchlai/mandarin-tts Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 ,...	42	Emerging	fastspeech-tts-models	484	Python
1305	atomicoo/PTTS-WebAPP Parallel TTS web demo based on Flask + Vue (Vuetify). 基于 Flask + Vue 的语音合成单网页演示项目。	42	Emerging	web-based-tts-apps	48	Python
1306	Skeli010/GaryTTS 强大免费的本地文本转语音软件	42	Emerging	lightweight-tts-runtimes	2	—
1307	puff-dayo/Kokoro-82M-Android A minimal Android demo app for Kokoro-TTS	42	Emerging	kokoro-tts-ecosystem	49	Kotlin
1308	NateRickard/Xamarin.Cognitive.Speech A client library that makes it easy to work with the Microsoft Cognitive...	42	Emerging	dotnet-tts-libraries	58	C#
1309	sksalahuddin2828/AI_Personal_Digital_Assistant AI Personal Voice Assistant Project (Male - Female version)	42	Emerging	voice-assistant-applications	212	Python
1310	Youdef20/voxtral.c 🔊 Streamline audio processing with Voxtral.c, a pure C implementation for...	42	Emerging	lightweight-tts-runtimes	2	C
1311	aahl/qwen-tts2api 🗣️ Qwen TTS to OpenAI Speech API	42	Emerging	qwen3-tts-applications	46	Python
1312	wq2012/SpeakerRecognitionFromScratch Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家	42	Emerging	speaker-diarization-embedding	47	Python
1313	tikhonp/yandex-speechkit-lib-python Python SDK for Yandex Speechkit API.	42	Emerging	yandex-speechkit-tools	54	Python
1314	BlinkTagInc/gtfs-tts Review GTFS stop pronunciations to determine which stops need a tts_stop_name value.	42	Emerging	google-tts-libraries	5	TypeScript
1315	scart97/thunder-speech A Hackable speech recognition library.	42	Emerging	ctc-asr-implementations	25	Python
1316	showlab/whisperVideo Find out who said what in the video.	42	Emerging	whisper-diarization	138	Jupyter Notebook
1317	PyThaiNLP/tts-thai Thai TTS	42	Emerging	lightweight-tts-runtimes	46	Scheme
1318	googlecreativelab/obvi A Polymer 3+ webcomponent / button for doing speech recognition	42	Emerging	web-speech-api-libraries	59	JavaScript
1319	twilio-labs/sample-autopilot-voice-ivr Voice-Powered IVR Chatbot with Autopilot	42	Emerging	chatbot-frameworks	22	JavaScript
1320	ErcinDedeoglu/WhisperDock Dockerized Whisper C++ speech-to-text API for easy deployment and rapid...	42	Emerging	whisper-transcription-apps	28	C++
1321	SteTR/Emost-Bot Discord Music Bot using Voice Recognition to receive commands.	42	Emerging	discord-tts-bots	36	JavaScript
1322	kamiazya/ngx-speech-recognition Angular 5+ speech recognition service (based on browser implementation such...	42	Emerging	web-speech-api-libraries	25	TypeScript
1323	jordicor/santa-claus-is-calling A magical Christmas experience where Santa Claus (AI with Santa's voice)...	42	Emerging	voice-agent-applications	11	Python
1324	hcy71o/AutoVocoder Autovocoder: Fast Waveform Generation from a Learned Speech Representation...	42	Emerging	neural-vocoder-implementations	71	Python
1325	nipponjo/tts_arabic 🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python...	42	Emerging	lightweight-tts-runtimes	37	Python
1326	everydaycodings/MimicMania MimicMania is a web application that allows you to generate speech and clone...	42	Emerging	voice-cloning-tools	60	Python
1327	linagora-labs/ssak SSAK contains helpers and tools to process data and train/infer ASR models.	42	Emerging	automatic-speech-recognition	5	Python
1328	kristofferv98/VoiceProcessingToolkit The VoiceProcessingToolkit is an all-encompassing suite designed for...	42	Emerging	coqui-tts-applications	4	Python
1329	ringger/transcribe-critic Multi-source transcript merging inspired by textual criticism — LLM...	41	Emerging	whisper-diarization	14	Python
1330	WilleIshere/SimplerKokoro A Python package that makes it easy to use the Kokoro voice synthesis library.	41	Emerging	kokoro-tts-ecosystem	12	Python
1331	huckiyang/Voice2Series-Reprogramming ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...	41	Emerging	text-to-speech-frameworks	73	TypeScript
1332	AkojimaSLP/Beamforming-for-speech-enhancement simple delaysum, MVDR and CGMM-MVDR	41	Emerging	keyword-speech-recognition	279	Python
1333	gittyeric/FAlexa Create your own verbal commands that fuzzily map to custom Javascript /...	41	Emerging	vue-speech-recognition	5	TypeScript
1334	book000/audio-transcriber-docker Automatically transcribe the audio of video / audio files using Speech Recognition.	41	Emerging	real-time-voice-translation	3	JavaScript
1335	jing332/tts-server-go 微软TTS服务转发，以便在阅读APP中通过网络导入方式收听微软TTS / Edge大声朗读	41	Emerging	edge-tts-implementations	411	Go
1336	Saganaki22/ComfyUI-Step_Audio_EditX_TTS ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice...	41	Emerging	text-to-speech-tts	57	Python
1337	gianpaj/sexyvoice Voice Cloning, Voice Call and Text to Speech platform. Perfect for content...	41	Emerging	text-to-speech	17	TypeScript
1338	CoffeeVampir3/audiocraft-webui Quick webui for audiocraft	41	Emerging	audio-music-learning	169	Python
1339	seven-io/net-client Official .NET API Client for seven	41	Emerging	sms-voice-integrations	3	C#
1340	nabz0r/mac-local-translator Local translation app for Mac using speech recognition and offline translation	41	Emerging	local-voice-dictation	4	Swift
1341	mostafa-kermaninia/speech-processing-toolkit A comprehensive machine learning pipeline for robust Speaker Identification...	41	Emerging	speaker-diarization-embedding	4	Jupyter Notebook
1342	sotelo/parrot RNN-based generative models for speech.	41	Emerging	next-word-prediction	609	Python
1343	TeamAudio/reaspeech Speech recognition for REAPER	41	Emerging	audio-transcription-tools	36	Lua
1344	bishop-ai/bishop-ai Voice and text virtual assistant	41	Emerging	virtual-assistants-nlp	28	JavaScript
1345	Lastorder-DC/chatreader-kor 채팅 읽어주는 로봇	41	Emerging	twitch-chat-tts	17	JavaScript
1346	spokestack/spokestack-ios Spokestack: give your iOS app a voice interface!	41	Emerging	ios-speech-frameworks	45	Swift
1347	HenestrosaDev/audiotext A desktop application that transcribes audio from files, microphone input or...	41	Emerging	real-time-voice-translation	345	Python
1348	jianchang512/fireredasr-ui 一个中文语音转文字项目，封装自FireRedASR	41	Emerging	funasr-speech-recognition	85	Python
1349	WangHelin1997/SSR-Speech SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis	41	Emerging	zero-shot-voice-synthesis	147	Python
1350	COBACOBAINI/vibe Transcribe audio and video offline with OpenAI Whisper on your device,...	41	Emerging	vibe-coding-framework	5	TypeScript
1351	hubendubler/gTTS.js A Promise based Node.js/TypeScript port of the gTTS Google-Text-To-Speech...	41	Emerging	google-tts-libraries	5	TypeScript
1352	FontaineRiant/wrAIter AI writing assistant with voiced narrator and characters and an illustrator	41	Emerging	text-to-speech-tts	38	Python
1353	JasonLovesDoggo/Flow Native MacOS dictation that captures audio, transcribes speech, and formats...	41	Emerging	local-voice-dictation	10	Rust
1354	DeeepMaker/subtitle-to-audio A python script to generate .wav audio files for .srt subtitle files	41	Emerging	whisper-subtitle-generation	34	Python
1355	alsrb0607/KoreanSTT kospeech를 활용한 한국어 음성 인식 모델 개발	41	Emerging	voice-ai-learning-collections	28	Python
1356	MikeyParton/react-speech-kit React hooks for Speech Recognition and Speech Synthesis	41	Emerging	react-speech-recognition	246	JavaScript
1357	botbahlul/pyvosklivesubtitle PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23...	41	Emerging	live-caption-generation	29	Python
1358	botbahlul/VOSK-Powered-Live-Subtitle-V3 ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free...	41	Emerging	live-caption-generation	42	Java
1359	OwenEdwards/videojs-speak-descriptions-track A Video.js 7 middleware that uses browser speech synthesis to speak...	41	Emerging	web-speech-api-tts	6	JavaScript
1360	Johnson145/voxtral_wyoming Offline Speech-to-Text (STT) service using Mistral's Voxtral model with...	41	Emerging	lightweight-tts-runtimes	24	Python
1361	gdoudeng/react-native-baidu-asr The react-native Baidu voice library provides voice recognition, voice...	41	Emerging	react-native-voice-libraries	34	Java
1362	XimilalaXiang/DeLive DeLive is a cross-platform desktop app that captures system audio output and...	41	Emerging	live-caption-generation	22	TypeScript
1363	OpenMOSS/MOSS-Audio-Tokenizer MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on...	41	Emerging	voice-ai-learning-collections	162	Python
1364	georgezhao2010/apple_airplayer Make your AirPlay devices as TTS speakers	41	Emerging	home-assistant-tts	136	Python
1365	totalvoice/totalvoice-php Client em PHP para API da Totalvoice	41	Emerging	sms-voice-integrations	29	PHP
1366	MainRo/docker-deepspeech-server A dockerfile to run deepspeech-server	41	Emerging	parakeet-asr-implementations	30	Dockerfile
1367	aks-devs/mod_openai_asr Freeswitch Speech-To-Text module	41	Emerging	vosk-asr-implementations	15	C
1368	hhguo/MSMC-TTS Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS	41	Emerging	text-to-speech-frameworks	169	Python
1369	TartuNLP/text-to-speech-api REST API for neural text-to-speech synthesis	41	Emerging	lightweight-tts-libraries	17	Python
1370	finos/greenkey-asrtoolkit A collection of useful tools for handling speech recognition data	41	Emerging	automatic-speech-recognition	30	Python
1371	AIFSH/ComfyUI-FishSpeech a custom comfyui node for fish-speech	41	Emerging	comfyui-tts-nodes	49	Python
1372	OwenTyme/voice-zero Collection of samples suitable for use with zero-shot text to speech engines.	41	Emerging	tts-model-finetuning	5	—
1373	revdotcom/reverb Open source inference code for Rev's model	41	Emerging	automatic-speech-recognition	435	Python
1374	yxshee/speech-command-recognition speech command recognition using CNNs, with preprocessing, model training,...	41	Emerging	speaker-diarization-embedding	4	Jupyter Notebook
1375	kapi2800/qwen3-tts-apple-silicon Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning,...	41	Emerging	qwen3-tts-applications	396	Python
1376	kgnlp/allophant A multilingual phoneme recognizer capable of generalizing zero-shot to...	41	Emerging	speaker-diarization-embedding	29	Python
1377	fqueis/pollinationsai 🔥 TypeScript SDK wrapper for Pollinations AI services	41	Emerging	google-tts-libraries	13	TypeScript
1378	HectorPulido/chatbot-with-voice Jarvis like chatbot with voice	41	Emerging	voice-chatbot-applications	20	Python
1379	rtzr/Awesome-Korean-Speech-Recognition 한국어 음성인식 STT API 리스트. 각 성능 벤치마크.	41	Emerging	voice-ai-learning-collections	492	—
1380	amitdev01/awesome-voice-ai Awesome Voice Ai	41	Emerging	voice-ai-learning-collections	4	—
1381	petewarden/spchcat Speech recognition tool to convert audio to text transcripts, for Linux and...	41	Emerging	automatic-speech-recognition	482	C
1382	tuan3w/cnn_vocoder A fast cnn-based vocoder	41	Emerging	neural-vocoder-implementations	78	Python
1383	alamparelli/mcp-claude-say Voice interaction for Claude Code - Talk to Claude and hear responses using...	41	Emerging	voice-enabled-coding-assistants	7	Python
1384	kahne/SpeechTransProgress Tracking the progress in end-to-end speech translation	41	Emerging	audio-transcription-apps	261	—
1385	forfrt/SteerMoE SteerMoE: Efficient Audio-Language Models with Preserved Reasoning Capabilities	41	Emerging	voice-ai-learning-collections	9	Python
1386	Edw590/VISOR---Android-Version-Assistant V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!	41	Emerging	voice-assistant-projects	35	Java
1387	mobassir94/comprehensive-bangla-tts Aiming to achieve ultimate Multilingual TTS pipeline with main focus on...	41	Emerging	tts-model-finetuning	43	Jupyter Notebook
1388	dpm76/QuickRouteMap Simple route guidance application.	41	Emerging	android-speech-apps	1	Java
1389	18F/dol-whd-14c The 14(c) system will become a modern, digital-first service. Applicants...	41	Emerging	government-procurement-docs	16	C#
1390	priyanujgogoi-28/flowery-tts Wrapper of Flowery Text to Speech API for Dart	41	Emerging	educational-voice-apps	5	Dart
1391	Yuan-ManX/audio-development-tools Audio Development Tools (ADT) is a project for advancing sound, speech, and...	41	Emerging	audio-source-separation	441	—
1392	solaoi/lycoris Real-time speech recognition & AI-powered note-taking app for macOS with...	41	Emerging	local-voice-dictation	73	TypeScript
1393	arpy8/ESP32_Voice_Assistant This project combines embedded system and AI inference to create an...	41	Emerging	voice-assistant-devices	39	Python
1394	dsfsi/dsfsi-datasets Official DSFSI Public Datasets Registry - Comprehensive catalog of 50+...	41	Emerging	speech-corpora-datasets	6	Jupyter Notebook
1395	TheMorpheus407/OpenAI-Audiobook-Generator This project is a web-based application that converts text into audio,...	41	Emerging	openai-tts-applications	84	JavaScript
1396	TartuNLP/text-to-speech-worker Estonian multi-speaker neural text-to-speech worker that processes requests...	41	Emerging	self-hosted-tts-servers	16	Python
1397	Pranjalya/tts-tortoise-gradio A Gradio setup for Tortoise TTS.	41	Emerging	gradio-tts-webuis	45	Python
1398	ardha27/AI-Waifu-Vtuber AI Vtuber for Streaming on Youtube/Twitch	41	Emerging	interactive-ai-avatars	1,049	Python
1399	yeahhe365/PageTalk 一个简洁且优秀的描述是：这是一款在任何网页上实现无缝语音转文字的 Chrome 扩展，使用先进的 ASR API。	41	Emerging	browser-tts-extensions	37	JavaScript
1400	JoelShine/Jarvis-v2.0 This is a major update of my project JARVIS-The-Ultimate-Project. You can...	41	Emerging	python-voice-assistants	32	Python

« Prev 1 2 3 … 12 13 14 15 16 … 80 81 82 Next »