All Voice AI Tools

8,165 tools ranked by quality score · Page 22 of 82

Showing 2101–2200 of 8,165

#	Tool	Score	Tier	Category	Stars	Language
2101	ElsebaiyMohamed/Modablag This project presents a comprehensive study on video dubbing techniques and...	36	Emerging	voice-cloning-synthesis	11	—
2102	nidi3/swiss-wowbagger Let yourself be insulted in Swiss German. Nicer cursing in Bernese German.	36	Emerging	java-tts-libraries	13	Kotlin
2103	jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch Semi-supervised spoken language understanding (SLU) via self-supervised...	36	Emerging	voice-ai-learning-collections	12	Python
2104	ayutaz/uCosyVoice CosyVoice3 text-to-speech for Unity using ONNX inference. Supports zero-shot...	36	Emerging	coqui-tts-applications	16	C#
2105	gokhaneraslan/XTTS_V2-finetuning Training XTTS V2 and PEFT LORA Text-to-Speech (TTS)	36	Emerging	tts-model-finetuning	4	Python
2106	crimson0829/RecordVoiceView Recording widget for Android, with support for real-time speech-to-text conversion	36	Emerging	android-speech-apps	13	Kotlin
2107	GuruCharan94/az-podcast-transcriber A podcast transcription service built on Azure that transcribes any new...	36	Emerging	audio-transcription-apps	10	C#
2108	d-kavinraja/MouthMap MouthMap is a deep learning-based lip reading system that converts silent...	36	Emerging	lip-reading-synthesis	4	Jupyter Notebook
2109	TejasQ/praise Do stuff with your voice in the browser.	36	Emerging	web-speech-api-libraries	13	TypeScript
2110	shervinemami/practice_speechrec_mappings A game to help design a better character mapping and to learn the mapping...	36	Emerging	automatic-speech-recognition	11	Python
2111	StachePL/ExcelToAmazonPolly Simple text-to-speech tool combining powers of Excel and Amazon Polly.	36	Emerging	aws-polly-tts	13	C#
2112	rudra00434/SoulPlayer My own music application built with Django, Tailwind CSS and spaCy...	36	Emerging	news-audio-bulletins	4	HTML
2113	deeheber/text-to-speech-converter A serverless application that converts blobs of text to speech in an audio file	36	Emerging	aws-polly-tts	13	JavaScript
2114	Yuan-ManX/ComfyUI-ChatterboxTTS ComfyUI-ChatterboxTTS is now available in ComfyUI, Chatterbox is the first...	36	Emerging	comfyui-tts-nodes	13	Python
2115	techiaith/docker-huggingface-stt-cy Welsh-language speech recognition with HuggingFace // Speech...	36	Emerging	coqui-tts-applications	13	Python
2116	heyseth/Piper_TTS Use Piper TTS in Visual Studio Code	36	Emerging	piper-tts-ecosystem	4	TypeScript
2117	Malith-Rukshan/whisper-transcriber-bot 🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI...	36	Emerging	speech-to-text-converters	12	Python
2118	hay/audio2text Python command line utility wrappers for Whispercpp and other speech-to-text...	36	Emerging	speech-to-text-converters	12	Python
2119	wulee510505/Text2Speach Speech synthesis in a single line of code; text to speech	36	Emerging	java-tts-libraries	68	Java
2120	uzbekvoice/UzbekVoiceBot Current and Live Telegram bot for collecting dataset	36	Emerging	telegram-voice-transcription	7	Python
2121	ducnt18121997/Viet-Text-Normalization A Python library for text normalization, specifically designed for...	36	Emerging	text-normalization-engines	13	Python
2122	Jugendhackt/synthi-tts Hackathon project to digitize your own voice and have it speak for you!...	36	Emerging	lightweight-tts-libraries	12	Python
2123	playerony/TensorFlowTTS-ts This project implements TensorflowTTS in Tensorflow.js using Typescript,...	36	Emerging	web-speech-api-tts	12	TypeScript
2124	poretsky/rulex Russian pronunciation dictionary	36	Emerging	espeak-ng-ecosystem	12	C
2125	Harshit-Raj-14/JARVIS-Python-Voice-Assistant J.A.R.V.I.S - Python Smart AI Voice Assistant	36	Emerging	voice-assistant-projects	7	TeX
2126	momalekiii/VTT Extract Speech/Text from Video	36	Emerging	speech-recognition-apis	12	Python
2127	nishantnnb/spectrolipi A tool designed to manage annotations for bioacoustics.	36	Emerging	data-annotation-tools	1	JavaScript
2128	MitchellAW/Discord-Bot My own Discord chat bot built in Python using the discord.py API. Has been...	36	Emerging	discord-tts-bots	12	Python
2129	theinlinaung2010/Azure_speech_to_test Sample code for testing speech recognition (speech-to-text) of Burmese...	36	Emerging	dotnet-tts-libraries	3	Jupyter Notebook
2130	ismailperim/reportcast Transform reports into podcasts with AI - Nobody reads your reports. But...	36	Emerging	content-to-podcast-converters	4	TypeScript
2131	aflr-archive/apiaudio-python api.audio Python SDK	36	Emerging	voice-ai-sdks	25	Python
2132	cloudcommunity/Text-to-Speech-Engines A list of different text to speech engines.	36	Emerging	google-tts-libraries	11	—
2133	LWalone/fish-speech 🐟 Enhance communication with Fish Speech, a powerful multilingual...	36	Emerging	voice-assistant-devices	1	Python
2134	MontrealAI/sign2text-v0 Sign Language to Text (A to Z) with Artificial Intelligence | Pre-Alpha Demo	36	Emerging	sign-language-recognition	8	JavaScript
2135	neosun100/Step-Audio-R1.1 Step-Audio-R1.1: The First Audio Language Model with Test-Time Compute...	36	Emerging	voice-cloning-synthesis	4	Python
2136	sahu-adarsh/intervyu Practice job interviews with Neerja, an AI interviewer powered by Claude....	36	Emerging	ai-interview-simulators	3	Python
2137	jcsilva/docker-kaldi-android Dockerfile for compiling Kaldi for Android.	36	Emerging	kaldi-asr-ecosystem	65	Shell
2138	parzibyte/conversor-imagen-a-texto-js Extract text from an image using JavaScript and Tesseract.js	36	Emerging	ai-image-generation-platforms	6	HTML
2139	ThePlasmak/faster-whisper An OpenClaw skill that uses faster-whisper (a faster implementation of the...	36	Emerging	openclaw-skill-integrations	4	Python
2140	syb0rg/Khronos The open source intelligent personal assistant	36	Emerging	local-voice-assistants	26	C
2141	morfeusys/porfir Porfirevich voice assistant	36	Emerging	android-voice-assistants	23	Kotlin
2142	Voice-Privacy-Challenge/Voice-Privacy-Challenge-2020 Baseline Recipe for VoicePrivacy Challenge 2020:...	36	Emerging	automatic-speech-recognition	64	Shell
2143	CodersCreative/faster-whisper-rs A Rust crate for easily integrating faster-whisper STT into your Rust programs.	36	Emerging	rust-speech-recognition	23	Rust
2144	LinqLover/simple-openai-tts-playground Try out the OpenAI Text to Speech API in your browser.	36	Emerging	openai-tts-applications	18	JavaScript
2145	LearnedVector/Wav2Letter Speech recognition model based on a FAIR research paper, built using PyTorch.	36	Emerging	wav2vec2-asr-models	87	Python
2146	egorsmkv/tts_uk High-fidelity speech synthesis for Ukrainian using modern neural networks.	36	Emerging	ukrainian-voice-ai	10	Jupyter Notebook
2147	ontypehq/mlx-swift-asr On-device speech recognition for Apple Silicon, powered by MLX.	36	Emerging	ios-speech-frameworks	4	Swift
2148	atosystem/SpeechCLIP SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model,...	36	Emerging	clip-vision-language	119	Python
2149	rafalimadev/piper-tts-call Python wrapper for Piper TTS with real-time CLI/GUI, global hotkeys, and...	36	Emerging	piper-tts-ecosystem	1	Python
2150	NeoKazuya/qwen3-tts-enhanced Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation...	36	Emerging	voice-cloning-synthesis	17	Python
2151	Degon3399/XTTS_V2 This repository offers a framework for fine-tuning the XTTS_V2 model,...	36	Emerging	tts-model-finetuning	1	Python
2152	aviaryan/Very-Fast-Dictation Instant dictation app for Mac	36	Emerging	audio-transcription-tools	64	Python
2153	mikex86/DeepSpeech-Java-Bindings Java Bindings for the C++ library DeepSpeech	36	Emerging	java-tts-libraries	10	Java
2154	QuantiusBenignus/blurt Gnome shell extension for accurate OFFLINE speech to text input in Linux...	36	Emerging	whisper-transcription-apps	103	JavaScript
2155	MahtaFetrat/ManaTTS-Persian-Tacotron2-Model Tacotron2 Persian Text-to-Speech Model trained on ManaTTS, the largest open...	36	Emerging	tacotron-tts-models	10	Jupyter Notebook
2156	daslearning-org/text-to-speech-offline A lightweight cross-platform Text-To-Speech application which works on...	36	Emerging	lightweight-tts-libraries	3	Python
2157	oleksandr-g-rock/speech2text speech2text	36	Emerging	speech-recognition-apis	1	Python
2158	Saganaki22/ComfyUI-KugelAudio 🗣️ ComfyUI nodes for KugelAudio - Open-source text-to-speech with voice...	36	Emerging	comfyui-tts-nodes	29	Python
2159	winedarkmoon/ElevenGUI A user-friendly interface for ElevenLabs' API with added audio transcription...	36	Emerging	elevenlabs-integrations	12	Python
2160	1038lab/ComfyUI-VoxCPMTTS A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech)...	36	Emerging	comfyui-tts-nodes	36	Python
2161	greg-kennedy/p5-NRL-TextToPhoneme Perl implementation of the Naval Research Laboratory text-to-phoneme...	36	Emerging	grapheme-to-phoneme-conversion	15	Perl
2162	wildminder/ComfyUI-KaniTTS ComfyUI node for modular, human‑like Kani TTS. Generate natural,...	36	Emerging	comfyui-tts-nodes	38	Python
2163	mu-hashmi/personaplex-mlx PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex...	36	Emerging	ios-speech-frameworks	35	Python
2164	tim-gromeyer/VoiceAssistant Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your...	36	Emerging	general-purpose-voice-assistants	18	C++
2165	echonoshy/tingshu Tingshu 听舒｜ Bringing the author’s voice directly to you	36	Emerging	lightweight-tts-runtimes	33	Python
2166	llami-team/wake-me AI-based React component library that detects clapping sounds or finger...	36	Emerging	hand-gesture-control	29	TypeScript
2167	Robofied/Voicenet Comprehensive Python library for speech and voice.	36	Emerging	text-to-speech-conversion	32	Jupyter Notebook
2168	stefantaubert/mean-opinion-score Python library for calculating the mean opinion score and 95% confidence...	36	Emerging	lightweight-tts-libraries	24	Python
2169	kaloprojects/KALO-ESP32-Voice-Assistant Code snippets showing how to record I2S audio and store as .wav file on...	36	Emerging	voice-controlled-robotics	42	C++
2170	fernicar/Parakeet_GUI_TINS_Edition A desktop application built using the TINS paradigm for transcribing audio...	36	Emerging	parakeet-asr-implementations	3	Python
2171	sydkwests/kwest-whisper-analysis Conducted a comprehensive technical analysis of the Whisper model on...	36	Emerging	whisper-transcription-apps	4	Jupyter Notebook
2172	Oct4Pie/persian-stt A Text-To-Speech Model Developed Using 🐸STT	36	Emerging	voice-cloning-synthesis	13	Jupyter Notebook
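2173	Ma-Dan/asr-decode A lightweight speech recognition decoding and inference framework trimmed down from Kaldi; currently implements MFCC+GMM+Viterbi, with no dependency on libraries such as OpenFST or OpenBLAS	36	Emerging	kaldi-asr-ecosystem	22	C++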
2174	wblgers/hmm_speech_recognition_demo A demo for simple isolated Chinese speech word recognition using GMMHMM in Python	36	Emerging	keyword-speech-recognition	43	Python
2175	htn-l/htn-l.github.io Takes in audio feed from lectures or meetings, performs speech to text...	36	Emerging	meeting-transcription-summarizers	9	HTML
2176	supershaneski/openai-chatterbox A sample Nuxt 3 application that listens to chatter in the background and...	36	Emerging	speech-to-text-converters	10	Vue
2177	tsengia/JSGFKit_Plus_Plus A C++ library for parsing and manipulating JSGF grammar files.	36	Emerging	funasr-speech-recognition	14	C++
2178	bundlab/voice-stream 🎙️ Lightweight offline Python TTS engine. Thread-safe, CLI-ready, and...	36	Emerging	coqui-tts-applications	1	Python
2179	MahtaFetrat/ManaTTS-Persian-Speech-Dataset ManaTTS is the largest open Persian speech dataset with 114+ hours of...	36	Emerging	persian-speech-ai	49	Jupyter Notebook
2180	sooftware/lightning-asr Modular and extensible speech recognition library leveraging...	36	Emerging	end-to-end-asr-frameworks	50	Python
2181	sayyedrizwan/TextConvertor Convert text into voice (speech) and speech into text.	36	Emerging	android-speech-apps	3	Java
2182	edouardpoitras/eva Open source voice-enabled personal assistant	36	Emerging	general-purpose-voice-assistants	10	Python
2183	vigonotion/tts.astromech Text to Astromech integration for Home Assistant (R2D2 Beep Boop Sounds)	36	Emerging	home-assistant-tts	53	Python
2184	notebook-nexus/chatterbox-tts-colab Transform any text into natural-sounding speech, clone voices from audio...	36	Emerging	text-to-speech-conversion	27	—
2185	smartgic/docker-mycroft Mycroft AI Voice Assistant Docker images and docker-compose.yml files for...	36	Emerging	coqui-tts-applications	41	Dockerfile
2186	amitpatil321/VoiceForm Voice-controlled form that can be filled, cleared, and submitted using only...	36	Emerging	vue-speech-recognition	4	JavaScript
2187	maemreyo/omnivoice-server OpenAI-compatible HTTP server for OmniVoice text-to-speech	36	Emerging	—	2	Python
2188	cottongeeks/podscript Generate podcast transcripts using language and speech-to-text models	36	Emerging	ai-podcast-generation	171	TypeScript
2189	Sundy1219/ctc_beam_search_lm CTC+Beam_Search+kenlm is a decoding system for acoustic models that use Chinese characters as the modeling unit	36	Emerging	ctc-asr-implementations	48	C++
2190	shanghaimoon888/mod_vadasr A FreeSWITCH module that performs VAD and ASR with the iFLYTEK WebSocket API.	36	Emerging	vosk-asr-implementations	50	C
2191	mahimairaja/openrtc-python OpenRTC lets developers run multiple LiveKit voice agents in one Python...	36	Emerging	voice-agent-applications	2	Python
2192	DKMitt/speech-to-text-js The Voice Note App's purpose is to experiment with the Web Speech API by...	36	Emerging	web-speech-api-libraries	51	JavaScript
2193	Sri-Krishna-V/Elu AI-powered Chrome extension that makes any web article accessible —...	36	Emerging	browser-tts-extensions	3	JavaScript
2194	vectominist/MiniASR A mini, simple, and fast end-to-end automatic speech recognition toolkit.	36	Emerging	end-to-end-asr-frameworks	53	Jupyter Notebook
2195	lucko515/Speech-commands-recognition Recognizing common speech commands using Keras and Tensorflow.	36	Emerging	keyword-speech-recognition	10	Python
2196	Zoomicon/SpeechLib Library for Speech Synthesis and Recognition using Windows.Speech or...	36	Emerging	dotnet-tts-libraries	9	C#
2197	GuangChen2333/FindUrVoicesPJSK One-click tool for fetching single-character voice datasets from Project Sekai: Colorful Stage | no manual labeling needed | uncompressed WAV | A simple tool for obtaining...	36	Emerging	tts-dataset-creation	20	Python
2198	aks-devs/mod_google_tts Freeswitch Text-To-Speech module	36	Emerging	vosk-asr-implementations	4	C
2199	hmeutzner/kaldi-avsr Kaldi-based audio-visual speech recognition	36	Emerging	kaldi-asr-ecosystem	6	Shell
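2200	lissettecarlr/kuon Kuon: a large-model voice assistant under development, currently focused on ease of use and a quick start, with support for selective conversation memory and Model Context Protocol (MCP) services...	36	Emerging	voice-agent-applications	47	Python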