All Voice AI Tools

8,165 tools ranked by quality score · Page 16 of 82

Showing 1501–1600 of 8,165

#	Tool	Score	Tier	Category	Stars	Language
1501	HurroWorld/text-to-audio2face Web interface to convert text to speech and route it to an Audio2Face...	40	Emerging	voice-ai-assistants	34	JavaScript
1502	hwRG/End-to-End-TTS-Fine-Tune Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.	40	Emerging	fastspeech-tts-models	29	Python
1503	qforge-dev/qspeak qSpeak is a powerful voice transcription and AI assistant tool that helps...	40	Emerging	ai-note-taking-apps	62	TypeScript
1504	definitio/ha-rhvoice Home Assistant integration for RHVoice - a local text-to-speech engine.	40	Emerging	home-assistant-tts	52	Python
1505	jimbobbennett/SpeechToTextSamples Sample code showing how to use the Azure Speech to Text service from Python 🗣	40	Emerging	dotnet-tts-libraries	29	Python
1506	henryhale/ttspeech 🔊 A fully basic voice synthesizer in vanillaJS	40	Emerging	web-speech-api-tts	17	HTML
1507	tianbot/rosecho Tianbot Rosecho (Tianecho), a Chinese voice human-machine interaction module with plug-and-play ROS support	40	Emerging	voice-controlled-robotics	36	C
1508	inboxpraveen/Speech-Annotation-Tool Review, correct, and export ASR transcripts at scale. Web-based ASR accuracy...	40	Emerging	whisper-speech-transcription	10	Python
1509	oscie57/tiktok-voice Simple Python script to interact with the TikTok TTS API	40	Emerging	telegram-voice-transcription	599	Python
1510	RafalWilinski/serverless-medium-text-to-speech 🔊 Serverless-based, text-to-speech service for Medium articles	40	Emerging	aws-polly-tts	95	JavaScript
1511	QiBowen2008/SuperTextToolBox A free text-processing toolbox	40	Emerging	dotnet-tts-libraries	57	Rich Text Format
1512	SadeghKrmi/pertts-streamlit Persian text-to-speech streamlit interface	40	Emerging	voice-cloning-synthesis	46	Python
1513	Saganaki22/ComfyUI-KittenTTS 😻 A simple ComfyUI custom node for KittenTTS - an ultra-lightweight...	40	Emerging	comfyui-tts-nodes	9	Python
1514	gladchinda/web-speech-demo Learn how to build a simple text-to-speech voice app for the web using the...	40	Emerging	web-speech-api-tts	22	JavaScript
1515	MicheleYin/misaki-rs Rust port of Misaki	40	Emerging	rust-nlp-bindings	6	Rust
1516	HerbertHe/edge-tts-server Server for edge-tts	40	Emerging	edge-tts-implementations	29	TypeScript
1517	jscrane/TTS Arduino Text-to-Speech Library	40	Emerging	embedded-tts-systems	214	C
1518	kaushiknishchay/ComfyUI-Qwen3-ASR ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports...	40	Emerging	qwen3-tts-applications	11	Python
1519	lucasnewman/vocos-mlx Implementation of 'Vocos: Closing the gap between time-domain and...	40	Emerging	fastspeech-tts-models	24	Python
1520	IceFog72/pocket-tts-openapi Fast, local, OpenAI-compatible TTS server with voice cloning support powered...	40	Emerging	self-hosted-tts-servers	10	Python
1521	coqui-ai/STT-models Open models for Coqui STT	40	Emerging	voice-cloning-synthesis	152	—
1522	soundhound/houndify-sdk-go The official Houndify SDK for Go	40	Emerging	go-tts-libraries	25	Go
1523	satyam9090/Automatic-Indian-Sign-Language-Translator-ISL I created an application which takes in live speech or audio recording as...	40	Emerging	sign-language-translation	131	Python
1524	nerdaxic/glados-voice-assistant DIY Voice Assistant based on the GLaDOS character from Portal video game...	40	Emerging	voice-chatgpt-interfaces	338	C
1525	saadbutt32/Conversion-of-Pakistan-Sign-Languag-into-Text-and-Speech-using-OpenPose-and-Machine-Learning Real-time translation of Pakistan sign language into text and speech using...	40	Emerging	sign-language-translation	28	Python
1526	naschorr/hawking The retro text-to-speech bot for Discord	40	Emerging	discord-tts-bots	27	Python
1527	RoySheffer/im2wav Official implementation of the pipeline presented in I hear your true...	40	Emerging	audio-noise-reduction	124	Python
1528	AEmotionStudio/ComfyUI-FFMPEGA Intelligent FFMPEG agent node for ComfyUI - transforms natural language...	40	Emerging	speech-to-text-transcription	5	Python
1529	akinsella/yt-transcript-rs 🎬️ A Rust library for accessing YouTube Video Infos & Transcripts	40	Emerging	video-transcription-extraction	6	Rust
1530	AndroidMaryTTS/AndroidMaryTTS Android MARY TTS - an open-source, offline HMM-Based text-to-speech...	40	Emerging	java-tts-libraries	197	Java
1531	RapidAI/RapidTTS A cross platform implementation of Text-to-Speech based on ONNXRuntime.	40	Emerging	lightweight-tts-runtimes	32	Python
1532	PhuocElec/zipformer-asr-api REST-API implementation of ZipFormer for automatic speech recognition (ASR)...	40	Emerging	funasr-speech-recognition	2	Python
1533	moeru-ai/ortts 𖣘🔊 Simple and Easy-to-use local TTS inference server, Powered by ONNX Runtime	40	Emerging	rust-tts-libraries	14	Rust
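1534	myuan19/voiceInput Windows AI voice input 🎙: press a hotkey and speak to type, with optional text polishing. Break free of typing limits and express yourself freely and efficiently.	40	Emerging	local-voice-dictation	9	Python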
1535	dmatekenya/Chichewa-Speech2Text Automated Speech Recognition for Chichewa.	40	Emerging	automatic-speech-recognition	24	Jupyter Notebook
1536	CoffeeMethod/KokoroGUI An advanced TTS software, built for audiobooks, podcasts, videos, and more.	40	Emerging	kokoro-tts-ecosystem	6	Python
1537	keonlee9420/Robust_Fine_Grained_Prosody_Control PyTorch Implementation of Robust and fine-grained prosody control of...	40	Emerging	zero-shot-voice-synthesis	41	Python
1538	skshadan/WhisCall A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo,...	40	Emerging	voice-ai-assistants	29	Python
1539	speechio/BigCiDian Pronunciation lexicon covering both English and Chinese languages for...	40	Emerging	multilingual-speech-datasets	262	Python
1540	mapluisch/OpenAI-Text-To-Speech-for-Unity Implementation of OpenAI's Text-To-Speech in Unity. Synthesize any text and...	40	Emerging	dotnet-tts-libraries	82	C#
1541	rapidaai/rapida-go Open-source Golang SDK for Rapida to build real-time, observable Voice AI...	40	Emerging	go-tts-libraries	2	Go
1542	robmsmt/ASR-Audio-Data-Links A list of publicly available audio data that anyone can download for ASR...	40	Emerging	speech-corpora-datasets	231	Shell
1543	soupslurpr/Transcribro Private and on-device speech recognition keyboard and service for Android.	40	Emerging	android-speech-apps	683	Kotlin
1544	Hritikraj8804/Autotube 🤖 Automated YouTube Shorts creation using n8n, AI script generation, and...	40	Emerging	ai-video-generation	18	Python
1545	foamliu/Listen-Attend-Spell-v2 PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).	40	Emerging	conformer-asr-implementations	39	Shell
1546	eellak/gsoc2019-sphinx Creation of an online Greek mail dictation system, using Sphinx and...	40	Emerging	web-speech-api-libraries	21	Python
1547	FaceOnLive/Spleeter-Android-iOS On-device, Offline Spleeter Solution For Mobile	40	Emerging	audio-source-separation	224	Java
1548	DmitryRyumin/INTERSPEECH-2023-24-Papers INTERSPEECH 2023-2024 Papers: A complete collection of influential and...	40	Emerging	automatic-speech-recognition	686	—
1549	zw76859420/ASR_Syllable Research on acoustic models for speech recognition based on convolutional neural networks	40	Emerging	ctc-asr-implementations	181	Python
1550	pymike00/YouTube-Tutorials :open_file_folder: Source Code for (some of) the Programming Tutorials from...	40	Emerging	voice-ai-learning-collections	34	Python
1551	alan890104/sumi Sumi — Free, open-source voice dictation for macOS. Local-first Whisper +...	40	Emerging	audio-transcription-tools	9	Rust
1552	hcy71o/SNAC Unofficial Pytorch implementation of SNAC: Speaker-normalized affine...	40	Emerging	tacotron-tts-models	57	Python
1553	atakanakin/TutunSabri He is not our hero. He is a silent guardian. A watchful protector.	40	Emerging	telegram-voice-transcription	12	Python
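1554	Warma10032/easytts Building the simplest collection of TTS front ends and the simplest audiobook production workflow. Splits novels into sentences with regex rules and identifies the speaker of each dialogue line with RoBERTa, enabling one-click generation of multi-speaker...	40	Emerging	gradio-tts-webuis	147	Python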
1555	zh217/torch-asg Auto Segmentation Criterion (ASG) implemented in pytorch	40	Emerging	end-to-end-asr-frameworks	51	C++
1556	tristan-mcinnis/Multimodal-voice-assistant This project is a multi-modal AI voice assistant that uses LM Studio, OpenAI...	40	Emerging	local-voice-assistants	9	Python
1557	Igorcbraz/Calculadora 📐 Simple, intuitive calculator with support for voice commands and themes...	40	Emerging	voice-controlled-calculators	33	JavaScript
1558	apluka34/Bud500 Bud500: A Comprehensive Vietnamese ASR Dataset	40	Emerging	multilingual-speech-datasets	69	—
1559	WeiChiaChang/happy-halloween 🗣 Say "happy halloween" to your browser 🎃	40	Emerging	web-speech-api-libraries	14	JavaScript
1560	markmiddo/synthia AI-powered voice assistant that respects your privacy. Control your desktop,...	40	Emerging	local-voice-assistants	4	Python
1561	FedericaPaoli1/stm32-speech-recognition-and-traduction stm32-speech-recognition-and-traduction is a project developed for the...	40	Emerging	wake-word-detection	39	C
1562	marytts/gradle-marytts-voicebuilding-plugin A replacement for the legacy VoiceImportTools in MaryTTS	40	Emerging	java-tts-libraries	16	Groovy
1563	lokkelvin2/tacotron2-tts-GUI Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom...	40	Emerging	tacotron-tts-models	37	Python
1564	AcTePuKc/Kokoro-Local-Gui Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included.	40	Emerging	kokoro-tts-ecosystem	19	Python
1565	grebtsew/Text_To_Speech_Server_Node A super simple speaking server node that receives requests and reads them...	40	Emerging	self-hosted-tts-servers	1	Python
1566	Allan-Nava/fakeyou.go A powerful golang sdk library for interacting with the FakeYouAPI easily	40	Emerging	go-tts-libraries	2	Go
1567	Jdreioe/Wingmate A project to make people who cannot speak, speak!	40	Emerging	android-speech-apps	2	Kotlin
1568	vkosuri/dialogflow-lite [Maintainer Required] A light-weight python library REST agent for Dialogflow	40	Emerging	voice-command-assistants	2	Python
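1569	yeyupiaoling/VITS-Pytorch A PyTorch-based speech synthesis project using VITS, a speech synthesis method. This end-to-end model is very simple to use and needs no overly complex steps such as text alignment,...	40	Emerging	vits-tts-implementations	55	Python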
1570	user3301/ssml_builder :sound: a general SSML(Speech Synthesis Markup Language) builder	40	Emerging	aws-polly-tts	10	Python
1571	sunshine0523/MNNServer A third-party MNN server supporting external calls, embedding model, TTS...	40	Emerging	llm-docker-deployments	149	C++
1572	pschatzmann/arduino-espeak-ng eSpeak NG is an open source speech synthesizer that supports more than...	40	Emerging	espeak-ng-ecosystem	43	C
1573	FlooferLand/ttvoice-mod A Minecraft mod that lets you type to speak!	40	Emerging	dotnet-tts-libraries	4	Kotlin
1574	shahules786/mayavoz Pytorch based speech enhancement toolkit.	40	Emerging	speaker-diarization-embedding	336	Python
1575	daanzu/speech-training-recorder Simple GUI application to help record audio dictated from given text...	40	Emerging	speech-recognition-apis	41	Python
1576	maum-ai/nuwave2 NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling...	40	Emerging	audio-noise-reduction	307	Python
1577	ShadowForests/VoiceToSpeech Live speech recognition to synthesized speech with hundreds of voices, TTS,...	40	Emerging	web-speech-api-tts	44	JavaScript
1578	sophiefy/StellaVoiceChanger Deep-learning-based voice changer, supporting local inference.	40	Emerging	text-to-speech-frameworks	96	Python
1579	weimeng23/speech-recognition-learning-resources :white_check_mark: A list of speech recognition learning resources including...	40	Emerging	speaker-diarization-embedding	68	—
1580	felivalencia3/RealVoiceGPT RealVoiceGPT is a web application that lets you have voice conversations...	40	Emerging	voice-chatgpt-interfaces	29	JavaScript
1581	itspyguru/Tkinter-Applications A collection of small tkinter apps made by me	40	Emerging	voice-ai-learning-collections	32	Python
1582	Adamiito0909/mlx-swift-audio 🎤 Enhance your apps with MLX Swift Audio, offering robust text-to-speech and...	40	Emerging	wake-word-detection	5	Swift
1583	reybahl/Assistant A machine learning powered, voice-based virtual assistant for Raspberry Pi....	40	Emerging	general-purpose-voice-assistants	33	Python
1584	smx-smx/KodiSharp Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono	40	Emerging	dotnet-tts-libraries	31	C#
1585	1ytic/pytorch-edit-distance Levenshtein edit-distance on PyTorch and CUDA	40	Emerging	end-to-end-asr-frameworks	93	Cuda
1586	MattePalte/Verbify-TTS Simple and free Text-to-Speech (TTS) engine that reads for you any text on...	40	Emerging	lightweight-tts-libraries	135	Python
1587	aks-devs/mod_google_asr Freeswitch Speech-to-Text module	40	Emerging	vosk-asr-implementations	4	C
1588	TeaPoly/Conformer-Athena Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.	40	Emerging	conformer-asr-implementations	44	Python
1589	andi611/TTS-Tacotron-Pytorch Pytorch implementation of Tacotron, a speech synthesis end-to-end generative...	40	Emerging	tacotron-tts-models	29	Python
1590	pviotti/sayit A text-to-speech command line tool backed by Azure Cognitive Services.	40	Emerging	dotnet-tts-libraries	19	F#
1591	hyeonsangjeon/computing-Korean-STT-error-rates Python function package that computes the character error rate (CER) and word error rate (WER) of output scripts from Korean STT sentence recognizers	40	Emerging	asr-evaluation-metrics	68	Python
1592	LetovKai/call-translator Real-time voice translator for video calls. Speak your language on Google...	40	Emerging	tts	10	Rust
1593	TigreGotico/phoonnx A Python library for multilingual phonemization and Text-to-Speech (TTS)...	40	Emerging	lightweight-tts-runtimes	20	Python
1594	shi-gg/Auditional-Text The source code of the Auditional Text Discord bot	40	Emerging	discord-tts-bots	5	TypeScript
1595	double22a/asr_nlp_paper_code Papers of ASR, Tools of ASR	40	Emerging	text-to-speech-frameworks	41	—
1596	johunsang/octo-captures Everything for screen recording: a free, open-source macOS app supporting Auto Zoom, avatars, voice modulation, BGM, and timeline editing....	40	Emerging	—	43	JavaScript
1597	racai-ai/RobinASR Romanian Automatic Speech Recognition from the ROBIN project	40	Emerging	automatic-speech-recognition	31	Python
1598	abus-aikorea/kara-audio Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports...	40	Emerging	speech-to-text-converters	67	Python
1599	bnsantoso/sub-to-audio Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS...	40	Emerging	whisper-subtitle-generation	121	Python
1600	dusty-nv/jetson-voice ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch...	40	Emerging	lightweight-tts-runtimes	224	Python
