All Voice AI Tools

8,165 tools ranked by quality score · Page 19 of 82

Showing 1801–1900 of 8,165


#	Tool	Score	Tier	Category	Stars	Language
1801	evilC/HotVoice Adds Speech Recognition support to AutoHotkey, via a C# DLL	38	Emerging	dotnet-tts-libraries	66	AutoHotkey
1802	ElmTran/praises Praises is a text-to-speech tool that can help you read text easily.	38	Emerging	web-speech-api-tts	269	TypeScript
1803	falabrasil/kaldi-br ☕🇧🇷 Scripts for Kaldi in Brazilian Portuguese	38	Emerging	kaldi-asr-ecosystem	58	Shell
1804	Proteusiq/saa Making Time Speak! 🎙️	38	Emerging	temporal-expression-parsing	29	Python
1805	mxvsh/wave Native macOS dictation app focused on fast voice-to-text workflows.	38	Emerging	local-voice-dictation	2	C++
1806	eminemahjoub/pdf-voice-reader "PDF Reader: A Python application for seamless PDF viewing with enhanced...	38	Emerging	pdf-to-audio-conversion	13	Python
1807	noco-ai/spellbook-docker AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many...	38	Emerging	multi-modal-ai-assistants	168	Shell
1808	lars76/fastspeech2-clean Clean and modernized implementation of FastSpeech2/LightSpeech using IPA	38	Emerging	fastspeech-tts-models	18	Python
1809	CMsmartvoice/One-Shot-Voice-Cloning :relaxed: One Shot Voice Cloning based on Unet-TTS	38	Emerging	voice-cloning-tools	245	Jupyter Notebook
1810	ckaytev/tgisper Telegram bot with ASR	38	Emerging	telegram-voice-transcription	22	Python
1811	1038lab/ComfyUI-MegaTTS A ComfyUI custom node based on ByteDance MegaTTS3, enabling high-quality...	38	Emerging	comfyui-tts-nodes	49	Python
1812	soldier444xd/KittenTTS KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with...	38	Emerging	gradio-tts-webuis	24	Python
1813	mdingena/att-voodoo A community-made magic mod for A Township Tale, a VR MMORPG game.	38	Emerging	dotnet-tts-libraries	8	TypeScript
1814	Citadawn/VoiceDAO 语道 (VoiceDAO) - An Android app focused on text-to-speech	38	Emerging	java-tts-libraries	1	Java
1815	telecombcn-dl/2018-dlsl UPC Deep Learning for Speech and Language 2018	38	Emerging	speech-ai-coursework	17	—
1816	CarrotYuan/openclaw-voice-control A macOS local voice-control companion for OpenClaw with Siri-like wakeword...	38	Emerging	openclaw-voice-assistants	4	Python
1817	paladini/voice-separator-demucs A simple and efficient self-hosted application to separate vocals from music...	38	Emerging	audio-source-separation	23	Python
1818	deepgram-devs/dg-translation-chrome-ext A TypeScript chrome extension that uses Deepgram to provide live...	38	Emerging	deepgram-starter-projects	11	HTML
1819	andi611/CS-Tacotron-Pytorch Pytorch implementation of CS-Tacotron, a code-switching speech synthesis...	38	Emerging	tacotron-tts-models	23	Python
1820	AndroidCodility/SpeechToText Android application to text through which you can provide speech input to...	38	Emerging	android-speech-apps	18	Kotlin
1821	HelloChatterbox/py_responsivevoice Unofficial Python API for ResponsiveVoice	38	Emerging	espeak-ng-ecosystem	16	Python
1822	GloomyGrave/Sinsy-NG (discontinued) 🎵The Formant-Based All Language Singing Voice Synthesis...	38	Emerging	espeak-ng-ecosystem	21	C++
1823	OpenVoiceOS/ovos-tts-plugin-beepspeak experiment adding new r2d2 tts engine for mycroft	38	Emerging	espeak-ng-ecosystem	4	Python
1824	leduckhai/wav2graph wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech	38	Emerging	graph-database-rag	95	Python
1825	QuantiusBenignus/BlahST Input text from speech in any Linux window, the lean, fast and accurate way,...	38	Emerging	conversational-chatbot-applications	167	Shell
1826	SpeechColab/Leaderboard SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform...	38	Emerging	automatic-speech-recognition	541	Python
1827	alam025/ai-voice-assistant-appointment-booking Enterprise-grade AI voice assistant for automated appointment scheduling...	38	Emerging	voice-agent-applications	24	Python
1828	Kyubyong/specAugment Tensor2tensor experiment with SpecAugment	38	Emerging	tacotron-tts-models	46	Python
1829	AA-Factory/aafactory-prototype ⚡ AI Avatar Factory is an interface for creating and managing AI avatars. ⚡	38	Emerging	ai-avatar-platforms	62	Python
1830	xingchensong/Speech-Transformer-tf2.0 Transformer for an ASR system (via TensorFlow 2.0)	38	Emerging	end-to-end-asr-frameworks	114	Python
1831	asiff00/Training-TTS Train and fine-tune text-to-speech models for Bengali and many other languages!	38	Emerging	tts-model-finetuning	18	Jupyter Notebook
1832	AI-TOOLKIT/VoiceBridge VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit	38	Emerging	lightweight-tts-runtimes	17	C++
1833	funway/audible-epub3-maker Generate audiobooks from plain EPUB files in EPUB 3 Media Overlays format...	38	Emerging	ebook-to-audiobook-conversion	15	Python
1834	iceychris/LibreASR :speech_balloon: An On-Premises, Streaming Speech Recognition System	38	Emerging	voice-cloning-synthesis	682	Python
1835	instavar/qwen3-tts-lora-finetuning Qwen3‑TTS LoRA fine‑tuning tools (companion repo) for custom voice adaptation	38	Emerging	qwen3-tts-applications	2	Shell
1836	ondrejklejch/learning_to_adapt Coordinate-wise meta-learner for speaker adaptation of ASR models.	38	Emerging	end-to-end-asr-frameworks	20	Python
1837	fcjr/ltts Quick CLI for local text-to-speech using Qwen3-TTS or Kokoro TTS.	38	Emerging	qwen3-tts-applications	8	Python
1838	Harsh-0-7/PDF-Reader PDF reader with read aloud feature	38	Emerging	ai-powered-ereaders	8	JavaScript
1839	siddhant-vij/Health-Fitness-Tracker Health & fitness app with natural language processing, custom...	38	Emerging	ai-tutoring-platforms	9	Python
1840	gkrsv/split_audio A rough and ready Python utility which splits audio files based on silence...	38	Emerging	speech-to-text-transcription	16	Python
1841	scarletcho/prep4kaldi Data preparation code for building Kaldi ASR system	38	Emerging	kaldi-asr-ecosystem	14	Python
1842	ayshrv/memento-app Android App which serves as an AI assistant for human memory	38	Emerging	android-voice-assistants	15	Java
1843	krestaino/prankstr 📞 Prank your friends with text-to-speech phone calls powered by Twilio and...	38	Emerging	ai-tutoring-platforms	21	JavaScript
1844	sskorol/vosk-api-gpu Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC	38	Emerging	vosk-asr-implementations	45	Shell
1845	bedriyan/speaky Voice-to-text for macOS, powered by on-device AI. Press a hotkey, speak, and...	38	Emerging	local-voice-dictation	5	Swift
1846	jbmiller10/transcribrr Transcribrr is a python desktop gui application that uses transcribes ...	38	Emerging	audio-transcription-tools	4	Python
1847	tochilkinva/tg_bot_stt_tts Telegram bot with voice message recognition and generation. Speech to Text...	38	Emerging	telegram-voice-transcription	68	Python
1848	naeruru/mimiuchi a free, customizable, osc capable speech-to-text interface for relaying text...	38	Emerging	dotnet-tts-libraries	60	TypeScript
1849	JSON2Video/json2video-php-sdk Video automation with PHP: add watermarks, resize videos, create slideshows,...	38	Emerging	ai-video-generation	25	PHP
1850	kroko-ai/kroko-onnx Kroko ASR - Speech-to-text	38	Emerging	funasr-speech-recognition	138	C++
1851	aiola-lab/drax Drax: Speech Recognition with Discrete Flow Matching	38	Emerging	zero-shot-voice-synthesis	75	Python
1852	taresh18/orpheus-streaming Orpheus TTS Server with streaming support (TTFB ~160ms)	38	Emerging	gradio-tts-webuis	24	Python
1853	HawkAaron/RNN-Transducer MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction...	38	Emerging	end-to-end-asr-frameworks	139	Python
1854	amadeomano/persian-tts 🔊 A simple human-based text-to-speech synthesiser and ReactNative app for...	38	Emerging	web-speech-api-libraries	24	JavaScript
1855	kaiaai/kaia.js Kaia.ai platform's JS client library	38	Emerging	google-tts-libraries	1	TypeScript
1856	rxlabz/sytody a Flutter "speech to todo" app example	38	Emerging	educational-voice-apps	82	Dart
1857	ericc-ch/edge-tts Use Microsoft Edge's online text-to-speech service from JS code directly!	38	Emerging	edge-tts-implementations	16	TypeScript
1858	hutchresearch/latex2speech TeX2Speech is an application that turns LaTeX documents into spoken audio.	38	Emerging	pdf-to-audio-conversion	19	Python
1859	BraceYourselfGames/UE-BYGTextToSpeech A plugin that uses the Windows Speech API to speak text in Unreal Engine 4.	38	Emerging	dotnet-tts-libraries	22	C++
1860	sexfrance/RecaptchaV2-Solver A Python-based solution for solving Google's reCAPTCHA v2 challenges...	38	Emerging	ibm-watson-speech	38	Python
1861	UFOAlastor/AI-Waifu-Project-LaIN An AI Waifu client with long-term memory, expressions and motions, voice conversation/interruption/voiceprint recognition, FunctionCall, and multi-model support.	38	Emerging	interactive-ai-avatars	26	Python
1862	AsaoluElijah/say-it A mobile web application that helps you convert spoken words to...	38	Emerging	web-speech-api-libraries	21	HTML
1863	Ronik22/Voice-Controlled-Email A python-based voice-controlled email application for visually impaired persons.	38	Emerging	voice-controlled-robotics	15	Python
1864	ng-web-apis/speech A library for using Web Speech API with Angular	38	Emerging	web-speech-api-libraries	33	TypeScript
1865	zalo/OpenAI-Voice A simple proof of concept for voice-to-voice interaction.	38	Emerging	voice-chatgpt-interfaces	9	JavaScript
1866	dokterbob/macos-speech-server Local, fast and efficient Speech to Text (STT) and Text to Speech (TTS) on...	38	Emerging	local-voice-dictation	11	Swift
1867	lcraver/ProxiTalk This is the repo for ProxiTalk OS. ProxiTalk is a custom operating system...	38	Emerging	self-hosted-tts-servers	7	Python
1868	aidayang/LatentSync-OneClick One-click launch bundle for LatentSync, a free video lip-sync tool	38	Emerging	speech-synthesis-diffusion	28	—
1869	bhashini-ai/bhashini-api-examples Sample programs for calling Bhashini.ai REST/WebSocket APIs - TTS, STT/ASR,...	38	Emerging	speech-recognition-apis	1	Python
1870	mozilla/deepspeech-playbook A crash course for training speech recognition models using DeepSpeech.	38	Emerging	ctc-asr-implementations	24	—
1871	Fooftilly/kokoro-extension Send text from browser to Kokoro-FastAPI for TTS generation	38	Emerging	kokoro-tts-ecosystem	2	JavaScript
1872	Better-Player/espeakng-sys Rust bindings to eSpeak NG	38	Emerging	rust-tts-libraries	13	C
1873	cristofima/AI-Tech-Interview-Preparation An AI-powered technical interview preparation platform that generates...	38	Emerging	ai-interview-simulators	2	TypeScript
1874	karrarkazuya/ArabicTTS ArabicTTS (TextToSpeech) Android library with a sample	38	Emerging	android-speech-apps	16	Java
1875	HCI-LAB-UGSPEECHDATA/speech_data_ghana_ug The dataset comprises a 5000-hour speech corpus in Akan, Ewe, Dagbani,...	38	Emerging	llm-scaling-architecture	11	HTML
1876	hcy71o/SC-CNN SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker...	38	Emerging	zero-shot-voice-synthesis	39	Python
1877	Frida7771/PyVoice A Python-based speech processing tool that supports both speech-to-text...	38	Emerging	coqui-tts-applications	3	Python
1878	speechsuper/SpeechSuper-API-Samples Deep learning based speech and pronunciation assessment API for 8 languages.	38	Emerging	dotnet-tts-libraries	60	C#
1879	botbahlul/whisper_autosrt A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using...	38	Emerging	whisper-subtitle-generation	29	Python
1880	IBM/text-to-speech-code-pattern WARNING: This repository is no longer maintained	38	Emerging	google-tts-libraries	14	JavaScript
1881	wannaphong/KhanomTan-TTS-v1.0 KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that...	38	Emerging	lightweight-tts-runtimes	43	Python
1882	sciforce/phones-las Articulatory features estimation using Listen Attend and Spell architecture.	38	Emerging	conformer-asr-implementations	33	Python
1883	sayak-brm/espeakng-python An eSpeak NG TTS binding for Python3.	38	Emerging	espeak-ng-ecosystem	15	Python
1884	henry-richard7/Natural-Text-to-Speech This python program uses https://naturaltts.com API to convert given text to...	38	Emerging	lightweight-tts-libraries	19	Python
1885	manhph2211/ViSR This repo builds an end-to-end deep learning application that supports...	38	Emerging	end-to-end-asr-frameworks	38	Jupyter Notebook
1886	AkishinoShiame/Chinese-Speech-Emotion-Datasets Datasets of A Deep Convolutional Neural Network Based Virtual Elderly...	38	Emerging	speech-emotion-recognition	38	—
1887	jenswittmann/CurlyFramework Tiny Framework for accessibility and sustainability, not only for MODX or Kirby CMS.	38	Emerging	tts	10	HTML
1888	tmanderson/ivona-node Ivona Cloud (via Amazon services) client library for Node	38	Emerging	google-tts-libraries	31	JavaScript
1889	HnDK0/NoveLA Free Android reader for web novels, light novels, ranobe & EPUB. 25+...	38	Emerging	ai-powered-ereaders	8	Kotlin
1890	npuichigo/ttsflow tensorflow speech synthesis c++ inference for voicenet	38	Emerging	lightweight-tts-runtimes	16	C++
1891	andi611/ZeroSpeech-TTS-without-T A Pytorch implementation for the ZeroSpeech 2019 challenge.	38	Emerging	fastspeech-tts-models	112	Python
1892	askrella/speech-rest-api Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)	38	Emerging	whisper-transcription-apps	99	Python
1893	alan-ai/alan-sdk-reactnative The Self-Coding System for Your App — Alan AI SDK for React Native	38	Emerging	voice-command-assistants	584	Ruby
1894	nexmo-community/voice-azure-speechtotext-py Sample Code for Realtime Transcription using Nexmo, Microsoft Azure Speech...	38	Emerging	dotnet-tts-libraries	10	Python
1895	i4Ds/whisper-prep Data preparation utility for the finetuning of OpenAI's Whisper model.	38	Emerging	speech-to-text-transcription	11	Python
1896	Deepak5j/PyTranscriber Speech to Text	38	Emerging	speech-recognition-apis	18	Python
1897	persiandataset/PersianSpeech Persian ASR dataset	37	Emerging	persian-speech-ai	42	—
1898	asmith26/speech2caret Use your speech to write to the current caret position!	37	Emerging	text-to-speech-conversion	3	Python
1899	masonthemaker/saidwell Open Source Voice AI Dashboard	37	Emerging	ai-chatbot-interfaces	13	TypeScript
1900	Kalebu/image-to-sound-python- A python project for converting an Image into audible sound using OCR and...	37	Emerging	image-caption-generation	68	Python
