All Voice AI Tools

8,165 tools ranked by quality score · Page 17 of 82

Showing 1601–1700 of 8,165

#	Tool	Score	Tier	Category	Stars	Language
1601	adelacvg/ttts Train the next generation of TTS systems.	40	Emerging	zero-shot-voice-synthesis	171	Python
1602	rryam/SakuraKit Swift SDK for Prototyping AI Speech Generation	40	Emerging	ios-speech-frameworks	26	Swift
1603	Ijwi-ry-Ikirundi-AI/Kirundi_Dataset 🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi...	40	Emerging	speech-recognition-datasets	7	Jupyter Notebook
1604	DrewThomasson/ebook2audiobookpiper-tts Converts ebooks into audiobooks with piper-tts	40	Emerging	ebook-to-audiobook-conversion	102	Jupyter Notebook
1605	ninjahuttjr/hal-answering-service I'm sorry, Dave. I'm afraid I can't let that spam call through. — Local AI...	39	Emerging	voice-agent-applications	9	Python
1606	1ytic/open_stt_e2e PyTorch end-to-end speech recognition	39	Emerging	end-to-end-asr-frameworks	49	Python
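1607	MuGuiLin/VoiceDictation iFlytek voice dictation WebAPI: converts speech (≤60 seconds) into the corresponding text so that machines can "understand" human language, in effect giving the machine "ears" so that it "can hear".	39	Emerging	web-speech-api-libraries	137	JavaScript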
1608	taikun114/VOICEVOX-TTS-for-Home-Assistant Custom integration for Japanese TTS using VOICEVOX in Home Assistant.	39	Emerging	home-assistant-tts	5	Python
1609	collectivat/cmusphinx-models Acoustic and language models for minorised languages.	39	Emerging	kaldi-asr-ecosystem	26	Python
1610	rhasspy/piper-samples Samples for Piper text to speech system	39	Emerging	piper-tts-ecosystem	13	JavaScript
1611	M0Rf30/shisper A quick & dirty script to generate and view subtitles and transcriptions for...	39	Emerging	whisper-subtitle-generation	16	Shell
1612	Anwarvic/RasaChatbot-with-ASR-and-TTS This repository contains an attempt to incorporate Rasa Chatbot with...	39	Emerging	voice-chatbot-applications	23	JavaScript
1613	pkozul/ha-tts-bluetooth-speaker TTS Bluetooth Speaker for Home Assistant	39	Emerging	home-assistant-tts	210	Python
1614	rcspam/dictee Push-to-talk voice dictation for Linux — 100% local, multilingual (25+...	39	Emerging	voice-dictation-typing	3	Python
1615	spokestack/spokestack-android Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS....	39	Emerging	android-speech-apps	74	Java
1616	JusperLee/Conv-TasNet Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech...	39	Emerging	audio-noise-reduction	535	Python
1617	oleges1/quartznet-pytorch QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]	39	Emerging	end-to-end-asr-frameworks	26	Jupyter Notebook
1618	Supremesujay/murf-voice-agent-starter 🎤 Build a low-latency voice agent with real-time TTS and STT, powered by...	39	Emerging	voice-agent-applications	1	Python
1619	just-ai/aimybox-ios-sdk Voice assistant SDK for iOS devices written in Swift	39	Emerging	ios-speech-frameworks	21	Swift
1620	takahi-ro/ConvivialChat This system provides the web space where text and speech coexist, and you...	39	Emerging	voice-command-assistants	3	JavaScript
1621	hariketsheth/Article_Repository_Management_System In this Tech Savvy era, with lot of advancements in the field of AI, ML, IoT...	39	Emerging	face-recognition-systems	22	PHP
1622	moutaouakkil/tts-text-to-speech Text-to-Speech (TTS) enables developers to synthesize natural-sounding...	39	Emerging	lightweight-tts-libraries	6	Python
1623	nuance-communications/mix-demo-client-azstaticwebapps Nuance Mix Demo Client for use with Azure Static Web Apps	39	Emerging	dotnet-tts-libraries	14	JavaScript
1624	WismutHansen/READ2ME Turn text from websites into spoken audio with edge-tts, F5, etc. and save...	39	Emerging	ebook-to-audiobook-conversion	48	Python
1625	TrevorS/qwen3-tts-rs Rust implementation of Qwen3-TTS speech synthesis	39	Emerging	rust-tts-libraries	111	Rust
1626	uetuluk/xcodec2-infer-lib CPU support for xcodec2	39	Emerging	zero-shot-voice-synthesis	6	Python
1627	ProperCode/Work-by-Speech Windows app which allows efficient work on a computer by speech alone.	39	Emerging	dotnet-tts-libraries	21	C#
1628	ShawnHymel/tflite-speech-recognition Demo for training a convolutional neural network to classify words and...	39	Emerging	wake-word-detection	105	Jupyter Notebook
1629	asticode/go-astibob Golang framework to build an AI that can understand and speak back to you,...	39	Emerging	go-tts-libraries	243	Go
1630	smartherd/SpeechToText Speech To Text in Android	39	Emerging	android-speech-apps	62	Java
1631	sljavi/handsfree-for-web-control-speech-recognition-module Handsfree for Web module useful to ask for start or stop listening for voice commands	39	Emerging	web-speech-api-libraries	2	JavaScript
1632	daisy/obi Obi is an open source audio book production tool that produces digital...	39	Emerging	ebook-to-audiobook-conversion	10	HTML
1633	poretsky/ru_tts Compact and portable Russian speech synthesizer	39	Emerging	espeak-ng-ecosystem	27	C
1634	uiuc-sst/asr24 24-hour Automatic Speech Recognition	39	Emerging	kaldi-asr-ecosystem	27	C++
1635	npuichigo/voicenet Speech synthesis platform based on tensorflow and sonnet	39	Emerging	lightweight-tts-runtimes	60	Makefile
1636	megaease/easevoice-trainer EaseVoice Trainer is a simple and user-friendly voice cloning and speech...	39	Emerging	tts-model-finetuning	350	Python
1637	kaieberl/paper2speech Convert any English paper or scientific book to audio	39	Emerging	text-to-speech-conversion	30	Python
1638	gauthelo/kallaama-speech-dataset A transcribed speech dataset in Wolof, Pulaar and Sereer, to support...	39	Emerging	nlp-dataset-collections	18	—
1639	SiddhantSadangi/st_deepgram_playground API playground for Deepgram built with Streamlit	39	Emerging	streamlit-tts-apps	21	Python
1640	SungFeng-Huang/Meta-TTS Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More...	39	Emerging	text-to-speech-frameworks	194	Python
1641	jorge-menjivar/super-stt Super STT enables effortless voice-to-text in any application, using the...	39	Emerging	voice-dictation-typing	44	Rust
1642	loretoparisi/htk HTK Toolkit with Linux 64 bit and Docker support	39	Emerging	kaldi-asr-ecosystem	20	C
1643	allseeteam/ai-secretary Smart assistant in Telegram bot format for transcribing online meetings	39	Emerging	meeting-transcription-summarizers	16	Python
1644	akku2005/VocalInk Next-gen open-source voice-to-blog platform with AI, TTS, gamification, and...	39	Emerging	ai-tutoring-platforms	3	JavaScript
1645	xifan2333/fcitx5-vinput Local offline voice input plugin for Fcitx5	39	Emerging	local-voice-dictation	51	C++
1646	brewusinc/Edge-TTS Edge-TTS is a Swift implementation of Microsoft Edge's Text-to-Speech (TTS)...	39	Emerging	edge-tts-implementations	23	Swift
1647	kauazin394/vibevoice.swift 🎤 Create low-latency text-to-speech on macOS with VibeVoice.swift,...	39	Emerging	qwen3-tts-applications	7	Swift
1648	art1415926535/yandex_speech Generation of speech using Yandex SpeechKit.	39	Emerging	yandex-speechkit-tools	24	Python
1649	felipefacundes/brasiltts Brasil TTS is a set of speech synthesizers in Brazilian Portuguese,...	39	Emerging	php-tts-libraries	66	HTML
1650	mostafaelaraby/Tensorflow-Keyword-Spotting Keyword spotting using various architecture like convolutional vggnet , 1D...	39	Emerging	wake-word-detection	29	Python
1651	manishdhakal/ASR-Nepali-using-CNN-BiLSTM-ResNet Automatic speech recognition for the Nepali language using CNN,...	39	Emerging	ctc-asr-implementations	24	Python
1652	royshil/cloudvocal Cloud AI live transcription and translation service plugin	39	Emerging	live-caption-generation	35	C++
1653	yuanshanhua/video-dubbing An AI-powered tool for end-to-end video translation and dubbing.	39	Emerging	video-dubbing-tools	6	Python
1654	fewieden/MMM-TTS Text-To-Speech Module for MagicMirror²	39	Emerging	web-speech-api-tts	19	JavaScript
1655	sooftware/speech-transformer Transformer implementation specialized in speech recognition tasks using PyTorch.	39	Emerging	end-to-end-asr-frameworks	65	Python
1656	tomchang25/whisper-auto-transcribe Auto transcribe tool based on whisper	39	Emerging	whisper-transcription-apps	226	Python
1657	atrzaska/VoiceStressAnalysis VoiceStressAnalysis - Detects stress in your voice	39	Emerging	stress-detection-ml	22	Java
1658	JstnMcBrd/dectalk-tts API wrapper for the Dectalk TTS system	39	Emerging	dotnet-tts-libraries	1	TypeScript
1659	OpenVoiceOS/ovos-tts-plugin-pico pico-tts-plugin	39	Emerging	espeak-ng-ecosystem	—	Python
1660	ReneTode/My-AppDaemon My apps, my helpfiles, all about AppDaemon for Home Assistant	39	Emerging	vue-speech-recognition	113	JavaScript
1661	seanhweb/Twitch-Text-to-Speech Text to speech tool for twitch	39	Emerging	twitch-chat-tts	21	HTML
1662	privapps/TTS-Mandarin Text to speech in Mandarin	39	Emerging	lightweight-tts-runtimes	15	Shell
1663	asrajeh/arabic-tts Arabic TTS ( الناطق العربي )	39	Emerging	dotnet-tts-libraries	28	Shell
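1664	6drf21e/ChatTTS_colab 🚀 One-click deployment (including an offline all-in-one package)! Based on ChatTTS, with support for streaming output, random voice-timbre sampling, long audio generation, and multi-role reading. Simple to use, with no complex installation required.	39	Emerging	self-hosted-tts-servers	2,578	Python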
1665	harisbinzia/PronouncUR PronouncUR: An Urdu Pronunciation Lexicon Generator	39	Emerging	multilingual-speech-datasets	16	Python
1666	warisqr007/vocos Causal version of Vocos (neural vocoders for high-quality audio synthesis)...	39	Emerging	neural-vocoder-implementations	2	Jupyter Notebook
1667	wangz-code/legado-tts Legado book reader app with built-in EdgeTTS read-aloud; listen to books with no extra deployment, just install and listen; the speech engine uses rany2/edge-tts...	39	Emerging	edge-tts-implementations	31	Kotlin
1668	hathibelagal-dev/str2speech An easy-to-use library and command-line tool for TTS	39	Emerging	lightweight-tts-libraries	15	Python
1669	hmartelb/speech-denoising Speech Denoising project for the Deep Learning course at Tsinghua...	39	Emerging	audio-noise-reduction	18	Jupyter Notebook
1670	saurabhshri/CCAligner 🔮 Word by word audio subtitle synchronisation tool and API. Developed under...	39	Emerging	whisper-subtitle-generation	172	C++
1671	awexandrr/audioWhisper Listen to any audio stream on your machine and print out the transcribed or...	39	Emerging	speech-to-text-converters	119	Python
1672	liuhaozhe6788/voice-cloning-collab an improved version of Real-time-voice-cloning	39	Emerging	voice-cloning-synthesis	52	Python
1673	gmltmd789/UnitSpeech An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis...	39	Emerging	fastspeech-tts-models	138	Jupyter Notebook
1674	smtiitm/Fastspeech2_MFA Indic TTS for Indian Languages: This is a project on developing...	39	Emerging	tts-model-finetuning	17	Perl
1675	mrtrizer/UnityPiper Offline text to speech inside Unity	39	Emerging	piper-tts-ecosystem	36	C#
1676	ivanvovk/compressed-tacotron2-pytorch Compressed version of Tacotron 2 using Tensor Train + Waveglow.	39	Emerging	tacotron-tts-models	22	Jupyter Notebook
1677	Yazdi9/TTS-MultiLingual Text To Speech Multilingual Support (+20 Language)	39	Emerging	speech-translation-apps	52	Python
1678	unza-speech-lab/zambezi-voice Repository for multilingual speech data resources for native languages of Zambia.	39	Emerging	speech-corpora-datasets	20	—
1679	rishikksh20/SoundStorm-pytorch Google's SoundStorm: Efficient Parallel Audio Generation	39	Emerging	text-to-speech-tts	131	Python
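1680	Executedone/Chinese-FastSpeech2 Continued training on the Biaobei (Databaker) dataset, with improvements to the original FastSpeech2 model: prosody representation and prosody prediction modules are introduced to make Chinese pronunciation more vivid and rhythmic.	39	Emerging	fastspeech-tts-models	278	Python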
1681	twn39/EdgeTTS.DotNet EdgeTTS.DotNet is a C# (.NET) library that allows you to use Microsoft...	39	Emerging	edge-tts-implementations	2	C#
1682	souvikg544/TTS_Data_Maker Text to speech is an emerging zone of AI. This repository helps to create a...	39	Emerging	tts-dataset-creation	28	Python
1683	AIFSH/ComfyUI-GPT_SoVITS A ComfyUI custom node for GPT-SoVITS: voice cloning and TTS, now in ComfyUI.	39	Emerging	comfyui-tts-nodes	249	Python
1684	hiteshsahu/Android-TTS-STT One line solution for Android Text to speech(TTS) & Speech to Text(STT)...	39	Emerging	android-speech-apps	123	Kotlin
1685	second-state/gsv_tts Streaming TTS API server written in Rust	39	Emerging	voice-ai-assistants	19	HTML
1686	harvard-edge/multilingual_kws Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus	39	Emerging	wake-word-detection	186	Jupyter Notebook
1687	llm-believer/slide-to-video A tool that converts a slide deck into a video, complete with your voice...	39	Emerging	ai-video-generation	9	Python
1688	tnicola/vue-voice Speech to text and text to speech Vue library	39	Emerging	vue-speech-recognition	21	Vue
1689	umair13adil/background_stt A flutter plugin to run always-on speech to text service in the background.	39	Emerging	educational-voice-apps	12	Kotlin
1690	SergeyShk/Speech-to-Text-Russian Project for Russian speech recognition based on pykaldi.	39	Emerging	automatic-speech-recognition	341	Python
1691	LedoKun/028-simple-queue-system A real-time, responsive queue calling system designed for TV displays,...	39	Emerging	rust-tts-libraries	1	Rust
1692	syhw/wer_are_we Attempt at tracking states of the arts and recent results (bibliography) on...	39	Emerging	ctc-asr-implementations	1,865	—
1693	espnet/interspeech2019-tutorial INTERSPEECH 2019 Tutorial Materials	39	Emerging	speech-ai-coursework	194	Jupyter Notebook
1694	usabarashi/voicevox-cli Japanese text-to-speech using VOICEVOX Core	39	Emerging	rust-tts-libraries	6	Rust
1695	DataXujing/ASR-paper :fire: ASR tutorial: https://dataxujing.github.io/ASR-paper/	39	Emerging	end-to-end-asr-frameworks	25	—
1696	westonruter/spoken-word Spoken Word	39	Emerging	web-speech-api-tts	51	JavaScript
1697	tabahi/contexless-phonemes-CUPE PyTorch model for contextless phoneme prediction from speech audio	39	Emerging	end-to-end-asr-frameworks	32	Python
1698	18F/tts-buy-bug-bounty Solicitation and acquisition documents created for the TTS Bug Bounty...	39	Emerging	government-procurement-docs	19	—
1699	VITA-Group/Audio-Lottery [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight,...	39	Emerging	end-to-end-asr-frameworks	32	Python
1700	chrisvdev/obs-chat Also known as CVTalk is a Twitch chat viewer made with React for use in OBS...	39	Emerging	twitch-chat-tts	30	JavaScript