All Voice AI Tools

8,165 tools ranked by quality score · Page 11 of 82

Showing 1001–1100 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1001	shenasa-ai/speech2text A Deep-Learning-Based Persian Speech Recognition System	44	Emerging	keyword-speech-recognition	234	Jupyter Notebook
1002	maum-ai/assem-vc Official Code for Assem-VC @ICASSP2022	44	Emerging	text-to-speech-frameworks	269	Jupyter Notebook
1003	siva-sub/NekoSpeak Private, offline AI Text-to-Speech for Android with Kokoro, KittenTTS,...	44	Emerging	kokoro-tts-ecosystem	43	Kotlin
1004	wangz-code/legado-edge-tts edge大声朗读微软TTS服务, 在阅读legado中配置语音引擎方式收听微软TTS / Edge大声朗读, 如果没有 vps 部署可以看看阅读内置...	44	Emerging	edge-tts-implementations	23	Kotlin
1005	SynHub/syn-speech Syn.Speech is a flexible speaker independent continuous speech recognition...	44	Emerging	dotnet-tts-libraries	66	C#
1006	talin190/Qwen3-TTS-Daggr-UI 🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for...	44	Emerging	qwen3-tts-applications	3	Python
1007	husniadil/cc-hooks Audio feedback plugin for Claude Code with TTS announcements, sound effects,...	44	Emerging	voice-enabled-coding-assistants	17	Python
1008	jim-schwoebel/download_audioset 📁 This repo makes it easy to download the raw audio files from AudioSet...	44	Emerging	speech-corpora-datasets	105	Python
1009	DrDroidLab/voicesummary Open Source AI Database for Voice Agent Transcripts \| Call Analysis &...	44	Emerging	voice-agent-applications	23	Python
1010	OpenMOSS/MOSS-Speech MOSS-Speech is a true speech-to-speech large language model without text guidance.	44	Emerging	voice-assistant-devices	127	Python
1011	bookbot-kids/speech-recognizer-bahasa-indonesian A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer...	44	Emerging	educational-voice-apps	12	C++
1012	cuinjune/text2video A software tool that converts text to video for more engaging learning experience	44	Emerging	text-to-video-generation	71	JavaScript
1013	yerfor/SyntaSpeech SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022;...	44	Emerging	neural-vocoder-implementations	203	Python
1014	Pikurrot/whisper-gui A simple GUI to use Whisper.	44	Emerging	whisper-speech-transcription	414	Python
1015	r0227n/flutter_whisper_kit 🎤 A Flutter plugin for running WhisperKit speech-to-text models on-device,...	44	Emerging	whisper-speech-transcription	13	Dart
1016	drmfinlay/pyjsgf JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.	44	Emerging	automatic-speech-recognition	54	Python
1017	djmango/obsidian-transcription Obsidian plugin to create high-quality transcriptions from markdown linked...	44	Emerging	edge-tts-implementations	217	TypeScript
1018	lucasnewman/f5-tts-swift Implementation of F5-TTS in Swift using MLX	44	Emerging	zero-shot-voice-synthesis	91	Swift
1019	murf-ai/murf-python-sdk Python sdk for Murf text to speech API	44	Emerging	audio-transcription-apps	109	Python
1020	holgern/kokorog2p A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS.	44	Emerging	grapheme-to-phoneme-conversion	3	Python
1021	algolia/voice-overlay-android 🗣 An overlay that gets your user’s voice permission and input as text in a...	44	Emerging	android-speech-apps	263	Kotlin
1022	BernieTv/ElevenLabs-Clone A self-hosted ElevenLabs clone for text-to-speech, voice conversion, and AI...	44	Emerging	elevenlabs-integrations	66	Python
1023	Candida18/Virtual-Assistance-For-The-Blind The proposed Voice-based Email System uses AI (voice commands) that will...	44	Emerging	voice-controlled-robotics	40	Python
1024	nixonyh/UnityTTS Text to Speech in Unity.	44	Emerging	dotnet-tts-libraries	142	C#
1025	isaiahbjork/expo-kokoro-onnx Run Kokoro TTS locally on device using Expo & ONNX Runtime	44	Emerging	kokoro-tts-ecosystem	74	TypeScript
1026	jpescada/TwitterPiBot A Python based bot for Raspberry Pi that grabs tweets with a specific...	44	Emerging	telegram-voice-transcription	89	Python
1027	mozilla-ai/speech-to-text-finetune Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language	44	Emerging	tts-model-finetuning	63	Python
1028	rishikksh20/TFGAN TFGAN: Time and Frequency Domain Based Generative Adversarial Network for...	44	Emerging	neural-vocoder-implementations	88	Python
1029	zycv/awesome-keyword-spotting This repository is a curated list of awesome Speech Keyword Spotting...	44	Emerging	wake-word-detection	283	—
1030	tonesto7/echo-speaks Integrate your Amazon Echo devices into your Hubitat environment to create...	44	Emerging	voice-controlled-robotics	113	Groovy
1031	travisvn/edge-tts-extension Chrome extension to generate free, high-quality text-to-speech using...	44	Emerging	browser-tts-extensions	69	TypeScript
1032	Amirrezahmi/Zozo-Assistant Zozo Assistant is a voice-activated chatbot that performs tasks based on...	44	Emerging	general-purpose-voice-assistants	62	Python
1033	Berkeley-Speech-Group/sylber Sylber: Syllabic Embedding Representation of Speech from Raw Audio	44	Emerging	speaker-diarization-embedding	74	Jupyter Notebook
1034	verbio-technologies/python-verbio-speech-center Python integration with the Verbio Speech Center Cloud....	44	Emerging	speech-recognition-apis	8	Python
1035	kosich/rxjs-tts RxJS wrapper for Text-to-Speech Web API	44	Emerging	web-speech-api-tts	9	TypeScript
1036	ttaoREtw/Tacotron-pytorch A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model	44	Emerging	tacotron-tts-models	110	Python
1037	pufanyi/GenderRecognitionByVoice NTU SC1015 Group Project - Gender Recognition by Voice	44	Emerging	facial-attribute-classification	5	HTML
1038	matteo-convertino/vosk-build-model How to create your own model for vosk	44	Emerging	voice-cloning-synthesis	75	Shell
1039	hirofumi0810/asr_preprocessing Python implementation of pre-processing for End-to-End speech recognition	44	Emerging	end-to-end-asr-frameworks	69	Python
1040	apaar97/translate Android app to translate text conversations, supporting 90+ languages with...	44	Emerging	android-speech-apps	59	Java
1041	momysnow/Momy-Desk-Robot Smart desktop robot.	44	Emerging	voice-controlled-robotics	90	C
1042	CheshireCC/faster-whisper-GUI faster_whisper GUI with PySide6	44	Emerging	speech-to-text-converters	2,911	Python
1043	m3hrdadfi/soxan Wav2Vec for speech recognition, classification, and audio classification	44	Emerging	wav2vec2-asr-models	273	Jupyter Notebook
1044	Azure-Samples/sonic-brief Sonic Brief Project is an Azure-based system that transcribes and...	44	Emerging	dotnet-tts-libraries	35	TypeScript
1045	JosefAlbers/e2tts-mlx Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX	44	Emerging	zero-shot-voice-synthesis	29	Python
1046	seven-io/home-assistant HACS supporting Home Assistant integration for seven	44	Emerging	home-assistant-tts	3	Python
1047	Aratako/MioTTS-Inference Inference server for MioTTS, a lightweight and fast LLM-based TTS model.	44	Emerging	llm-inference-serving	109	Python
1048	resemble-ai/resemble-alexa This is sample code for an Alexa skill that uses realistic voice cloning...	44	Emerging	voice-cloning-synthesis	87	Python
1049	Justmalhar/open-audio Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech...	44	Emerging	voice-ai-assistants	95	JavaScript
1050	n0th1ng-else/voice-to-text-bot Telegram bot that converts Voice messages into text	44	Emerging	telegram-voice-transcription	8	TypeScript
1051	vieledatengutedaten/better-teletask-extension Browser extension that adds useful features like subtitles to HPI Tele-Task.	44	Emerging	stt	4	JavaScript
1052	ycyy/faster-whisper-webui a gradio webui for faster whisper	44	Emerging	speech-to-text-converters	275	Python
1053	syntithenai/hermod voice services stack from audio hardware through hotword, ASR, NLU, AI...	44	Emerging	local-voice-assistants	93	Python
1054	subho406/TF-Speech-Recognition-Challenge-Solution Source code of the model used in Tensorflow Speech Recognition Challenge...	44	Emerging	keyword-speech-recognition	58	Jupyter Notebook
1055	iamjanvijay/rnnt_decoder_cuda An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.	44	Emerging	end-to-end-asr-frameworks	67	Cuda
1056	bytectlgo/edge-tts Edge TTS is a command-line tool based on Microsoft Edge's text-to-speech...	44	Emerging	edge-tts-implementations	6	Go
1057	Gr122lyBr/voicetag Speaker identification powered by pyannote and resemblyzer	44	Emerging	speech-to-text-transcription	32	Python
1058	just-ai/aimybox-android-sdk Voice assistant SDK for Android	44	Emerging	android-voice-assistants	93	Kotlin
1059	am-sokolov/videodubber The program for automatic dubbing any video file for a lot of languages.	44	Emerging	video-dubbing-tools	85	Python
1060	nl8590687/ASRT_SDK_Java ASRT Speech Recognition SDK for Java. 用于ASRT语音识别系统的Java SDK	44	Emerging	java-tts-libraries	53	Java
1061	ShaerWare/AI_Secretary_System 📞 Локальный AI-секретарь, тех. поддержка и менеджер по продажам с...	44	Emerging	voice-ai-agents	5	Python
1062	PABannier/bark.cpp Suno AI's Bark model in C/C++ for fast text-to-speech generation	44	Emerging	voice-cloning-synthesis	857	C++
1063	botbahlul/vosk_autosrt A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using...	44	Emerging	whisper-subtitle-generation	11	Python
1064	huschen/kaggle_speech_recognition Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.	44	Emerging	ctc-asr-implementations	72	Python
1065	igorshmukler/kokoro-ruslan Kokoro Language Model Training Script for Russian (Ruslan Corpus)	44	Emerging	kokoro-tts-ecosystem	39	Python
1066	rajkishorbgp/JARVIS-AI-Assistant JARVIS AI Assistant 🤖 A virtual assistant project inspired by Tony Stark's...	44	Emerging	python-voice-assistants	42	Python
1067	byhow/yanyu A Text-to-Speech node package with pinyin audio library.	44	Emerging	google-tts-libraries	9	TypeScript
1068	mobilepadawan/Speakit-JS Elevate your web applications with the power of JavaScript speech synthesis.	43	Emerging	web-speech-api-tts	13	JavaScript
1069	bakaburg1/minutemaker Generate meeting minutes starting from an audio recording or a transcripts...	43	Emerging	meeting-transcription-summarizers	21	R
1070	BobRandomNumber/ComfyUI-DiaTTS ComfyUI Dia safetensors implementation	43	Emerging	comfyui-tts-nodes	7	Python
1071	huakunyang/SummerTTS SummerTTS...	43	Emerging	lightweight-tts-runtimes	524	C++
1072	ryanleary/patter speech-to-text in pytorch	43	Emerging	end-to-end-asr-frameworks	82	Python
1073	beyondwords-io/wordpress-plugin BeyondWords is the AI voice platform that brings frictionless audio...	43	Emerging	google-tts-libraries	2	PHP
1074	keonlee9420/VAENAR-TTS PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based...	43	Emerging	tacotron-tts-models	73	Python
1075	caizexin/tf_multispeakerTTS_fc the Tensorflow version of multi-speaker TTS training with feedback constraint	43	Emerging	fastspeech-tts-models	40	Python
1076	gladiaio/normalization A lightweight library for normalizing speech transcripts before computing WER	43	Emerging	text-normalization-engines	10	Python
1077	asticode/go-astideepspeech Golang bindings for Mozilla's DeepSpeech speech-to-text library	43	Emerging	go-tts-libraries	182	Go
1078	andresayac/edge-tts-php Edge TTS is a PHP package that allows access to the online text-to-speech...	43	Emerging	edge-tts-implementations	15	PHP
1079	jianchang512/zh_recogn 将音频或视频中的中文语音识别并导出为srt字幕，基于魔塔社区Paraformer模型	43	Emerging	automatic-speech-recognition	116	Python
1080	mgonzs13/tts_ros Text-to-Speech for ROS 2	43	Emerging	lightweight-tts-libraries	21	Python
1081	lukaszliniewicz/Pandrator Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos...	43	Emerging	ai-podcast-generation	541	Python
1082	metame-ai/awesome-audio-plaza Daily tracking of awesome audio papers, including music generation,...	43	Emerging	voice-ai-learning-collections	411	—
1083	Kardbord/hfapigo Unofficial (Golang) Go bindings for the Hugging Face Inference API	43	Emerging	go-tts-libraries	63	Go
1084	nodef/extra-amazontts Generate speech audio from super long text through machine (via "Amazon...	43	Emerging	aws-polly-tts	5	JavaScript
1085	Sgvkamalakar/Azure-Talking-Avatar Explore the power of Azure Text-to-Speech with interactive talking avatar,...	43	Emerging	ai-avatar-platforms	40	Python
1086	agentvoiceresponse/avr-tts-elevenlabs This repository demonstrates the integration between Agent Voice Response...	43	Emerging	deepgram-starter-projects	3	JavaScript
1087	mgonzs13/piper_ros piper Text-to-Speech for ROS 2	43	Emerging	piper-tts-ecosystem	6	C++
1088	hi-paris/Prosody-Control-French-TTS An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control	43	Emerging	zero-shot-voice-synthesis	31	Python
1089	Jakobovski/free-spoken-digit-dataset A free audio dataset of spoken digits. An audio version of MNIST.	43	Emerging	speech-recognition-datasets	667	Python
1090	meemalabs/laravel-text-to-speech 💬 A wrapper for popular TTS services to create a more simple & uniform API....	43	Emerging	aws-polly-tts	42	PHP
1091	cdimascio/watson-html5-speech-recognition Speech Recognition for Browsers via Webkit, HTML5, and Watson	43	Emerging	web-speech-api-libraries	4	JavaScript
1092	mush42/sonata A cross-platform inference engine for neural TTS models.	43	Emerging	rust-tts-libraries	73	Rust
1093	bjoernkarmann/project_alias Alias is a teachable “parasite” that is designed to give users more control...	43	Emerging	voice-assistant-applications	1,701	Python
1094	agan-j/xiaoniu 小牛视频翻译是一款支持本地视频翻译、字幕翻译和 YouTube 视频翻译下载的 AI...	43	Emerging	video-dubbing-tools	326	—
1095	p0p4k/pflowtts_pytorch Unofficial implementation of NVIDIA P-Flow TTS paper	43	Emerging	text-to-speech-frameworks	230	Python
1096	xxbb1234021/speech_recognition 中文语音识别	43	Emerging	keyword-speech-recognition	848	Python
1097	garvys-org/rustfst Rust re-implementation of OpenFST - library for constructing, combining,...	43	Emerging	rust-tts-libraries	180	Rust
1098	devnen/Kitten-TTS-Server Self-host the ultra-lightweight Kitten TTS model with this enhanced API...	43	Emerging	gradio-tts-webuis	246	Python
1099	sdip15fa/safecantonese.ai.app Free, open-source, offline, safe and secure AI Cantonese transcription, in...	43	Emerging	speech-to-text-converters	19	TypeScript
1100	algolia/voice-overlay-ios 🗣 An overlay that gets your user’s voice permission and input as text in a...	43	Emerging	ios-speech-frameworks	556	Swift

« Prev 1 2 3 … 9 10 11 12 13 … 80 81 82 Next »