All Voice AI Tools

8,165 tools ranked by quality score · Page 21 of 82

Showing 2001–2100 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
2001	brailcom/speechd-el Emacs speech and Braille output interface	37	Emerging	cross-platform-tts-frameworks	14	Emacs Lisp
2002	Julia-Roman/pepega-tts Discord bot for Google and Polly Text-to-Speech	37	Emerging	discord-tts-bots	10	JavaScript
2003	01-vyom/End_2_End_Automatic_Speech_Recognition_For_Gujarati [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition...	37	Emerging	ctc-asr-implementations	13	Python
2004	Abhishek-op/SR 💡Kivy-android speech recognition	37	Emerging	automatic-speech-recognition	15	Python
2005	IndieCoderMM/smart-one-ai 🤖 AI assistant that can listen to user input and provide responses. It...	37	Emerging	voice-controlled-desktop-automation	16	Python
2006	soniqo/speech-android On-device speech SDK for Android — ASR, TTS, VAD, and noise cancellation...	37	Emerging	java-tts-libraries	4	C++
2007	artcore-c/AI-Voice-Clone-with-Qwen3-TTS Free voice cloning and TTS for creators using Qwen3-TTS on Google Colab....	37	Emerging	voice-cloning-tools	13	—
2008	jonelo/jAdapterForNativeTTS A simple pure Java library that allows you to use the native Text To Speech...	37	Emerging	java-tts-libraries	12	Java
2009	ScottishFold007/Cosyvoice_DPO_NOTES CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO...	37	Emerging	coqui-tts-applications	121	Python
2010	calinalexandru/pericles A browser extension offering intuitive text-to-speech functionality, making...	37	Emerging	browser-tts-extensions	15	TypeScript
2011	nchudleigh/sc2-ultra Voice-controlled StarCraft II - command Zerg, Protoss, or Terran using...	37	Emerging	voice-controlled-robotics	2	Python
2012	aks-devs/mod_openai_tts Freeswitch Speech-To-Text module	37	Emerging	vosk-asr-implementations	9	C
2013	shafaypro/PYSHA A Simple Virtual Assistant Build in Python 3.5	37	Emerging	general-purpose-voice-assistants	19	Python
2014	scripty-bot/scripty Speech to text bot for Discord	37	Emerging	discord-tts-bots	80	Rust
2015	iron-mukakin/Emoji-TTS Irodori-TTSのフォーク、echo-TTSのwebuiになります。	37	Emerging	speech-synthesis-diffusion	7	Python
2016	Martouta/speech_processor Speech-to-text from videos and audios (including youtube and tiktok links)	37	Emerging	speech-recognition-apis	20	Python
2017	rishikksh20/iSTFT-Avocodo-pytorch Ultrafast GAN based Vocoder for Text to Speech	37	Emerging	neural-vocoder-implementations	50	Python
2018	parthgupta1208/VoiceCraft Voice Craft is a desktop AI assistance tool designed to help people with...	37	Emerging	voice-assistant-devices	17	Python
2019	deepily/genie-in-the-box Genie in the Box: Distill Whisper STT => Mistral-7B =>...	37	Emerging	audio-transcription-tools	16	Jupyter Notebook
2020	mozi1924/Qwen3-TTS-EasyFinetuning Easy fine-tuning for Qwen3-TTS: Fast voice cloning and high-quality...	37	Emerging	llm-fine-tuning	32	Python
2021	kurianbenoy/malayalam_asr_benchmarking A study to benchmark whisper based ASRs in Malayalam	37	Emerging	automatic-speech-recognition	11	Jupyter Notebook
2022	audioku/cross-accent-maml-asr Meta-learning model agnostic (MAML) implementation for cross-accented ASR	37	Emerging	end-to-end-asr-frameworks	45	Python
2023	williamxhero/ttsmaker TTSMaker: A Python library for interacting with the TTSMaker API to easily...	37	Emerging	lightweight-tts-libraries	9	Python
2024	loushou/flutter_tts_improved A fork of the Flutter_TTS (https://github.com/dlutton/flutter_tts) plugin,...	37	Emerging	educational-voice-apps	10	Java
2025	skit-ai/speech-recognition SDKs and docs for Skit's speech to text service	37	Emerging	voice-ai-sdks	21	Python
2026	superU-ai/voice-agent-QA A unified benchmarking framework for evaluating Voice AI agents across...	37	Emerging	voice-agent-applications	3	Python
2027	jfainberg/lattice_combination Lattice combination algorithm to combine inaccurate transcripts with...	37	Emerging	automatic-speech-recognition	16	Jupyter Notebook
2028	phineas-pta/speech-synthesis-ngngngan python script to download & process data to train a speech-synthesis model...	37	Emerging	voice-cloning-synthesis	14	Python
2029	chameleon-ai/vevo Simple GUI for Amphion Vevo	37	Emerging	coqui-tts-applications	14	Python
2030	acyclics/speech-to-speech-translator Enables a device to input speech from a microphone, translate speech to a...	37	Emerging	speech-translation-apps	12	C++
2031	mirfan899/CTTS Cantonese TTS frontend	37	Emerging	lightweight-tts-runtimes	16	Python
2032	frrobledo/AutoDub An advanced AI-powered tool that automatically translates and dubs YouTube...	37	Emerging	video-dubbing-tools	16	Python
2033	hcoles/voices Fast, in-process text to speech for Java	37	Emerging	piper-tts-ecosystem	54	Java
2034	ferosai/feros Open-source voice agent OS. Rust runtime, AI-driven builder, sub second...	37	Emerging	—	4	Rust
2035	qiujiali/lattice_rnn Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation	37	Emerging	ctc-asr-implementations	15	Python
2036	liou666/audiread 📻 A simple and user-friendly online TTS tool. (简单易用的在线文本转语音工具)	37	Emerging	google-tts-libraries	11	TypeScript
2037	stevenhillis/awesome-asr-contextualization A curated list of awesome papers on contextualizing E2E ASR outputs	37	Emerging	end-to-end-asr-frameworks	80	—
2038	mishrababhishek/chatbot AI Chatbot answers students' queries about their college program using...	37	Emerging	voice-chatbot-applications	9	Python
2039	botbahlul/js-live-audio-video-translate HTML Web template that can RECOGNIZE any live audio/video streaming (using...	37	Emerging	live-meeting-translation	19	JavaScript
2040	ameerbadri/twilio-asr-realtime-dashboard Twilio ASR and Intent Realtime Dashboard	37	Emerging	ai-tutoring-platforms	15	JavaScript
2041	ndenicolais/SpeechAndText Android application built with Kotlin and Jetpack Compose that shows how to...	37	Emerging	android-speech-apps	16	Kotlin
2042	OpenASR/idiolect 🎙️ Handsfree Audio Development Interface	37	Emerging	android-voice-assistants	102	Kotlin
2043	SaptakBhoumik/easySpeech easySpeech is an open-source Python wrapper for google speech to text API...	37	Emerging	speech-recognition-apis	16	Python
2044	weespin/RequestifyTF2 Client side commands for mic spamming and more!	37	Emerging	dotnet-tts-libraries	16	C#
2045	clloret/speaking-practice An Android application to practice English pronunciation	37	Emerging	android-speech-apps	16	Kotlin
2046	theaifutureguy/Vocal-Agent A sophisticated real-time voice assistant that seamlessly integrates speech...	37	Emerging	conversational-chatbot-applications	25	Python
2047	Helow19274/aiogTTS Async Python library to interface with Google Translate's text-to-speech API	37	Emerging	lightweight-tts-libraries	8	Python
2048	SkyDocs/speaker-identification Speaker Identification using Neural Net.	37	Emerging	keyword-speech-recognition	20	Python
2049	haiodo/oaitt An OpenAI compatible transcriber using transformers and whisperx.	37	Emerging	whisper-speech-transcription	6	Python
2050	LibraryOfCongress/speech-to-text-viewer AWS Transcribe evaluation pipeline: bulk-process audio files and view the results	37	Emerging	real-time-voice-translation	17	Python
2051	DrAchernar/location-based-AR-app This Flutter project is an example for a location based AR app with...	37	Emerging	educational-voice-apps	78	Dart
2052	abinashmeher999/voice-data-extract A command line interface to combine text information from subtitles with...	37	Emerging	speech-recognition-apis	19	Python
2053	LuluW8071/Conformer End-to-End Speech Recognition Training with Conformer CTC using PyTorch Lightning⚡	37	Emerging	conformer-asr-implementations	13	Jupyter Notebook
2054	cmsflash/deep-learning-sota State-of-the-art results for deep learning tasks in various fields.	37	Emerging	speech-ai-coursework	15	—
2055	linto-ai/linto-diarization Speaker diarization service	37	Emerging	whisper-diarization	28	Python
2056	ORI-Muchim/One-Click-MB-iSTFT-VITS2 MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making...	37	Emerging	vits-tts-implementations	13	Python
2057	niteshsharmacodes/neutts-ultimate NeuTTS-Ultimeate - Advanced Text-to-Speech generation with unlimited...	36	Emerging	coqui-tts-applications	5	Python
2058	Mohamed-samy2/Video-Interview-Analysis PRVIA is an AI-powered system that automates the evaluation of pre-recorded...	36	Emerging	ai-interview-coaching	9	JavaScript
2059	csyan5/AttnGAN-Audio-to-image-geneation CMPT726 Machine Learning Final Project	36	Emerging	speech-ai-coursework	12	Python
2060	nate-russell/Scholar2Go Make MP3 albums out of Academic PDFs. Works by gluing together Grobid and...	36	Emerging	pdf-to-audio-conversion	12	Python
2061	arora-r/chatapp-with-voice-and-openai This project uses OpenAI's GPT-3 model to create a simple assistant that can...	36	Emerging	voice-chatgpt-interfaces	7	JavaScript
2062	javichur/fitness-voice AI voice-controlled trainer in your web browser, using NLP (wit.ai), body...	36	Emerging	health-app-development	14	JavaScript
2063	speechly/browser-client-example A demo app showcasing Speechly browser-client and detailed api responses.	36	Emerging	web-speech-api-libraries	15	TypeScript
2064	Fraunhofer-AISEC/towards-resistant-audio-adversarial-examples Generation tool for offset-resistant audio adversarial examples against Deepspeech	36	Emerging	neural-vocoder-implementations	10	Python
2065	nixonyh/UnityASR Automatic Speech Recognition in Unity.	36	Emerging	dotnet-tts-libraries	32	C#
2066	KoalaV2/K.A.I Home automation program controlled by your voice.	36	Emerging	voice-controlled-robotics	15	Python
2067	nheidloff/unity-watson-vr-sample Virtual Reality Sample using IBM Watson, Unity and Google Cardboard	36	Emerging	dotnet-tts-libraries	11	C#
2068	piotrkawa/deepfake-whisper-features Implementation of the paper "Improved DeepFake Detection Using Whisper Features"	36	Emerging	deepfake-detection-systems	112	Python
2069	mike-nott/smart-announcements Intelligent context-aware voice announcements for Home Assistant....	36	Emerging	home-assistant-tts	7	Python
2070	Vishnu-tppr/NEXORA-AI Made with Python, crafted by Vishnu 💻✨ Nexora AI – A smart Python voice...	36	Emerging	general-purpose-voice-assistants	13	Python
2071	Franck-Dernoncourt/ASR_benchmark Program to benchmark various speech recognition APIs	36	Emerging	automatic-speech-recognition	81	Python
2072	chirag127/WebSpeak-TextToSpeech-Browser-Extension High-fidelity browser extension leveraging the Web Speech API for precise,...	36	Emerging	browser-tts-extensions	2	JavaScript
2073	Hagsten/Talkify Javascript Text to speech library	36	Emerging	web-speech-api-tts	239	JavaScript
2074	arham-kk/openai-tts This repository features a Gradio interface designed to leverage the OpenAI...	36	Emerging	gradio-tts-webuis	14	Python
2075	manab-kb/Voice-Based-Translator A Voice Based Translator - Speak in English or any of the available selected...	36	Emerging	speech-translation-apps	9	Python
2076	chattylabs/conversational-flow The Conversational Flow combines both native built-in resources and cloud...	36	Emerging	voice-command-assistants	9	Java
2077	gaborvecsei/whisper-live-transcription Live-Transcription (STT) with Whisper PoC	36	Emerging	whisper-transcription-apps	201	Python
2078	thc1006/whisper-colab-tpu-transcriber High-performance Google Colab Notebook for fast & accurate audio...	36	Emerging	whisper-transcription-apps	14	Jupyter Notebook
2079	richardassar/SampleRNN_torch Torch implementation of SampleRNN: An Unconditional End-to-End Neural Audio...	36	Emerging	audio-noise-reduction	156	Lua
2080	neurlang/gospeak A Golang Text to Speech System	36	Emerging	go-tts-libraries	18	Go
2081	b4rtaz/voice-assistant Voice assistant for Visual Studio Code.	36	Emerging	voice-command-assistants	296	TypeScript
2082	yh1008/speech-to-text mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras	36	Emerging	keyword-speech-recognition	71	Jupyter Notebook
2083	resemble-ai/resemble-unity-text-to-speech Resemble's voice cloning engine within Unity	36	Emerging	dotnet-tts-libraries	184	C#
2084	jvandenaardweg/ssml-split Splits SSML strings into batches AWS Polly ánd Google's Text to Speech API...	36	Emerging	aws-polly-tts	15	TypeScript
2085	bdim404/Qwen3-TTS-WebUI 基于阿里巴巴 Qwen3-TTS 模型（17 亿参数）的全栈文本转语音 Web 应用，支持语音定制、语音设计和语音克隆，有声书生成功能。A...	36	Emerging	qwen3-tts-applications	16	Python
2086	ArchitParnami/Few-Shot-KWS Few-Shot Keyword Spotting	36	Emerging	wake-word-detection	71	Jupyter Notebook
2087	ohmstone/pocket-tts-deno WASM ONNX build of Pocket TTS with voice cloning adapted from...	36	Emerging	text-to-speech	4	TypeScript
2088	aperepel/claude-mlx-tts Voice-cloned smart attention TTS notifications for Claude Code. AI...	36	Emerging	voice-enabled-coding-assistants	7	Python
2089	azu/vscode-read-aloud-text VSCode extension that read aloud text like Markdown and text etc...	36	Emerging	ai-powered-ereaders	14	TypeScript
2090	AceCentre/TextAloud iOS app. Built in Swift. Reads out text - sentence by sentence, paragraph by...	36	Emerging	ios-speech-frameworks	11	C++
2091	alecokas/BiLatticeRNN-Confidence Confidence Estimation for Black Box Automatic Speech Recognition Systems...	36	Emerging	ctc-asr-implementations	14	Python
2092	manish-4007/YT-video-Transcription An AI tools which helps to analyze any YouTube video, give the sentiment of...	36	Emerging	video-transcription-extraction	11	Jupyter Notebook
2093	ga642381/FastSpeech2 Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to...	36	Emerging	fastspeech-tts-models	99	Python
2094	bhashini-ai/g2p Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a...	36	Emerging	grapheme-to-phoneme-conversion	12	Java
2095	prateekralhan/Speech2Text-for-Long-Audio-Files Perform SOTA Speech2Text on Long Audio Files with/without diarization Using...	36	Emerging	speech-recognition-apis	14	Python
2096	vijethph/Insight A Flutter app to help blind people.	36	Emerging	educational-voice-apps	12	Dart
2097	anwar-gazi/ivrworks Build IVR, run voice campaign, with machine detection, speech recognition...	36	Emerging	ai-tutoring-platforms	14	HTML
2098	asus4/unity-speech-recognizer iOS Speech Recognizer for Unity	36	Emerging	dotnet-tts-libraries	12	C#
2099	marcominerva/TranslatorService A lightweight library that uses Cognitive Translator Service for text...	36	Emerging	dotnet-tts-libraries	12	C#
2100	kwebby/Qwen3-TTS-Voice-Studio A Text to Speech App for Qwen3-TTS Family Models to create custom voices,...	36	Emerging	qwen3-tts-applications	4	JavaScript

« Prev 1 2 3 … 19 20 21 22 23 … 80 81 82 Next »