All Voice AI Tools

8,165 tools ranked by quality score · Page 47 of 82

Showing 4601–4700 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
4601	dunkbing/text2audio Simple TTS tool made with Fresh	23	Experimental	web-speech-api-tts	17	TypeScript
4602	ZarredFelicite/parakeet-transcriber An audio transcription tool using NVIDIA Parakeet, available as a CLI or...	23	Experimental	parakeet-asr-implementations	2	Python
4603	sonhm3029/Realtime-Vietnamese-ASR-React-Native-and-Whisper This project implement end to end realtime vietnamese speech recognition...	23	Experimental	whisper-fine-tuning	4	JavaScript
4604	ShunsukeHayashi/byteplus-voice-ai BytePlus音声対話AIアプリケーション - ASR, TTS, Voice Cloning統合（WebSocket対応、日本語対応✅）	23	Experimental	google-tts-libraries	1	JavaScript
4605	neosapience/typecast-js The official Node.js SDK for the Typecast API.	23	Experimental	google-tts-libraries	6	TypeScript
4606	KarinBrisker/Video-Subtitler Automatically Generating Multilingual Subtitles Using OpenAI's Whisper and...	23	Experimental	whisper-subtitle-generation	4	Python
4607	kongju7/my_project6 Personal project 6: Speech Recognition Deep Learning Chatbot -...	23	Experimental	voice-chatbot-applications	4	Python
4608	bitgineer/Speakeasy Privacy-first local voice-to-text using Whisper AI. Cross-platform desktop...	23	Experimental	voice-dictation-typing	2	Python
4609	Caliope-SpeechProcessingLab/SpeechTester Speech Tester is a set of Python scripts conceived as an extension to HTK...	23	Experimental	speech-recognition-apis	4	—
4610	lispking/qwen3-tts-mlx A simple and easy-to-use wrapper package for Qwen3 TTS based on MLX Audio....	23	Experimental	qwen3-tts-applications	2	Python
4611	my-north-ai/semantic_audio_filtering Synthetic data augmentation technique via LLM for Automatic Speech...	23	Experimental	whisper-fine-tuning	10	Python
4612	loserbcc/openclaw-gateway Open-source WSS gateway for connecting phones to moltbots. Speaks OpenClaw...	23	Experimental	openclaw-voice-assistants	2	Python
4613	danijcom/whisper-telegram-bot Simple Telegram bot for transcribing voice messages into text (STT) in...	23	Experimental	speech-to-text-converters	2	Python
4614	blastheart1/voice-ai-braincx 🎤 Real-time voice AI conversational agent with LiveKit, FastAPI & React....	23	Experimental	voice-agent-applications	2	TypeScript
4615	shashankchandak/AutoSMSReader An android application that allows users to read all incoming messages loudly	23	Experimental	android-speech-apps	4	Java
4616	sudonitin/MediumScraper Scraping articles of medium and providing audio versions 📑 to 🔊 using django	23	Experimental	news-audio-bulletins	18	Python
4617	zzpuser/SnapDict macOS AI 翻译词典，基于 DeepSeek 提供智能翻译、词根助记、拼写纠正和语音朗读 \| AI-powered dictionary app...	23	Experimental	local-voice-dictation	2	Swift
4618	FarzadForuozanfar/Speech-Recognition I recorded 10 voices with the same words from myself and compared them with...	23	Experimental	keyword-speech-recognition	26	Jupyter Notebook
4619	smcantab/speak11 Select text, press ⌥⇧/, hear it read aloud. macOS text-to-speech powered by...	23	Experimental	system-tts-wrappers	2	Shell
4620	Usman-bin-Khalid/Jarvis-AI-Voice-and-Text-Assistant-Python- Jarvis AI Voice & Text Assistant – A Python-based desktop AI assistant with...	23	Experimental	python-voice-assistants	6	Python
4621	labrijisaad/Youtube-video-transcriptor In this notebook, I implemented a script to transcribe YouTube videos (and...	23	Experimental	video-transcription-extraction	17	Jupyter Notebook
4622	boltomli/speech-api Demo to show how to use Azure Speech Services API in app	23	Experimental	web-speech-api-libraries	2	TypeScript
4623	Mohamed-Ashik-S/Speech-to-Text This is a Speech to text project which uses openAI's Whisper model.	23	Experimental	whisper-transcription-apps	4	Jupyter Notebook
4624	language-org/voice-activ-detect-deepnet ASR: Light deep net for real-time voice activity detection	23	Experimental	ios-speech-frameworks	4	Jupyter Notebook
4625	Mohamedfat7i/local-voice-cloning-app 🔊 Clone voices easily with this lightweight Python app that synthesizes...	23	Experimental	voice-cloning-tools	2	HTML
4626	dusionlike/unplugin-string-to-audio 在打包过程中自动将字符串转换为语音文件并添加到最终的打包文件里面, 支持Vite and Webpack	23	Experimental	google-tts-libraries	1	TypeScript
4627	chrismarquezz/voice-chess An interactive chess app that lets you play and control the game entirely...	23	Experimental	sign-language-translation	2	Swift
4628	adelacvg/DPTTS An AR+AR TTS attempt.	23	Experimental	zero-shot-voice-synthesis	18	Python
4629	sandeepmukku12/vocodine 🎙️ VocoDine: Book your table with your voice! Speak your booking details,...	23	Experimental	react-speech-recognition	2	JavaScript
4630	TakumiSenaha/Nreal_IoT This project aims to visualize the sensor information of the surroundings...	23	Experimental	assistive-vision-ai	4	Python
4631	priyanshpsalian/VISION-THE-BLIND An all in one solution for safety and security of blind. Features covered in...	23	Experimental	assistive-vision-ai	4	Python
4632	Kimosabey/vox-agent-neural Neural Voice Agent core constructs for conversational AI.	23	Experimental	voice-agent-applications	2	TypeScript
4633	nsourlos/voice_cloning_tools Various tools to clone a voice	23	Experimental	voice-cloning-tools	16	Jupyter Notebook
4634	MaurerKrisztian/vrc-tts-osc Text-to-Speech & AI Bot With OSC Integration	23	Experimental	dotnet-tts-libraries	2	Python
4635	guptakushal03/Virtual-Voice-Assistant This Python script creates a voice-controlled desktop assistant capable of...	23	Experimental	voice-controlled-desktop-automation	1	Python
4636	dangvansam/nvidia-nemo-jasper-quartznet-asr-vietnamese Nhận dạng giọng nói Tiếng Việt sử dụng model Quartznet (Nvidia) + flask demo	23	Experimental	automatic-speech-recognition	2	Python
4637	axzml/VoxLinkAI_Client Native macOS voice input assistant. Hold a hotkey, speak, and let AI...	23	Experimental	local-voice-dictation	1	Swift
4638	leo01102/lumen Lumen – Asistente IA Empático y Multimodal (rostro y voz) en tiempo real....	23	Experimental	voice-agent-applications	2	TypeScript
4639	Uknowme-h/Audiollect Audiollect is a Notes to AudioBook Web App built with MERN stack , where...	23	Experimental	audio-transcription-apps	2	JavaScript
4640	vaibhav-init/AskCrow Voice Bot using Gemini Model	23	Experimental	gemini-api-applications	3	Dart
4641	vishishttiwari/Android_Application_for_understanding_ASL_using_gesture_recognition An Android Application that uses gesture recognition to understand alphabets...	23	Experimental	sign-language-recognition	14	Java
4642	mohammad-zolghadr/Pro-Todo A professional todolist that stores information in local storage and uses...	23	Experimental	vue-speech-recognition	3	JavaScript
4643	swarnayuroy/Web-Automation-using-speech-recognition Generate results on web browser i.e. automated after user speaks out the...	23	Experimental	general-purpose-voice-assistants	2	Python
4644	ArielDelRio/evernote-clone Notes App is an application to record notes and store them in the cloud in...	23	Experimental	stt	3	JavaScript
4645	LEMAS-Project/LEMAS-Project LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with...	23	Experimental	tts-model-finetuning	8	HTML
4646	egorsmkv/w2v2-bert-aligner Aligner for wav2vec2-bert models	23	Experimental	wav2vec2-asr-models	3	Python
4647	taufiq-ai/Bengali-AI-Recieptionist An AI Recieptionist Flask App with STT, TTS, FaceRecognition,...	23	Experimental	speech-recognition-apis	2	Jupyter Notebook
4648	ccj242/Audible-Deaf-Communications A non-profit app designed to make help the deaf communicate in person and...	23	Experimental	sign-language-translation	4	JavaScript
4649	kamya-ai/Talk2Text-Live "Talk2Text Live" is a cutting-edge project that harnesses the power of...	23	Experimental	whisper-transcription-apps	3	Python
4650	alwalid54321/AI-Voice-Assistant A modern, voice assistant built with React, TypeScript, and the Hugging Face...	23	Experimental	voice-assistant-applications	2	TypeScript
4651	theimpossibleastronaut/pennyworth Voice recognition based digital home assistant in progress. Quite unusable...	23	Experimental	general-purpose-voice-assistants	2	JavaScript
4652	metacore-stack/vocalcanvas-studio Craft expressive speech from text using a streamlined pipeline of voices,...	23	Experimental	text-to-speech-conversion	10	JavaScript
4653	cagataygedik/TTS Internship Text-to-Speech research project.	23	Experimental	ios-speech-frameworks	4	Swift
4654	itsanthonio/Vision-To-Speech A vision to speech project	23	Experimental	image-caption-generation	4	Jupyter Notebook
4655	sancliffe/ollama-STT-TTS A simple, hands-free Python voice assistant that runs 100% locally. This...	23	Experimental	local-voice-assistants	7	Python
4656	YIZHUANG/InstrumHack For tieto hackathon 2018 to improve Finnish people financial well-being	23	Experimental	assistive-vision-ai	3	CSS
4657	RafaelCenzano/Marvin-v3-client Marvin Version 3 client version	23	Experimental	general-purpose-voice-assistants	2	Python
4658	Mrzhangxiaoduo/react-native-speech-recognizer react-native-speech-recognizer	23	Experimental	react-native-voice-libraries	1	Objective-C
4659	Jmi2020/HowdyVox A privacy focused offline STT TTS interface for your favorite LLM	23	Experimental	local-voice-assistants	1	Python
4660	zainibaloch/Quran-App---All-in-one A fully responsive Next.js 13 Quran web app with audio recitation,...	23	Experimental	chatgpt-api-tutorials	1	TypeScript
4661	lkwbr/structured-prediction Machine learning algorithms for structured inputs and outputs, such as on...	23	Experimental	speech-ai-coursework	4	Python
4662	Adexandria/TextToSpeechAPI A REST API that converts a text image to an mp3 file. The text image can...	23	Experimental	dotnet-tts-libraries	3	C#
4663	geniusrise/audio Audio components for geniusrise framework	23	Experimental	voice-ai-learning-collections	2	Python
4664	sobrunmoksesh/Intellifacts_Android_Project An application that allows you to read facts. It includes voice interaction...	23	Experimental	android-voice-assistants	3	Java
4665	thewh1teagle/phonikud-assistant Local AI assistant in Hebrew with Phonikud ✨	23	Experimental	voice-command-assistants	2	Python
4666	daniel-szulc/Speech_Recognition 🎙 Automatic Keyword Speech Recognition for Polish and English in Tensorflow 🧠	23	Experimental	wake-word-detection	4	Python
4667	ctoth/Qlatt Explainable WebAudio Klatt formant synthesizer with declarative TTS frontend...	23	Experimental	web-speech-api-libraries	2	TypeScript
4668	jqi41/Subrank ICASSP 2020	23	Experimental	automatic-speech-recognition	3	C++
4669	falniak95/TurkishSpeechRecognition Tamamen Türkçe Konuşma Algılama Sistemi. Google Cloud Platform API desteği...	23	Experimental	dotnet-tts-libraries	1	C#
4670	spacelatte/Basic-Digital-Signage This is a android application that serves as simple digital signage...	23	Experimental	android-speech-apps	3	Java
4671	oarthurfc/AI-outgoing-call An intelligent voice agent that automatically calls leads, promoting...	23	Experimental	voice-agent-applications	1	JavaScript
4672	dantasl/parrot-ai This is a proof of concept that generates speech based on parameters...	23	Experimental	parakeet-asr-implementations	3	HTML
4673	algorithmio/accent-conversion-ai Real-time accent conversion during phone calls using Twilio, Deepgram, and...	23	Experimental	voice-agent-applications	6	JavaScript
4674	thirteenkai/bob-plugin-qwen-tts Bob TTS 插件 - 使用阿里云 Qwen3-TTS-Flash 模型进行语音合成，支持 45+ 种语音角色	23	Experimental	system-tts-wrappers	2	JavaScript
4675	peterxubuaa/Voice-Assistant Voice Assistant	23	Experimental	android-voice-assistants	3	Java
4676	dwain-barnes/DeepSeek-Thinking-TTS Listen to DeepSeek's thinking process in real-time! This script converts...	23	Experimental	deepseek-deployment-tools	3	Python
4677	tubexchat/interpreter-zh2en-gemini An interpreter web app between Chinese and English that is powered by Gemini-2.0-fash	23	Experimental	content-to-podcast-converters	6	TypeScript
4678	ahmedoubadi/kokoro-tts Open-source Kokoro-TTS API server (FastAPI) and web UI (React) for...	23	Experimental	kokoro-tts-ecosystem	6	TypeScript
4679	RGonza1529/Nura A Full-Stack React/Node.js AI-powered web application that provides...	23	Experimental	audio-transcription-apps	2	JavaScript
4680	apluka34/audio-crawler A tool for crawling and creating audio dataset	23	Experimental	speech-corpora-datasets	3	Python
4681	sagar-alias-jacky/F.R.I.D.A.Y A basic but fun virtual assistant made using Python	23	Experimental	python-voice-assistants	3	Python
4682	Yuanshi9815/LiteFocus [Interspeech 2024] LiteFocus is a tool designed to accelerate...	23	Experimental	diffusion-model-frameworks	34	Python
4683	Amiannn/Simple-HmmGmm Simple HMM implementation	23	Experimental	keyword-speech-recognition	3	Python
4684	nathanyaqueby/roche-dementia-hackathon AI and AR-based digital memory lane and cognitive stimulation for dementia patients	23	Experimental	assistive-vision-ai	3	Python
4685	OldBonhart/TensorFlow_Speech_Recognition_Challenge TensorFlow Speech Recognition Challenge -...	23	Experimental	keyword-speech-recognition	3	Jupyter Notebook
4686	tez3998/audio-output-to-text VOSKを使ったスピーカーやヘッドフォンから出力される音声のオフライン文字起こし	23	Experimental	vosk-asr-implementations	3	Python
4687	ilya16/speech-synthesis-course An introduction course on Speech Synthesis and Voice Cloning (Skoltech ISP'25)	23	Experimental	speech-ai-coursework	6	—
4688	bobbymay/Dictation-for-macOS Speech Recognition for macOS that allows you to define words, phrases, or...	23	Experimental	local-voice-dictation	4	Swift
4689	Zhang-Nian/Intelligent_CustomerService Speech Recognition 、Speech Synthesis 、Intelligent Dialogue	23	Experimental	voice-chatbot-applications	3	Python
4690	saharmor/EchoScribe Local AI transcription workspace with cloud APIs (OpenAI Whisper) or local...	23	Experimental	audio-transcription-tools	1	TypeScript
4691	derpeloper/ostinato giving a voice to the voiceless.	23	Experimental	discord-tts-bots	2	JavaScript
4692	LiBinZyu/VAI Implement highly precise natural language voice control in any Unity...	23	Experimental	conversational-rag-agents	23	C#
4693	jitendrakw09/Voice-Sangam Voice Sangam is a modern text-to-speech platform built with Next.js 16,...	23	Experimental	voice-ai-agents	1	—
4694	YoussefBechara/Enhanced-Custom-ChatBot Custom Built AI Chatbot using Huggingface's ai, enhanced with features such...	23	Experimental	voice-chatbot-applications	2	Python
4695	ashwin2k/LibraAI Libra.AI is a women's safety-focused voice-activated A.I. assistant android...	23	Experimental	multimodal-medical-assistants	3	Java
4696	Kunal-Kumar-Sahoo/iCompanion-AssistanceMadeSimple This is a Python3 based virtual assistant developed for Computer Science...	23	Experimental	general-purpose-voice-assistants	3	Python
4697	imsanjoykb/Speech-NLP-Bootcamp Speech NLP Bootcamp	23	Experimental	speech-ai-coursework	3	Jupyter Notebook
4698	Aslm-Fawzy/Speech-Recognition-Using-Raspberry-Pi Simple Speech Recognition Program Run on Raspberry Pi	23	Experimental	automatic-speech-recognition	3	Jupyter Notebook
4699	NEURASCOPE/neurascreen Automate product tour videos with JSON scenarios. Real browser recording, AI...	23	Experimental	ai-video-generation	2	Python
4700	ErolOZKAN-/TurkishSpeechRecognition Turkish Speech Recognition Project / Türkçe Konuşma Tanıma Projesi	23	Experimental	web-speech-api-libraries	3	HTML

« Prev 1 2 3 … 45 46 47 48 49 … 80 81 82 Next »