All Voice AI Tools

8,165 tools ranked by quality score · Page 48 of 82

Showing 4701–4800 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
4701	thewh1teagle/whisper.zig Transcribe audio with whisper in zig	23	Experimental	speech-to-text-converters	2	Zig
4702	halisuyanik/speech-recognition-note-app-vue.js-regex Note application that converts voice command to text and performs voice...	23	Experimental	stt	3	Vue
4703	MichaelFeng87/CGN_speech_recognition Speech recognition using DNNs, script to create features, use kaldi for...	23	Experimental	ctc-asr-implementations	3	Python
4704	keymastervn/htksupport Minimal HTK for supporting HTK in Vietnamese.	23	Experimental	kaldi-asr-ecosystem	4	Ruby
4705	sherry-exec/urdu-tts-lib Microsoft Speech SDK 11 - C# .Net 4 - Urdu Text-to-Speech System	23	Experimental	dotnet-tts-libraries	3	C#
4706	JmKanmo/VoiceRecognitionMemoApp Speech recognition and memo application	23	Experimental	android-speech-apps	2	Java
4707	brihijoshi/iterative-feature-normalisation-ICASSP-2011 This repository contains a Python implementation of the paper "Iterative...	23	Experimental	speech-emotion-recognition	3	Jupyter Notebook
4708	balavenkatesh3322/speech_to_text It will convert our voice into text using Google speech API	23	Experimental	speech-recognition-apis	3	Python
4709	mict-zhaw/chall_e2e_stt End-to-end ASR experiments for language learning, focusing on...	23	Experimental	end-to-end-asr-frameworks	4	Python
4710	ACinesi/nao-strips-planner Ai project work about NAO robot strips planner.	23	Experimental	voice-chatbot-applications	3	Python
4711	auralshin/python python tryout projects	23	Experimental	voice-ai-learning-collections	3	HTML
4712	Shaashwat05/Smart_clock A smart clock which understands voice command and performs tasks accordingy	23	Experimental	general-purpose-voice-assistants	3	Python
4713	rahulkarda/Speech-Recognition A Speech Recognition web app that converts speech to text in real time.	23	Experimental	web-speech-api-libraries	3	CSS
4714	arch-ith/voice_to_signLanguage Voice to Sign Language Conversion	23	Experimental	sign-language-recognition	34	Python
4715	UltraInstinct0x/vlc-auto-dub AI-powered automatic video dubbing and transcription extension for VLC....	23	Experimental	video-dubbing-tools	2	Python
4716	restacksyj/speech-emotion-detection Final Year Project on Speech Emotion Recognition with CNN and LSTM.	23	Experimental	speech-emotion-recognition	3	HTML
4717	alexiusstrauss/AudioTopic Aplicação que processa arquivos de áudio (.mp3 ou .wav), convertendo-os em...	23	Experimental	ai-video-generation	3	Vue
4718	Rajesh42/VoiceAssistant Build your own AI personal assistant using Python (Alexa and Jarvis both are...	23	Experimental	python-voice-assistants	3	Python
4719	divyanshuio/GPT_App its a smart assistant that can answer any question	23	Experimental	voice-chatgpt-interfaces	3	Python
4720	tellang/sonote AI 에이전트를 위한 소리 노트 — 실시간 한국어 음성 전사 CLI	23	Experimental	voice-ai-agents	1	Python
4721	zsl24/Speech-Processing-Doc 一个关于语音算法技术汇总的文档	23	Experimental	speaker-diarization-embedding	4	—
4722	chirag127/ComicSpeak-AI-Web-Comic-Dubber-Browser-Extension Transforms web comics into audio with AI-powered OCR and TTS	23	Experimental	browser-tts-extensions	1	JavaScript
4723	the-bird-F/Expressive-Vectors [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal...	23	Experimental	zero-shot-voice-synthesis	38	Python
4724	dvamsidhar2002/Project-VIVA-Personal-Desktop-and-Voice-Assistant This is a personal desktop assistant which will do few tasks for you. It is...	23	Experimental	voice-controlled-desktop-automation	3	Python
4725	Heatwave114/wazobia-open-speech-mobile This is an open-source mobile application that augments the wazobia...	23	Experimental	android-speech-apps	2	Dart
4726	yuhanwang14/ASR-Pipeline Local GPU-accelerated speech transcription pipeline with speaker diarization...	23	Experimental	funasr-speech-recognition	2	Python
4727	chandong83/NaverTTS_with_CSharp NaverTTS with C#	23	Experimental	dotnet-tts-libraries	3	C#
4728	Vilhaem/Teams-Notification-Bot Notification Bot that calls user via teams or phone number and plays a...	23	Experimental	dotnet-tts-libraries	3	C#
4729	NikhilKalloli/Voice-Recognition A Streamlit web application for Voice recognition using a pre-trained speech...	23	Experimental	voice-cloning-synthesis	2	PureBasic
4730	DuyguA/TSD2025-Mind-the-Gap Innovative ASR model to keep named entities intact, offered as a conference paper.	23	Experimental	end-to-end-asr-frameworks	1	Python
4731	lliWcWill/maVoice-Linux 🎙️ Lightning-fast voice dictation Desktop Web App powered by Groq's Whisper...	23	Experimental	voice-dictation-typing	3	Rust
4732	yulinliu101/ASR_ATC speech recognition system to transcribe ATC voice data	23	Experimental	automatic-speech-recognition	4	Python
4733	sse-digital-man/TTS-Core 数字人项目-TTS部分	23	Experimental	lightweight-tts-libraries	4	Python
4734	standing-o/Combined_Dataset_for_Speech_Emotion_Recognition A collection of dataset consists of a total of 8 English speech datasets for SER	23	Experimental	speech-emotion-recognition	30	Jupyter Notebook
4735	jaju/voissistant Voiss Aceistant - Apple only, with mlx.	23	Experimental	local-voice-dictation	2	Python
4736	daniel-keogh/wwtbam A voice-controlled spin on "Who Wants to Be a Millionaire?", made with Unity	23	Experimental	dotnet-tts-libraries	3	C#
4737	mbailey/push2type Turn CAPSLOCK key into Dictation Key	23	Experimental	voice-dictation-typing	19	Shell
4738	code-spirit-369/text-to-speech-yt This AI TTS web application allows you to convert any text into realistic,...	23	Experimental	elevenlabs-integrations	4	TypeScript
4739	andreluizsecco/IoTVoiceControl Demonstração do acionamento de dispositivos IoT através de comandos de voz,...	23	Experimental	dotnet-tts-libraries	2	C#
4740	nihal-5/ditch-speechify Free Speechify alternative - Stop paying $139/year. Listen to PDFs,...	23	Experimental	openai-tts-applications	7	JavaScript
4741	KubiakJakub01/Valle2 Implementation of TTS and ASR model based on VALL-E X architecture	23	Experimental	tacotron-tts-models	4	Python
4742	WhaddaMakers/RPi-colour-checker-tutorial A Raspberry Pi is useful in all kinds of ways, even if you are looking to...	23	Experimental	assistive-vision-ai	3	Python
4743	suryanktiwari/Artlet Concept of a multi-content sharing and reading social platform. In app...	23	Experimental	android-speech-apps	3	Java
4744	Kadir-Atmaca/Asistan-STT-Vosk Bu depo stt yani speech to text Türkçesiyle sesi yazıya çevirme Türkçe şekilde	23	Experimental	dotnet-tts-libraries	2	C#
4745	lgpearson1771/openwakeword-trainer Train custom wake word models with openWakeWord. A granular 13-step pipeline...	23	Experimental	wake-word-detection	2	Python
4746	hash2004/conformer-fine-tuned-urdu This repository includes all the essential scripts and notebooks required...	23	Experimental	conformer-asr-implementations	3	Jupyter Notebook
4747	GioPicci/videowise VideoWise is a video transcription and AI-powered analysis tool that helps...	23	Experimental	audio-transcription-tools	4	HTML
4748	suzumushi0/SoundObject_source SoundObject source code distribution.	23	Experimental	audio-source-separation	9	C
4749	xaeksx/ComfyUI-AudioSR 🎶 Enhance audio quality with ComfyUI-AudioSR, a versatile tool for upscaling...	23	Experimental	comfyui-extensions	2	Python
4750	Monal5031/TextToSpeech-Converter A Simple Text To Speech Converter in java	23	Experimental	android-speech-apps	3	Java
4751	ankuragrwl/google-tts Application to try out Google Text to Speech API	23	Experimental	google-tts-libraries	2	TypeScript
4752	yujiliu/oresta Oresta - is the first voice assistant in the Ukrainian language.	23	Experimental	general-purpose-voice-assistants	4	Python
4753	zvz23/vProfanity A software solution that automates the detection and censorship of profanity...	23	Experimental	meeting-transcription-summarizers	2	C#
4754	anujsahani01/Classification-Project Intent and Entity Extraction and Classification from audio files	23	Experimental	speech-ai-coursework	3	Jupyter Notebook
4755	chiragjoshi12/pdf-to-podcast Convert any PDF into a podcast episode using Gemini and Elevenlabs!	23	Experimental	content-to-podcast-converters	2	Python
4756	nyumaya/libnyumaya_esp32 Experimental support for nyumaya audio recognition on ESP32	23	Experimental	wake-word-detection	4	C++
4757	InboraStudio/Google-Cloud-Speech-Recognition-Unity Unity Speech Recognition with Google Cloud A cross-platform speech...	23	Experimental	dotnet-tts-libraries	2	C#
4758	pulkitsxn059/Jarvis-PC-Assistant- Implemented a Desktop PC Assistant Application in Java. The Application can...	23	Experimental	python-voice-assistants	3	Java
4759	ldl805/QuickSpeechPi Very, very lightweight and simple text to speech (TTS) program that outputs...	23	Experimental	lightweight-tts-libraries	2	Python
4760	bonniepeng2002/Apollo Apollo: your intuitive, virtual nurse.	23	Experimental	android-voice-assistants	1	Java
4761	brayden-s-haws/speak_easy_text_to_speech A straightforward way to convert text to speech.	23	Experimental	web-based-tts-apps	1	Python
4762	alvarosg88/Talk-to-the-Bot A WebGL demo that combines virtual reality, speech recognition and synthetic...	23	Experimental	ai-tutoring-platforms	3	JavaScript
4763	yepicaiaaron/awesome-audio-generation-2026 🎙️ Curated collection of open-source audio generation models released in...	23	Experimental	voice-ai-learning-collections	2	—
4764	python019/subui-speech-assistant Python AI project	23	Experimental	general-purpose-voice-assistants	36	Python
4765	Kaljurand/K6nele-service Kõnele service is an Android app that offers a speech-to-text service to...	23	Experimental	android-speech-apps	39	Java
4766	Simone-Convertini/Speech-Summarization-Demo A Web Api written using Go and Gin capable to perform Speech Summarization...	23	Experimental	go-tts-libraries	4	Go
4767	nicolas-dufour/self-supervised-low-res-speech This project transfert the self supervised Wav2vec2 representation to low...	23	Experimental	wav2vec2-asr-models	3	Jupyter Notebook
4768	supevil/SoulX-Singer-Eval 🎤 Evaluate zero-shot Singing Voice Synthesis systems for quality, accuracy,...	22	Experimental	tts-model-finetuning	—	Python
4769	atmehedi/Speech-to-text-in-Assamese TASK ORIENTED DIALOG SYSTEM IN NATIVE LANGUAGE(ASSAMESE)	22	Experimental	automatic-speech-recognition	2	Jupyter Notebook
4770	gaelic-ghost/speak-to-user Local FastMCP text-to-speech server for shared macOS playback, voice...	22	Experimental	voice-enabled-coding-assistants	—	Python
4771	leszini/spoken-mcp Voice interface for Claude Desktop — hands-free conversations using...	22	Experimental	voice-enabled-coding-assistants	—	Python
4772	k1rk11/CriTTS A modern, free Text-to-Speech (TTS) application using Microsoft Edge's TTS engine	22	Experimental	edge-tts-implementations	—	Python
4773	smivv/python-vosk-trial Vosk Speech Recognition Trial	22	Experimental	vosk-asr-implementations	2	Python
4774	donapart/klatsch Klatsch 🐾 — OpenClaw Local Agent: always-on voice assistant, peer...	22	Experimental	openclaw-voice-assistants	—	Python
4775	seanox/seanox-ai-podcast Automated podcast generation pipeline using a YAML-defined structure and...	22	Experimental	content-to-podcast-converters	—	Python
4776	hwpoison/vosk-voice-recognition-c Offline voice recognition using pure C and vosk lib. (from file and from...	22	Experimental	vosk-asr-implementations	6	C
4777	Chrisisaac948/RealWonder Generate real-time videos conditioned on physical actions from a single...	22	Experimental	ai-video-generation	—	Python
4778	ouracademy/speech-to-text A project that show input text with speech recognition trought angular directive	22	Experimental	web-speech-api-libraries	1	TypeScript
4779	ArMohadWaseem90/text2epub 📚 Convert TXT files to EPUB quickly with this Python script, ensuring smooth...	22	Experimental	ebook-to-audiobook-conversion	—	Python
4780	abcname61/audiobook-creator 🎧 Convert MP3 files into professional-quality audiobooks in M4B format with...	22	Experimental	ebook-to-audiobook-conversion	—	JavaScript
4781	edwindoremi/Asterisk 🎮 Streamline esports tournaments with Asterisk, a real-time management...	22	Experimental	ai-avatar-platforms	—	HTML
4782	jibon57/nativescript-azure-cognitiveservices Azure cognitive services implementation for NativeScript.	22	Experimental	dotnet-tts-libraries	1	TypeScript
4783	0x61space/pu-cit371-helicopter-commander Control a helicopter in Grand Theft Auto: San Andreas using speech recognition	22	Experimental	dotnet-tts-libraries	1	C++
4784	ivsergeev/voicer Голосовой ввод, GigaAM v3 e2e, opencode-plugin, русский язык	22	Experimental	dotnet-tts-libraries	—	C#
4785	Noor-khalid/Selena 🚀 Accelerate your .NET applications with Selena, a zero-dependency library...	22	Experimental	dotnet-tts-libraries	—	C#
4786	orbxball/timit-preprocessor Extract mfcc vectors and phones from TIMIT dataset	22	Experimental	automatic-speech-recognition	16	Shell
4787	Mliviu79/cartesia-go Go SDK for the Cartesia AI API — TTS, STT, voice cloning, agents, WebSocket streaming	22	Experimental	go-tts-libraries	—	Go
4788	yauhenipakala/Yandex.SpeechKit.Xamarin Yandex SpeechKit Mobile SDK for Xamarin	22	Experimental	yandex-speechkit-tools	1	C#
4789	Artavazd2009/yandex-speechkit-php Provide easy PHP access to Yandex SpeechKit API for audio transcription,...	22	Experimental	yandex-speechkit-tools	—	PHP
4790	MarceloSalazarV/Multimodal_Med_Ai_with_Deployment 🩺 Enhance patient care with MediBot 2.0, an AI doctor assistant that...	22	Experimental	multimodal-medical-assistants	—	Python
4791	Ashish-Patnaik/Sonya-TTS High-fidelity AI speech with emotion, rhythm, and audiobook mode	22	Experimental	lightweight-tts-libraries	4	Python
4792	A-AhkUser/Dictation-Interface dictation interface using UI automation via a chrome extension	22	Experimental	web-speech-api-libraries	6	AutoHotkey
4793	priyanshu-baran/Voice_Assistant_Using_Java Tried to make JARVIS (Voice Assistant) using Java	22	Experimental	android-voice-assistants	2	Java
4794	denz-pro/CoAI-PCB CoAI-PCB offers an AI-driven PCB inspection module that detects defects with...	22	Experimental	assistive-vision-ai	—	—
4795	gustavhartz/voxtir Collaborative transcription service that keeps getting better	22	Experimental	whisper-speech-transcription	23	TypeScript
4796	rshivam08/Deaf-Assistant An Android application for assisting deaf people	22	Experimental	sign-language-recognition	6	Java
4797	aitoraznar/ionic2-speech-recognition ionic2 JS Speech Recognition	22	Experimental	web-speech-api-libraries	1	TypeScript
4798	zry98/pomumd Wyoming Protocol TTS and STT & MLX LLM server for iOS/macOS	22	Experimental	lightweight-tts-runtimes	1	Swift
4799	duongdz-create/Voicebot-Reservation-system-for-Hotels 🛏️ Explore and book hotels effortlessly with our AI-driven voicebot,...	22	Experimental	voice-chatbot-applications	—	Python
4800	lancetodjk14/react-native-sherpa-onnx-stt 🎤 Enable offline speech recognition in React Native using sherpa-onnx,...	22	Experimental	react-native-voice-libraries	—	C

« Prev 1 2 3 … 46 47 48 49 50 … 80 81 82 Next »