All Voice AI Tools
8,165 tools ranked by quality score · Page 50 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 4901 | isayahc/Semi-Automated-Youtube-Channel<br>Semi automated youtube channel that has a lot of cool features for someone... | | Experimental |
| 4902 | rafalposwiata/text-normalization<br>Repository for text normalization research. | | Experimental |
| 4903 | SPACESODA/read-txt<br>Read TXT is a lightweight text-to-speech reader with auto language detection... | | Experimental |
| 4904 | SAMKhadka/ace-step-ui<br>🎵 Generate AI music effortlessly with ACE-Step UI, the open source... | | Experimental |
| 4905 | allisonandreyev/WhisperQuantization<br>WhisperCPP (FP32) INT8, INT4, INT5, quantization effect on model latency and... | | Experimental |
| 4906 | CaesiumY/dding-dong<br>Claude Code notification plugin — Sound alerts & OS notifications on task... | | Experimental |
| 4907 | vpakarinen/kokorotts-webui<br>WebUI for Kokoro text-to-speech. | | Experimental |
| 4908 | Mayank17M/vocalize<br>A speech recognition app that helps you keep track of your mental health and... | | Experimental |
| 4909 | m-cheicki/VoiceOver_front<br>🎙️🎤 VoiceOver is a web application that allows you to transcribe English... | | Experimental |
| 4910 | Bushramjad/XLSR-Wav2Vec2-Speech-Recognition-Urdu<br>Speech Recognition in Urdu language by fine-tuning the pretrained... | | Experimental |
| 4911 | Gyvastis/google-speech-tts<br>A wrapper for Google Translate to generate audio from text. | | Experimental |
| 4912 | fpaupier/tts-distil-whisper<br>Distil whisper on web | | Experimental |
| 4913 | toavina2018/task-pilot<br>📋 Manage projects efficiently with TaskPilot, a full-stack application... | | Experimental |
| 4914 | nilkanthshirodkar/Speech-Recognition-Using-HMM<br>Automatic Speech Recognition (ASR) system was implemented using the HMM... | | Experimental |
| 4915 | ladykot/Butler<br>Prototype of a virtual butler based on Yandex SpeechKit | | Experimental |
| 4916 | skykongkong8/AI_device_with_RaspberryPi<br>Python/GPIO code for Tangible Artificial Intelligence device with RaspberryPi | | Experimental |
| 4917 | NickSwardh/StreamSpeechToText<br>Stream Mp3 & Opus to Azure's Speech to Text without GStreamer | | Experimental |
| 4918 | FelixWaweru/Copresenter<br>A virtual co-host that makes presentations a breeze by using AI to read out... | | Experimental |
| 4919 | sergix44/oddcast-tts-php<br>A PHP interface to the online Oddcast demo API. | | Experimental |
| 4920 | hahaanisha/digipal<br>Bridging the digital divide with interactive learning, voice guidance, and... | | Experimental |
| 4921 | thewh1teagle/zipvoice-onnx<br>TTS with ZipVoice and onnxruntime | | Experimental |
| 4922 | Shuichi346/qwen-voice-clone-webui<br>A Gradio WebUI for voice cloning powered by Qwen3-TTS. Provide reference... | | Experimental |
| 4923 | f76tbntbww-crypto/VoiceForge<br>One-click local AI voice assistant powered by ASR+LLM+TTS, 100% coded by... | | Experimental |
| 4924 | waltervanheuven/speech2text<br>Speech2Text | | Experimental |
| 4925 | elemarmar/joke-teller<br>🤖💬 Joke Teller gets random jokes from third party API and converts them to... | | Experimental |
| 4926 | NJUxlj/hotel-voice-agent-manual<br>A RAG voice dialogue assistant for Shanghai travel information queries. The user's voice input is converted to text with ASR, the Zhipu API then searches the knowledge base and generates a reply with RAG, and the reply is finally converted to speech with TTS. | | Experimental |
| 4927 | x07x08/waveboard<br>A simple cross-platform soundboard | | Experimental |
| 4928 | milosgajdos/playht_rs<br>PlayHT TTS Rust crate | | Experimental |
| 4929 | tristan-mcinnis/Realtime-Whisper-Console-Transcriber<br>A real-time speech-to-text transcriber using the Whisper model, designed for... | | Experimental |
| 4930 | edgarmedrano/javier-js-code<br>JAvascript Voicexml InterpretER. This is the JavaScript implementation, if... | | Experimental |
| 4931 | burritosoftware/mira<br>A modular text-to-speech Discord bot for Bay Area public transit systems. | | Experimental |
| 4932 | fruxc/Voice-Assistant-Based-News-App<br>Artificial-Intelligence based news application - A web application which... | | Experimental |
| 4933 | ab-smith/kokoro-tts-webui<br>Gradio-based web UI for Kokoro to simplify its usage with multiple voices,... | | Experimental |
| 4934 | Jayden-X-L/lobster-radio-skill<br>Personalized news radio generation service driven by a local qwen3 model - OpenClaw Skill | | Experimental |
| 4935 | ali-ibnouf/SmartTalker<br>Digital Human AI Agent Platform — Real-time talking avatar with Arabic-first support | | Experimental |
| 4936 | JhonatanAiT14/dictate.sh<br>🎤 Transcribe speech with low-latency on Apple Silicon using dictate.sh;... | | Experimental |
| 4937 | dragonchen0131/Ai_Lee_translator<br>An ancient/modern Chinese translator with a unique voice | | Experimental |
| 4938 | nerdpudding/nerdpudding<br>The proof is in the pudding. Real-time AI video commentary with... | | Experimental |
| 4939 | famda/semantics<br>Semantics CLI - Unified interface for media intelligence | | Experimental |
| 4940 | faizalichsan1337/ai-podcast-clipper-saas<br>🎥 Create engaging short clips from podcasts using AI to boost visibility on... | | Experimental |
| 4941 | HQQHQ/FinetuneSpeechT5-Spanish<br>This repository hosts the code and resources for fine-tuning a SpeechT5... | | Experimental |
| 4942 | speak-rs/speakly<br>High-performance, extensible speech recognition toolkit for Rust — OpenAI... | | Experimental |
| 4943 | neshani/Kitten-Offline-TTS<br>Kitten Offline Mobile TTS Webapp | | Experimental |
| 4944 | venusdev85/Speech-Recognition<br>End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow | | Experimental |
| 4945 | Tanaka-zi/VoiceR<br>VoiceR is a Linux voice control app that lets you control games using speech... | | Experimental |
| 4946 | type-a/speechnet<br>Automatic Speech Recognition | | Experimental |
| 4947 | sajjadabbasi1383/Voice-Translation<br>Online translation of text and voice and scanning of images | | Experimental |
| 4948 | gheyret/thuyg20_scripts<br>Script files of THUYG-20 (a free Uyghur speech database released by... | | Experimental |
| 4949 | ashisbehera/Smart_Alarm<br>This project is a text-to-speech alarm application. | | Experimental |
| 4950 | sayak119/Express<br>Express Yourself. | | Experimental |
| 4951 | manasmodak/SpeechRecognition<br>WPF App to show text-to-speech and speech recognition | | Experimental |
| 4952 | Bugsbunnydev2000/Analysis-of-body-language-and-speech-in-video<br>Analysis of body language and speech in video with LLMs | | Experimental |
| 4953 | boboyiyi/multi_speaker_tacotron<br>A TensorFlow implementation of multi speaker Tacotron speech synthesis | | Experimental |
| 4954 | nullbyte91/amazon-polly-TTS<br>A Simple Text To Speech application using Amazon Polly - Excel to MP3 | | Experimental |
| 4955 | rupac4530-creator/ai-desktop-assistant<br>Voice-controlled AI desktop assistant \| 100% local & private \| Whisper +... | | Experimental |
| 4956 | loganngarcia/chaplin-ui<br>Web interface for a real-time silent speech recognition tool. | | Experimental |
| 4957 | RiccardoGrin/TerminalWhisper<br>Voice-to-text for Windows using OpenAI Whisper. Hold a hotkey, speak, text appears. | | Experimental |
| 4958 | natelindev/voice-agent<br>Low-latency real-time terminal voice assistant with VAD, ASR, LLM, and TTS | | Experimental |
| 4959 | artryazanov/gemini-speech-to-speech-translator<br>Transform your audio content into any language with high accuracy and... | | Experimental |
| 4960 | LINSUISHENG034/Qwen3-ASR-Desktop<br>Modern PyQt6 desktop GUI for Qwen3-ASR with batch transcription support | | Experimental |
| 4961 | jerrykuo7727/ASR-common-voice-zh-tw<br>HMM-based ASR systems trained on CommonVoice (zh-TW) using Kaldi. | | Experimental |
| 4962 | Huzaifa-code/SpeakFlow<br>SpeakFlow: A React-based web app for real-time speech transcription and... | | Experimental |
| 4963 | ZeiraxGaming/captainslog-whisper<br>Convert your voice to text locally using Whisper without sending data to the... | | Experimental |
| 4964 | VirtualZer0/StreamTalkerServer<br>AI text-to-speech server powered by Qwen3-TTS with voice cloning, batch... | | Experimental |
| 4965 | A5hG0/Lyrics-To-Song-Generator<br>Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +... | | Experimental |
| 4966 | svn05/vietnamese-whisper-asr<br>Fine-tuned Whisper for Vietnamese ASR with Librosa preprocessing and Gradio demo. | | Experimental |
| 4967 | siva-sub/pocket-tts-openapi-gpu<br>GPU-enhanced Pocket TTS with Remotion + TikTok captions | | Experimental |
| 4968 | opensource-spraakherkenning-nl/ASR_NL_results<br>Results of Dutch ASR models, collected by the community | | Experimental |
| 4969 | Jobijoba2000/add_dub<br>Automated video voice-over tool for Windows. Converts subtitles to speech... | | Experimental |
| 4970 | burrmill/burrmill<br>BurrMill core | | Experimental |
| 4971 | hritools/speech-to-text<br>A speech recognition library intended primarily for the Russian language | | Experimental |
| 4972 | GirlsInICT2023-Winner/smart-outdoor-activity-alerts<br>[Ericsson-LG] Girls in ICT 2023 Hackathon | | Experimental |
| 4973 | alx741/kaldi_spanish_dimex100<br>Kaldi ASR Spanish example using the DIMEx100 corpus | | Experimental |
| 4974 | sanbabyfrancis/sruthi<br>A Malayalam voice assistant built using Python | | Experimental |
| 4975 | matin91/Kasko<br>Kasko is a Talking To-do List app, which allows the user to set up Reminders... | | Experimental |
| 4976 | rahelmartim/IBM-STT-TTS<br>Project exploring IBM Watson speech-to-text and text-to-speech services in Python. | | Experimental |
| 4977 | BrotatoBoiV2/Live-Translate<br>Local, real-time AI translator for language immersion. Filters English,... | | Experimental |
| 4978 | Synapsr/Selaou<br>Validate and correct your audio transcriptions to create datasets... | | Experimental |
| 4979 | europanite/client_side_audio_transcription<br>A Browser-Based AI Audio Transcription Playground Powered by Whisper. | | Experimental |
| 4980 | YuriyGuts/gdg-speech-classifier<br>A machine learning system that recognizes the word 'Google' in human speech... | | Experimental |
| 4981 | Salut1231/wyoming-voice-match<br>🗣 Verify speaker identity and clean voice audio for accurate speech-to-text... | | Experimental |
| 4982 | LyounJAP/TTSRadioLib<br>Speech synthesis utility class based on Baidu speech synthesis | | Experimental |
| 4983 | xanderstevenson/community-content-pipeline<br>A Source of Truth for the Cisco Community Engagement, with creation and... | | Experimental |
| 4984 | idsudd/tricahue<br>🦜 Tricahue: speech transcription model specialized in Chilean Spanish | | Experimental |
| 4985 | speechly/react-ui<br>A collection of React components for Speechly-powered applications | | Experimental |
| 4986 | monish6666/avro-phonetic-go<br>📜 Convert Banglish to Bangla script seamlessly with this Go library,... | | Experimental |
| 4987 | diogosapessoa/speech-to-text<br>Speech recognizer using Xamarin MonoAndroid | | Experimental |
| 4988 | ZacDair/SER_Platform_AICS<br>This repository contains the code to create and conduct emotion recognition... | | Experimental |
| 4989 | mk-knight23/37-tool-text-to-speech<br>Production-grade Text-to-Speech utility built with Vue 3 and Web Speech API.... | | Experimental |
| 4990 | huss2342/x_news_station<br>Turn X/Twitter feed into audio | | Experimental |
| 4991 | vinsis/speech-commands-recognition<br>Single word speech recognition using PyTorch | | Experimental |
| 4992 | shr1324/orpheus-tts-docker<br>🔊 Deploy Orpheus TTS with ease using Docker, featuring GPU management,... | | Experimental |
| 4993 | sglkc/live-translate<br>🎙️ Translate as you speak using Google Chrome's Web Speech API for speech... | | Experimental |
| 4994 | Pierillo/hallucination-check<br>Automated pipeline that curates, writes, and sends a daily AI newsletter... | | Experimental |
| 4995 | bacharyehya/outloud<br>Beautiful TUI for text-to-speech. Gemini, OpenAI, or local. One command. | | Experimental |
| 4996 | kilogramme/nerdpudding<br>Provide live AI video commentary with text-to-speech for any video source,... | | Experimental |
| 4997 | davidsuragan/tulga-cli<br>TulgaCLI is a tool that allows you to chat and voice chat with virtual... | | Experimental |
| 4998 | gkcomputers040/santa-claus-is-calling<br>🎅 Create magical moments with real AI phone calls from Santa, delivering... | | Experimental |
| 4999 | ihsacm/ComfyUI-KittenTTS<br>Integrate KittenTTS into ComfyUI to enable fast, lightweight text-to-speech... | | Experimental |
| 5000 | kayrugold/andyai<br>A self-evolving, tri-brain autonomous AI agent featuring local subconscious... | | Experimental |