All Voice AI Tools
8,165 tools ranked by quality score · Page 60 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 5901 |
Masihtabaei/reswhis
A lightweight, WebSocket-based server for real-time, remote audio... |
|
Experimental |
| 5902 |
fatehmtd/gladiapp
C++ client library for the Gladia API |
|
Experimental |
| 5903 |
dudarev/speechdown
CLI tool to transcribe your spoken audio notes into timestamped,... |
|
Experimental |
| 5904 |
moziarnj07-sys/doubaoime-asr
🎤 Enable voice recognition for the Doubao input method using Python; ideal... |
|
Experimental |
| 5905 |
Joyeah/videomaker
Batch-generate videos from images |
|
Experimental |
| 5906 |
HoangLayor/LiveTranslator
LiveTranslator is a real-time speech translation system that captures spoken... |
|
Experimental |
| 5907 |
Maksim-Goncharovskiy/video-dubbing
Dubbing English videos into Russian. |
|
Experimental |
| 5908 |
patelritiq/CodeClause-Internship-Projects
A comprehensive collection of 4 Python applications developed during a... |
|
Experimental |
| 5909 |
001kenji/Text_To_Speech_AI
A modern web application that converts text to speech using advanced TTS... |
|
Experimental |
| 5910 |
porcelluscavia/audio-model
My Master's thesis project in audio classification using PyTorch and... |
|
Experimental |
| 5911 |
shrey802/PyTTSeval
Evaluation tool for TTS systems |
|
Experimental |
| 5912 |
skye-cyber/ttskit3
A lightweight text-to-speech toolkit |
|
Experimental |
| 5913 |
deepgram-starters/ruby-text-to-speech
Get started using Deepgram's Text-to-Speech with this Ruby demo app |
|
Experimental |
| 5914 |
JuanJRA20/Conversor-Texto-a-Voz
🎙️ Intelligent text-to-audio conversion system with detection... |
|
Experimental |
| 5915 |
laustke/jimlet_classic
Offline text-to-speech GUI converter with drag-and-drop support,... |
|
Experimental |
| 5916 |
James-P-D/SDRTranscriber
SDR audio transcriber in Python |
|
Experimental |
| 5917 |
crrrowz/Vosk-STT-Chrome-Extension
Real-time Speech-to-Text Chrome Extension — dictate into any input field... |
|
Experimental |
| 5918 |
deepgram-starters/php-text-to-speech
Get started using Deepgram's Text-to-Speech with this PHP demo app |
|
Experimental |
| 5919 |
Sergey004/Phone_Guy
An AI phone character based on Phone Guy from FNAF |
|
Experimental |
| 5920 |
Sgvkamalakar/Gita_Summarizer
Gita Summarizer extracts key insights from the Bhagavad Gita, aiding... |
|
Experimental |
| 5921 |
sknadig/ASR_2018_T01
Example repository for 2018 DS/NC 821 / Automatic Speech Recognition projects |
|
Experimental |
| 5922 |
madhura-23/ai-voice-assistant
🎤 AI Voice Assistant - Real-time speech recognition, production-ready, fully... |
|
Experimental |
| 5923 |
Pchambet/tp-hmm-markov
Markov Chains and Hidden Markov Models: weather modeling with discrete... |
|
Experimental |
| 5924 |
dsalnikov/wav2vec
Pure NumPy implementation of wav2vec 2.0 |
|
Experimental |
| 5925 |
YoungloLee/tf2-speech-recognition-transformer
TensorFlow 2 Speech Recognition Code (Transformer) |
|
Experimental |
| 5926 |
deepgram-starters/csharp-text-to-speech
Get started using Deepgram's Text-to-Speech with this C# demo app |
|
Experimental |
| 5927 |
bryanstevensacosta/tts-studio
Personal voice cloning CLI tool using XTTS-v2 |
|
Experimental |
| 5928 |
shantoshdurai/GhostTalker
AI voice cloning and text-to-speech using XTTS — talk to historical figures... |
|
Experimental |
| 5929 |
ErenBalkis/rvc-tts-studio
A Streamlit-based web interface that converts text to speech using edge-tts... |
|
Experimental |
| 5930 |
karthikrshet/text-to-speech
Convert any text into lifelike speech. Choose your language and voice. |
|
Experimental |
| 5931 |
rookiemann/portable-tts-server
Portable multi-GPU text-to-speech server for Windows — 10 AI models, gateway... |
|
Experimental |
| 5932 |
Uchastnick/malisa
Malisa, the voice assistant robot |
|
Experimental |
| 5933 |
ikeoffiah/kokoro_tts
On-device Kokoro TTS for Flutter — high-quality text-to-speech using ONNX... |
|
Experimental |
| 5934 |
MrThinkins/text-to-speach-native-to-web
A TTS that runs natively in the browser using the kokoro.js library. |
|
Experimental |
| 5935 |
Tharindu-Senanayake12/Sign-Language-Interpreter
Real-time AI sign language interpreter with gesture recognition, NLP... |
|
Experimental |
| 5936 |
indaco/md2audio
Convert Markdown sections to audio files using multiple TTS providers - a... |
|
Experimental |
| 5937 |
vroomfondel/sipstuff
SIP telephony automation toolkit — place calls via PJSIP, play WAV/TTS... |
|
Experimental |
| 5938 |
Jhanwi/Intelligent-Desktop-Companion
This project developed a personalized Python-based voice-controlled... |
|
Experimental |
| 5939 |
shervinnd/Persian-Voice-Assistant-for-Home-Appliance-Repairs
🛠️ A Persian voice assistant to help with diagnosing and repairing home... |
|
Experimental |
| 5940 |
200-DevelopersFound/Havo
The mobile application you envision is designed to facilitate the conversion... |
|
Experimental |
| 5941 |
chirag127/SystemAudioTranscriber-RealTime-SystemAudio-To-Text-Windows-App
Real-time transcription of Windows system audio to text via a floating,... |
|
Experimental |
| 5942 |
zefie/multi-tts
Docker for multiple TTS engines with a Gradio interface |
|
Experimental |
| 5943 |
gcryptonlabs/FlowCue
FlowCue — native macOS teleprompter with real-time speech tracking, AI... |
|
Experimental |
| 5944 |
wehomemove/WhisprByTheo.spoon
Push-to-talk voice transcription for macOS using MLX Whisper. Beautiful UI,... |
|
Experimental |
| 5945 |
jswallez/jetvoice
Voice to text for macOS. Press a hotkey, speak, get instant transcription. |
|
Experimental |
| 5946 |
myl7/doubao-voice-input-electron
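Doubao real-time speech-to-text desktop app: press a hotkey or hold a designated key, and the recognized speech is typed automatically into the current application |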
|
Experimental |
| 5947 |
Revocalize/revocalize-docs
🎤 Revocalize AI API: Sing like your favorite artist with our powerful AI... |
|
Experimental |
| 5948 |
lukeocodes/clarion
macOS menu bar app that reads text aloud using Deepgram TTS |
|
Experimental |
| 5949 |
giefferre/texttospeech
Google Cloud Text-to-Speech API Client Library for Go |
|
Experimental |
| 5950 |
lask3802/live-translator
Real-time AI-powered transcription and translation Chrome extension for live... |
|
Experimental |
| 5951 |
qora-protocol/QORA-TTS-12Hz-0.6B
Pure Rust TTS engine with 9 built-in speakers. No Python, no CUDA, no... |
|
Experimental |
| 5952 |
GustasG/vits
VITS Text-to-Speech Model for Lithuanian Language |
|
Experimental |
| 5953 |
davideferrari95/alexa_voice_control
This repository allows you to establish communication between ROS / ROS2... |
|
Experimental |
| 5954 |
SurveAditya/StudentManagementSystem
A student management system with graph plotting and voice recognition implemented. |
|
Experimental |
| 5955 |
GlobussBiogestion/text-to-signals-and-voice
This API works 100% in HTML with JavaScript, so it is very light and easy to... |
|
Experimental |
| 5956 |
mneme-verse/mneme
Open-source mobile app for memorizing poetry using Spaced Repetition and... |
|
Experimental |
| 5957 |
Ask149/friday
A macOS desktop companion with an animated face, voice I/O, and personality... |
|
Experimental |
| 5958 |
Robertinoos13/PyroSpeak-Library
PyroSpeak is a small Python wrapper library that uses big technologies like... |
|
Experimental |
| 5959 |
dgaida/text2speech
Provides text2speech capabilities using ElevenLabs and Kokoro TTS |
|
Experimental |
| 5960 |
KF-R/turk-chat
Lightweight speech-to-speech web-based chat app combining speech... |
|
Experimental |
| 5961 |
Bsh54/AI_Phone_Call
Web application that transforms traditional speech synthesis into... |
|
Experimental |
| 5962 |
Eleven1111/groq-whisper
Groq-powered OpenClaw speech tools for local audio transcription and... |
|
Experimental |
| 5963 |
trentw/script-to-speech
Convert screenplays into multi-voiced audiobooks using various... |
|
Experimental |
| 5964 |
martins-vds/my-assistant
A voice-driven personal task-tracking assistant for tech workers who... |
|
Experimental |
| 5965 |
BedirT/NarratorX
📖 NarratorX: Turn your PDFs into captivating audiobooks in 16 languages,... |
|
Experimental |
| 5966 |
zyascend/End-to-End-Speech-Recognition-Learning
ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition |
|
Experimental |
| 5967 |
avreliusdante-web-creator/voice-input
Browser extension: convert voice to text and send it with one click in open... |
|
Experimental |
| 5968 |
noly24/spoken-subtitles
Chrome extension that reads subtitles aloud on streaming sites for accessibility |
|
Experimental |
| 5969 |
fromis-9/audio-fm
Create narrated countdowns of your top tracks from Last.fm |
|
Experimental |
| 5970 |
MnAkash/aalap
A speech-to-speech dialogue management package using faster-whisper ASR,... |
|
Experimental |
| 5971 |
LiiLk/Local-AI-Companion
A private, offline AI assistant running entirely on your local machine. |
|
Experimental |
| 5972 |
punyamodi/Speech-to-Speech-Local-LLM
Local speech-to-speech AI assistant with voice cloning, Gradio UI,... |
|
Experimental |
| 5973 |
Bailie-L/VelaNova
Fully offline voice assistant powered by local LLMs — no cloud, no... |
|
Experimental |
| 5974 |
seanghay/vits.cpp
VITS inference using ONNX Runtime in C++ |
|
Experimental |
| 5975 |
michael-borck/talk-buddy
Provides AI-powered conversation practice with speech recognition and... |
|
Experimental |
| 5976 |
ebisuryu/vision-ai-intern-assignment
This repository contains my solution for the Vision AI intern assignment at... |
|
Experimental |
| 5977 |
okamyuji/HomeCareVoiceLog
Offline iOS voice-first care journal with automatic on-device transcription... |
|
Experimental |
| 5978 |
theubie/OpenTAAI
Read chat log from a Twitch channel and get a natural response from OpenAI. ... |
|
Experimental |
| 5979 |
eryk-mazus/sigh
Seamless Voice Interactions with LLMs |
|
Experimental |
| 5980 |
miranda1000/TwitchTTSBot
A Twitch bot that reads point redemptions with a custom trained voice. |
|
Experimental |
| 5981 |
ttsaigit/tts-ios
TTS.ai iOS app — 18 AI text-to-speech models, voice cloning, speech-to-text |
|
Experimental |
| 5982 |
msalhab96/Listen-Attend-and-Spell
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper |
|
Experimental |
| 5983 |
nkm90/HearMeWhenYouCanNotSeeMe
Sign language recognition using the multi-hand tracking solution from MediaPipe,... |
|
Experimental |
| 5984 |
erich2s/native-speak
A simple text-to-speech library for Node.js using system-native TTS engines |
|
Experimental |
| 5985 |
pyzskw/meeting-teleprompter
Online meeting teleprompter with offline ASR: speech-recognition-driven auto-follow, anti-screenshot protection, focus mode, offline models |
|
Experimental |
| 5986 |
brlin-tw/whisper.cpp-snap
Provides easy access to the whisper.cpp application on snap-enabled OS distributions. |
|
Experimental |
| 5987 |
hubetcardenasi/SpeechApp
Turn your phone into a voice application. |
|
Experimental |
| 5988 |
laravieira/reddit-to-tiktok
This project is a Python rendering and publishing pipeline that takes Reddit... |
|
Experimental |
| 5989 |
mostlyvirtual/book-to-audiobook
Convert PDFs and EPUBs into MP3 audiobooks with a clean local web UI,... |
|
Experimental |
| 5990 |
upskyy/RNN-Transducer
PyTorch Implementation of RNN-Transducer |
|
Experimental |
| 5991 |
avrtt/MoE-speech-recognition
Mixture of experts architecture for speech-to-text and language... |
|
Experimental |
| 5992 |
Yacinewhatchandcode/VoiceCloning
🎙️ Real-Time TTS & Voice Cloning Pipeline — F5-TTS · PyTorch · Gradio · Voice Agent |
|
Experimental |
| 5993 |
xAlpharax/edge-tts-gradio
Gradio Interface for Text-To-Speech using Edge TTS. |
|
Experimental |
| 5994 |
wacumov/stttool
A command-line utility for converting audio files to text using a pretrained model. |
|
Experimental |
| 5995 |
Inexpli/Discord-Jarvis
A real-time Discord voice assistant powered by Llama 3, Whisper, and Web... |
|
Experimental |
| 5996 |
Matrixxboy/vermeil
Vermeil is a personal assistant, just like Jarvis |
|
Experimental |
| 5997 |
cybernahx/urdu-voice-assistant
An Urdu language voice assistant built with Python for speech recognition and TTS |
|
Experimental |
| 5998 |
Dragon745/urdu-roman-dictionary
A growing open-source Urdu → Roman Urdu dictionary and lexicon for... |
|
Experimental |
| 5999 |
Largo-m/AutoCaption
AutoCaption is a complete, fully automated tool for generating video... |
|
Experimental |
| 6000 |
mcp-tool-shop-org/voice-soundboard
TTS library for AI agents — compiler/graph/engine architecture, swappable... |
|
Experimental |