All Voice AI Tools
8,165 tools ranked by quality score · Page 58 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 5701 |
Vidyut/vidyut-tts
Streamlit frontend for Coqui-tts |
|
Experimental |
| 5702 |
Sid-V5/EchoSynth
Voice synthesis platform with TTS and STT. FastAPI backend, voice cloning,... |
|
Experimental |
| 5703 |
abpai/tts-gateway
A local text-to-speech gateway with a pluggable engine architecture |
|
Experimental |
| 5704 |
d1pankarmedhi/CascadeS2S
A low-latency (<5s) cascade-style speech-to-speech conversational system |
|
Experimental |
| 5705 |
Xalab/recognizer
Desktop app for recognize speech offline by using Vosk. |
|
Experimental |
| 5706 |
iuliiakr/TTS-Project-Framework
Architecture framework for building production-grade text-to-speech systems,... |
|
Experimental |
| 5707 |
jorelius/Speak
Speak is a command line utility for reading text aloud or writting the audio... |
|
Experimental |
| 5708 |
furkankarakuz/TranslateAI
TranslateAI is a powerful real-time speech translation desktop application... |
|
Experimental |
| 5709 |
sriramsme/VidCaptio
video captioning software |
|
Experimental |
| 5710 |
dehyabi/textor-ai
A powerful Speech-to-Text API built with Django REST Framework and... |
|
Experimental |
| 5711 |
graphcore/whisper-ai
Speech Recognition (ASR) on Graphcore IPUs using OpenAI's Whisper |
|
Experimental |
| 5712 |
vpakarinen2/omnilocal
Local voice-enabled assistant. |
|
Experimental |
| 5713 |
powerpig99/readaloud
Local-first text-to-speech reader powered by Qwen3-TTS. 9 voices, 10... |
|
Experimental |
| 5714 |
princesingh-ai-dev/JARVIS-Voice-Assistant
🤖 AI-powered voice assistant with Whisper STT, Groq LLM, real-time TTS,... |
|
Experimental |
| 5715 |
Thijsn04/MediClear-AI
An intelligent medical translator powered by Google Gemini 2.5. Simplifies... |
|
Experimental |
| 5716 |
Inc44/TheTTS
Synthesize speech using state-of-the-art open and closed-source tools |
|
Experimental |
| 5717 |
lostvikx/reddisyte
A program to extract content off of Reddit 🐛 The name is derived by reddit + parasite |
|
Experimental |
| 5718 |
saxil/mareen
Mareen - A privacy-focused voice assistant with 3D orb UI, powered by Ollama... |
|
Experimental |
| 5719 |
marcusau2/VOX-1-Audiobook-Maker
VOX-1 Audiobook Maker is a local, GPU-accelerated studio for creating... |
|
Experimental |
| 5720 |
ssharanyab/persona-tts
PersonaTTS is a personalized neural text-to-speech system that learns a... |
|
Experimental |
| 5721 |
egorsmkv/asr-datasets-cleaner
A pipeline to make ASR datasets better |
|
Experimental |
| 5722 |
shahruk10/go-sctk
Go CLI wrapper around SCTK binaries for word error rate evaluation and error... |
|
Experimental |
| 5723 |
weimeng23/audio-speech-datasets
:scroll: A list of various Audio/Speech datasets about Speech Recognition,... |
|
Experimental |
| 5724 |
brailcom/festival-czech
Czech support for Festival |
|
Experimental |
| 5725 |
adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper... |
|
Experimental |
| 5726 |
arjunbazinga/speak
Select any text and have it read out loud |
|
Experimental |
| 5727 |
innerNULL/simpler-distil-whisper
Simpler Distil-Whisper |
|
Experimental |
| 5728 |
msalhab96/AraSpot
The official implementation of the AraSpot research paper |
|
Experimental |
| 5729 |
JoeBiellik/speechlauncher
Very simple, yet functional voice activated launcher |
|
Experimental |
| 5730 |
caitunai/wake_demo
An android project to show how to use snowboy to wake up app by voice |
|
Experimental |
| 5731 |
SprtnDio/Complete-Local-Discord-AI-Voice-Chat-Bot
AI Discord bot that acts as an insulting oracle. Ask questions by voice or... |
|
Experimental |
| 5732 |
Pasqual3/Stories-Teaching-Autism-Reality-storieAmiche
Piattaforma web innovativa per il supporto dell'autismo e della... |
|
Experimental |
| 5733 |
5j9/cliptalk
Clipboard monitor that converts copied text to speech (TTS) using... |
|
Experimental |
| 5734 |
nicremo/qwen3-tts-chunked-webui
Qwen3-TTS Voice Cloning WebUI with automatic text chunking - Optimized for... |
|
Experimental |
| 5735 |
muhammedsaban/coqui-xtts-v2-turkish-local
A locally running Turkish text-to-speech application developed with Coqui... |
|
Experimental |
| 5736 |
al-develop/SmartVocabulary
Dictionary, filled with your own words and phrases, for many languages. Uses... |
|
Experimental |
| 5737 |
jefrydco/text2speech-js
Wrapper around browser Text to Speech API |
|
Experimental |
| 5738 |
kuanyshbakytuly/camera-text-speech
Blind Text-Assistance |
|
Experimental |
| 5739 |
Voinic/microtts
Simple TTS library for MicroPython that works offline |
|
Experimental |
| 5740 |
robauto/bibli3.0
BiBli 3.0 for Raspberry Pi - Swarm Robotics and IoT Operating System - AI -... |
|
Experimental |
| 5741 |
khaykingleb/research-playground
Efficient ML/DL implementations across multiple domains with K3s multi-node... |
|
Experimental |
| 5742 |
Jyotibrat/Speech-To-Text
Speech to Text model |
|
Experimental |
| 5743 |
Adisol07/SharpSpeech
SharpSpeech is free, local and open source way to speech and wake word recognition. |
|
Experimental |
| 5744 |
SSusantAchary/AI_Resources
Have read and collected few Interesting Papers , Projects |
|
Experimental |
| 5745 |
ponchotitlan/google_text-to-speech_prompt_maker
Utility for Google Text-To-Speech batch audio files generator. Ideal for... |
|
Experimental |
| 5746 |
SouthernMethodistUniversity/whisper-transcription
Helm chart repo for application developed by OIT STARs students for audio... |
|
Experimental |
| 5747 |
tb0hdan/voiceplay
Client-side first music centered voice controlled player |
|
Experimental |
| 5748 |
tzneal/gopicotts
go wrapper around the pico text to speech engine |
|
Experimental |
| 5749 |
shun126/VoicevoxPlayer
VoicevoxのUnreal Engine 4.27.2 ~ / Unreal Engine 5 プラグイン |
|
Experimental |
| 5750 |
JacketsMask/Toland-Destiny-2-Bounty-Optimizer
Speech recognition to help optimize clearing bounties in Destiny 2 |
|
Experimental |
| 5751 |
slemonide/lost
A maze exploring game with TTS messages |
|
Experimental |
| 5752 |
Luigi-Pizzolito/YukkuriTalk
A command-line program which uses AquesTalk10's Yukkuri TTS. Offline, single-binary. |
|
Experimental |
| 5753 |
jackaduma/speaker_recognition_models.pytorch
speaker recognition / speaker verification models in pytorch implementation |
|
Experimental |
| 5754 |
iamnortey/ninolex-gh
Open Ghanaian pronunciation dictionary for TTS and AI systems — IPA, CSV,... |
|
Experimental |
| 5755 |
ubisoft/ubisoft-laforge-french-homograph-dataset
Dataset for La Forge Speech Synthesis System Submission to the Blizzard... |
|
Experimental |
| 5756 |
tuanio/conformer-rnnt
Conformer RNN-Transducer |
|
Experimental |
| 5757 |
moego0/custom_KWS
End-to-end pipeline for training a custom keyword detection model with... |
|
Experimental |
| 5758 |
Vlad1343/Gesture-Translator
British Sign Language Translator is a real-time AI-powered system that... |
|
Experimental |
| 5759 |
neeraj-nagiri/Assistant-Bro-
Assistant "Bro" is a voice-controlled personal assistant that opens... |
|
Experimental |
| 5760 |
pl146/manga-voice-reader
AI-powered Chrome extension that reads manga speech bubbles aloud. Bubble... |
|
Experimental |
| 5761 |
masonintokyo/voicevox-srt-to-speak
VOICEVOX Engine APIを使ってSubRipファイルから各セリフ時間内に収まるように音声合成します。 |
|
Experimental |
| 5762 |
UG-SEP/Text-to-speech-convertor
Blind people do not able to see so they cannot read text with their eyes so... |
|
Experimental |
| 5763 |
Thukyd/OpenAI-Spechify-Your-Docs
OpenAI-Spechify-Your-Docs is a Python project that converts text from... |
|
Experimental |
| 5764 |
Hexer10/HexTTS
Make client latedownload text to speech sounds |
|
Experimental |
| 5765 |
FairyDevicesRD/droid.josee.tts
軽量に動作するAndrid API対応のローカルTTSサービスアプリ |
|
Experimental |
| 5766 |
nmanikiran/ionic-allinone
This is to give a demo of each feature that are there in ionic and ionic-native |
|
Experimental |
| 5767 |
syedzubeen/podcasts
Podcasts.AI: Transcribe podcasts in a click and unlock a world of searchable... |
|
Experimental |
| 5768 |
myrmlbst/transcribe.AI
Webapp hosting machine learning models to generate downloadable audio... |
|
Experimental |
| 5769 |
ookgezellig/videotools
A collection of tools to cut, compress, extract, amplify and transcribe... |
|
Experimental |
| 5770 |
DuyguA/Interspeech2025-Smooth-Operating-LLMs-for-Disfluency
Innovative approach for modelling speech disfluencies with LLaMa and Conformer. |
|
Experimental |
| 5771 |
nick1udwig/ursr
UrSR: Urbit Speech Recognition |
|
Experimental |
| 5772 |
taeefnajib/Aximos
Aximos is an innovative AI-powered tool that transforms your content into... |
|
Experimental |
| 5773 |
isbendiyarovanezrin/SpeechDetection
Speech Detection 💬 |
|
Experimental |
| 5774 |
parula-app/assistant
Parula - Digital assistant - Running entirely on your own device |
|
Experimental |
| 5775 |
passion-27/openai-whisper-api
A sample speech transcription app implementing OpenAI Text to Speech API... |
|
Experimental |
| 5776 |
ReadieFur/Stream-Tools
A stream chat tool that features AWS text to speech, voice commands, chat... |
|
Experimental |
| 5777 |
zguesmi/image2speech
Ethereum ready Dapp to speak your images. |
|
Experimental |
| 5778 |
LiamBrandt/tts_decode
A decoder for TTS files from 7 Days to Die |
|
Experimental |
| 5779 |
khaykingleb/automatic-speech-recognition
QuartzNet and DeepSpeech implementation for ASR |
|
Experimental |
| 5780 |
markus-m-u-e-l-l-e-r/CTC.ISL
ISL Speech Recognition Toolkit for training neural networks with the CTC... |
|
Experimental |
| 5781 |
Omitg24/IIS-ASR
Repositorio para Administración de Sistemas y Redes (ASR), asignatura del... |
|
Experimental |
| 5782 |
CSFelix/audio-to-text
🔊 Extract Text from Audios 🔊 |
|
Experimental |
| 5783 |
Zuellni/XTTS-Server
XTTS Server for SillyTavern. |
|
Experimental |
| 5784 |
kiritoInd/YouTube_Audio_Transcripter
Youtube Audio transcription with WhisperAi , The script downloads audio from... |
|
Experimental |
| 5785 |
emirkaanozdemr/MultiLingualVoice
MultiLingualVoice is an innovative application designed to bridge language... |
|
Experimental |
| 5786 |
carolinezhao/speech-to-text
A google extension used for converting voice to text in real-time. |
|
Experimental |
| 5787 |
Ahmed5attab/Grades-Assistants-
Assistant iOS application helps the teacher review his students data and... |
|
Experimental |
| 5788 |
vladevelops/trainer
Your personal trainer, no yapping |
|
Experimental |
| 5789 |
adityajn105/google_speech_diarization_demo
A demo to show Speech Diarization (seperating audio of different speaker)... |
|
Experimental |
| 5790 |
attwad/cdf
Worker and elasticsearch for automated College de France audio transcripts |
|
Experimental |
| 5791 |
trypsynth/battery-mon
macOS application that lives in your menu bar and periodically reports your... |
|
Experimental |
| 5792 |
dannis999/trained_SpeechRecognition
此项目用于备份一个完整的中文语音识别环境,包括环境配置和预训练模型,以方便直接使用 |
|
Experimental |
| 5793 |
armados/automaticschoolbell
Automatic School Bell |
|
Experimental |
| 5794 |
romestylez/pocketChat
Dein Stream in der Tasche — Chat lesen, schreiben und moderieren, Events von... |
|
Experimental |
| 5795 |
tanvi355/Video-to-PDF
⚡ Convert any video of your choice to a PDF file using this Python script. |
|
Experimental |
| 5796 |
purarue/tts
CLI tool to convert text to speech using the StreamLabs API |
|
Experimental |
| 5797 |
agungmahardikka/ConnectWave
🌐 Enable seamless communication for deaf and mute individuals with... |
|
Experimental |
| 5798 |
daftmaple/soundboard-channel-points-v2
Second version of Twitch soundboard/TTS application, with slightly improved... |
|
Experimental |
| 5799 |
CingZeoi/OneCore-SAPI5
Allow calling OneCore voice engine with SAPI5 |
|
Experimental |
| 5800 |
alextsao1999/assistant
hypermind assistant 语音识别助手 |
|
Experimental |