All Voice AI Tools
8,165 tools ranked by quality score · Page 51 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 5001 |
11dome11/Lucy---Virtual-Assistant
Lucy - a simple virtual assistant with speech recognition |
|
Experimental |
| 5002 |
laithisgood/kokoclone
Deliver fast, real-time multilingual voice cloning with an efficient neural... |
|
Experimental |
| 5003 |
leejgdh/GPT-SoVITS-ko
한국어 전용 GPT-SoVITS TTS 서비스 |
|
Experimental |
| 5004 |
voothi/20250421115831-anki-gtts-player
A powerful Anki audio add-on with a 3-tier playback system: prioritizes your... |
|
Experimental |
| 5005 |
zoebchhatriwala/ICS-I-can-speak-
This Application Converts Your Input Text Into Speech. Developed For Windows... |
|
Experimental |
| 5006 |
Thatcherismkiwi946/rustfs
🌐 Build high-performance distributed object storage easily with RustFS,... |
|
Experimental |
| 5007 |
EDWINANGO/Synchronizer
Manage server-authoritative data channels for Roblox with automatic client... |
|
Experimental |
| 5008 |
SharunDeva/deep-delta-learning
🔍 Discover Deep Delta Learning, a new framework that transforms residual... |
|
Experimental |
| 5009 |
RJoshi141/utter
Voice capture app for Apple Watch and iPhone. Speak a thought on your wrist,... |
|
Experimental |
| 5010 |
LakshmiSravyaVedantham/cutto
AI Video Director for Kids' Education — describe a lesson, get a finished... |
|
Experimental |
| 5011 |
Bubblefox9473/AI-Waifu-Vtuber
🤖 Create a multilingual AI waifu VTuber with advanced TTS, real-time lip... |
|
Experimental |
| 5012 |
Ammar-create/Pollination-tools
Free AI tools hub powered by Pollinations.ai — translator, voice studio,... |
|
Experimental |
| 5013 |
nitrogoat74/aacs
🤖 Establish a clear standard for AI governance and accountability with the... |
|
Experimental |
| 5014 |
rjtsuri1000/Audio-Gain-Module-FPGA
🔊 Implement and scale audio gain in real-time using a fixed-point DSP module... |
|
Experimental |
| 5015 |
Twerionex/soprano-factory
🎤 Train or fine-tune your own Soprano text-to-speech models with ease using... |
|
Experimental |
| 5016 |
nisakson2000/Gizmo-AI
A fully local AI assistant — 9B LLM + vision on GPU, Voice Studio with voice... |
|
Experimental |
| 5017 |
Seda-Gtech/ai-voice-architecture
Flutter Web demo showcasing AI voice architecture — ElevenLabs TTS, Voice... |
|
Experimental |
| 5018 |
notvibhu8/VoiceLICT
📢 Empower LICT students to voice concerns using AI to identify common issues... |
|
Experimental |
| 5019 |
fabiolimace/espeak-playground
Espaço para experimentação do software espeak-ng. 🔬 🥼 |
|
Experimental |
| 5020 |
1urelius/atlas.cam
Display live webcam video as ASCII art in the terminal with real-time edge... |
|
Experimental |
| 5021 |
deuxksy/today-vn-news
베트남 뉴스 자동 생성 파이프라인 (TTS, FFmpeg, Hardware Acceleration) |
|
Experimental |
| 5022 |
KernicDE/nova-ed-monitor
NOVA — Navigation, Operations, and Vessel Assistance for Elite Dangerous |
|
Experimental |
| 5023 |
zsoltfrks/multimodal-story-generator
A rather simple story generator from images with text-to-speech integration... |
|
Experimental |
| 5024 |
DarkSide7839/PytDm
🌐 Streamline your downloads with PytDm, a modern Python download manager... |
|
Experimental |
| 5025 |
kvnpetit/BetterFrenchTTS
Intelligent Android TTS wrapper optimized for French — Kotlin DSL, SSML... |
|
Experimental |
| 5026 |
mizunashi-mana/cc-voice-reporter
Real-time voice reporting for Claude Code — hear what Claude is doing... |
|
Experimental |
| 5027 |
Narasimha1997/wavenet-stt
An end-to-end speech recognition system with Wavenet. Built using C++ and python. |
|
Experimental |
| 5028 |
ankitiscracked/usevoiceai
the Typescript toolkit for ambitious voice AI apps |
|
Experimental |
| 5029 |
ccoreilly/deepspeech-catala
Deepspeech ASR Model for the Catalan Language |
|
Experimental |
| 5030 |
jianchang512/speech2text-df
基于Dolphin模型的东方语言音视频转字幕api及webui |
|
Experimental |
| 5031 |
pselvana/VoiceCrafter
Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild |
|
Experimental |
| 5032 |
pragyak412/Improving-Voice-Separation-by-Incorporating-End-To-End-Speech-Recognition
Implementing the paper - |
|
Experimental |
| 5033 |
ivedants/Magic-Media-native-iOS-iPadOS-AR-App
Magic Media is an award-winning experimental native iOS/iPadOS application... |
|
Experimental |
| 5034 |
PineapplePie/SpeechHelper
SpeechHelper is an Android text-to-speech (TTS) library that simplifies the... |
|
Experimental |
| 5035 |
KISETU-ggwp/JpSignSpell
"Yubimoji-kun" is a web application that recognizes fingerspelling in... |
|
Experimental |
| 5036 |
MeDeity/LibBaiduTextToSpeech
一句话拥有 百度语音合成 能力 |
|
Experimental |
| 5037 |
LexMainye/Kasuku-Transcriber
A speech to text web app for people with speech impairments that has support... |
|
Experimental |
| 5038 |
IseduardoRezende/IAParty
Profile/Persona Call using LLM |
|
Experimental |
| 5039 |
hekmon/kyutai-rs
Golang bindings to Kyutai Delayed Streams Modeling Rust productions servers |
|
Experimental |
| 5040 |
exyezed/audiotts-pro
Text-to-Speech generator and audio downloader supporting Azure Speech, IBM... |
|
Experimental |
| 5041 |
dwain-barnes/vibevoice-0.5-realtime-fastrtc-plugin
A FastRTC-compatible wrapper for Microsoft's... |
|
Experimental |
| 5042 |
mkpoli/wenyan-book-video
Narration video rendering pipeline for 《文言陰符》 (wenyan-book) |
|
Experimental |
| 5043 |
SenalDolage/object-detection-TFJS-ReactNative
A mobile application that identifies nearby objects and gives a voice output... |
|
Experimental |
| 5044 |
ferrinweb/voicedictation-webapi-demo
A iflytek voice dictation web api demo. 讯飞语音听写接口纯前端demo. |
|
Experimental |
| 5045 |
adrenak/UniSpeech
A simple to use Speech Recognition library for Unity based on the Microsoft... |
|
Experimental |
| 5046 |
Astralchemist/Voice-Clone-TTS
This is a text to speech model that has many various uses |
|
Experimental |
| 5047 |
sellorm/rsay
Make R and your Mac speak |
|
Experimental |
| 5048 |
smch/tts
Text to speech with web speech synthesis api and amazon polly, reads and... |
|
Experimental |
| 5049 |
Hrithik1122/quizilla.github.io
Quizilla is a web application, use a (Text-to-Speech) API for listening... |
|
Experimental |
| 5050 |
maggieezzat/speech-to-speech-translation
A flask web-page hosting a speech to speech translation demo |
|
Experimental |
| 5051 |
RFebrians/AI-Assistant
I/O Voice Recognition using Conditional Rendering |
|
Experimental |
| 5052 |
DoubleCouponDay/TextToSpeechMod
Designed for the game space engineers |
|
Experimental |
| 5053 |
nikkoxgonzales/streaming-tts
A streamlined, Kokoro-based text-to-speech library with streaming support. |
|
Experimental |
| 5054 |
SunPCSolutions/DiarASR
Enterprise-Grade Secure ASR Diarization Pipeline - HIPAA-compliant speech... |
|
Experimental |
| 5055 |
Ali1gamer7798/StreamXBot
Stream music in your browser with a self-hosted Telegram bot that works... |
|
Experimental |
| 5056 |
dlacheal/AI-VoiceAssistant
AIVA es un ecosistema de asistencia de voz de baja latencia diseñado para... |
|
Experimental |
| 5057 |
NguyenPhamMC/whisperer
🎤 Record and transcribe voice dictation on Linux with push-to-talk... |
|
Experimental |
| 5058 |
gregormcw/notable
Voice-first note capture and semantic retrieval. |
|
Experimental |
| 5059 |
sankalp20436/E-ceptionist
Eceptionist-A smart receptionist is a facial recognition-based monitoring... |
|
Experimental |
| 5060 |
harmlessman/CoquiTTSGui
Gui for users who use the coqui-TTS vits model. |
|
Experimental |
| 5061 |
hadihaider055/vocal-dub
Dub audio into 50+ languages using AI. Whisper transcription, Google... |
|
Experimental |
| 5062 |
rockywuest/kawaii-bath-assistant
🛁 Cute AI-powered bathroom assistant for M5Stack Core 2 — kawaii face,... |
|
Experimental |
| 5063 |
husseinnsourr/NeuralChatter
A Next-Generation Neural TTS Engine. High-quality, human-like voice... |
|
Experimental |
| 5064 |
NormVg/AutoCaptionGenAI
A Python project that extracts audio from video files, transcribes the... |
|
Experimental |
| 5065 |
sherurox/Motion-Flow
Real-time, bidirectional sign language translation — powered entirely in the... |
|
Experimental |
| 5066 |
PatrickFanella/soundhash
A sophisticated system for matching audio clips from videos across social... |
|
Experimental |
| 5067 |
DarkKnightSgh/Dotslash5.0HackAttack
Team HackAttack:Our solution combines state-of-the-art technologies to... |
|
Experimental |
| 5068 |
alozowski/textplease
Upload an audio/video file, configure settings, and receive a text transcript |
|
Experimental |
| 5069 |
gouhaha/Whisper-App
Windows Whisper transcription app (PyInstaller + ffmpeg) |
|
Experimental |
| 5070 |
ggegoge/PyTDM
Pytońska treść do mowy – Polish Text to Speech library for Python |
|
Experimental |
| 5071 |
deepgram-starters/fastapi-text-to-speech
Get started using Deepgram's Text-to-Speech with this FastAPI demo app |
|
Experimental |
| 5072 |
talhabinjaved/voice-ai-agents-openai-telnyx
A FastAPI starter that turns a Telnyx phone number into a realtime,... |
|
Experimental |
| 5073 |
priya-kumari-04/-MindfulMate
Nurturing Mental Wellness Together |
|
Experimental |
| 5074 |
jina-ai/executor-coquiTTS
Executor that leverages CoquiTTS engine for text2speech |
|
Experimental |
| 5075 |
igorovh/tts
📢 !tts command for twitch.tv/kick.com |
|
Experimental |
| 5076 |
yxwyoyoyo/xf-tts
讯飞在线语音合成 |
|
Experimental |
| 5077 |
Erenyegar2/modular-auto-specch-recog-toolkit
🎤 Build and deploy advanced automatic speech recognition systems with this... |
|
Experimental |
| 5078 |
Superx11179/DC-Speech-VAE
🎤 Compress speech to 5 Hz with DC-Speech-VAE, ensuring high perceptual... |
|
Experimental |
| 5079 |
soanseng/voxpen-android
AI voice keyboard for Android — speak naturally, get polished text. Whisper... |
|
Experimental |
| 5080 |
Orca0917/TransformerTTS
Unofficial PyTorch implementation of Transformer-TTS, a Transformer-based... |
|
Experimental |
| 5081 |
analyticsinmotion/micstream
Cross-platform microphone audio capture for Node.js with pre-built... |
|
Experimental |
| 5082 |
loglux/SpeakItAI
Convert text to speech using Microsoft Azure Neural Text-to-Speech (TTS) and... |
|
Experimental |
| 5083 |
Jahangirbd23/WenetSpeech-Yue
📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich... |
|
Experimental |
| 5084 |
phith0n/v2srt
v2srt 是一个基于人工智能的视频字幕生成工具,为任意视频生成高质量的字幕文件。 |
|
Experimental |
| 5085 |
biraj21/open-voice
Open Source Voice AI Infrastructure with WebRTC backend, and web and mobile... |
|
Experimental |
| 5086 |
bseceenn/Fun-CosyVoice3-0.5B-2512-Deploy
🎤 Deploy a simplified voice synthesis service with Fun-CosyVoice3-0.5B-2512,... |
|
Experimental |
| 5087 |
BlackRoad-OS/whisper.cpp
Fork of whisper.cpp — speech-to-text inference for BlackRoad edge devices |
|
Experimental |
| 5088 |
bdcorps/VideoNews
An app experiment to develop a dynamic world news channel app |
|
Experimental |
| 5089 |
Zer0pa/ZPE-Prosody
ZPE-Prosody V0.0: DETERMINISTIC SPEECH PROSODY CODEC: Intonation | Rhythm |... |
|
Experimental |
| 5090 |
Salama1429/Text-to-speech_TTS_Model_Training
Training Text to speech model for German Language |
|
Experimental |
| 5091 |
Alex2135/ASR-proto
Implemintetion of linear attention conformer - LAC |
|
Experimental |
| 5092 |
hongkongkiwi/elevenlabs-cli
Community-built CLI for the ElevenLabs AI audio platform with TTS, STT,... |
|
Experimental |
| 5093 |
Ushaflow/merge-ssml
Combine multiple SSML documents in JS |
|
Experimental |
| 5094 |
lugia19/Echo-XI
Speech to text to speech using Elevenlabs |
|
Experimental |
| 5095 |
carmen-martin/Deep-Keyword-Spotting
A Small Footprint implementation of Keyword Spotting with different architectures. |
|
Experimental |
| 5096 |
ringabout/scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer. |
|
Experimental |
| 5097 |
praneethpj/Unity-Android-Utilities
Open Source Unity-Android Platform Voice Text API and Text To Voice API. |
|
Experimental |
| 5098 |
antouanbg/Bulgarian_Linguistic
Collection and resources for Bulgarian Corpus, Datasets and Models used in... |
|
Experimental |
| 5099 |
katejay/Text-To-Speech
An android app for text to speech. |
|
Experimental |
| 5100 |
Mwamwaaaa/opentypeless
Provide seamless AI voice input for desktop to convert speech into clear,... |
|
Experimental |