All Voice AI Tools
8,165 tools ranked by quality score · Page 19 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1801 |
evilC/HotVoice
Adds Speech Recognition support to AutoHotkey, via a C# DLL |
|
Emerging |
| 1802 |
ElmTran/praises
Praises is a text-to-speech tool that can help you read text easily. |
|
Emerging |
| 1803 |
falabrasil/kaldi-br
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro |
|
Emerging |
| 1804 |
Proteusiq/saa
Making Time Speak! 🎙️ |
|
Emerging |
| 1805 |
mxvsh/wave
Native macOS dictation app focused on fast voice-to-text workflows. |
|
Emerging |
| 1806 |
eminemahjoub/pdf-voice-reader
"PDF Reader: A Python application for seamless PDF viewing with enhanced... |
|
Emerging |
| 1807 |
noco-ai/spellbook-docker
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many... |
|
Emerging |
| 1808 |
lars76/fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA |
|
Emerging |
| 1809 |
CMsmartvoice/One-Shot-Voice-Cloning
:relaxed: One Shot Voice Cloning base on Unet-TTS |
|
Emerging |
| 1810 |
ckaytev/tgisper
Telegram bot with ASR |
|
Emerging |
| 1811 |
1038lab/ComfyUI-MegaTTS
A ComfyUI custom node based on ByteDance MegaTTS3, enabling high-quality... |
|
Emerging |
| 1812 |
soldier444xd/KittenTTS
KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with... |
|
Emerging |
| 1813 |
mdingena/att-voodoo
A community-made magic mod for A Township Tale, a VR MMORPG game. |
|
Emerging |
| 1814 |
Citadawn/VoiceDAO
语道 (VoiceDAO) - 专注于文本转语音功能的 Android 应用 |
|
Emerging |
| 1815 |
telecombcn-dl/2018-dlsl
UPC Deep Learning for Speech and Language 2018 |
|
Emerging |
| 1816 |
CarrotYuan/openclaw-voice-control
A macOS local voice-control companion for OpenClaw with Siri-like wakeword... |
|
Emerging |
| 1817 |
paladini/voice-separator-demucs
A simple and efficient self-hosted application to separate vocals from music... |
|
Emerging |
| 1818 |
deepgram-devs/dg-translation-chrome-ext
A TypeScript chrome extension that uses Deepgram to provide live... |
|
Emerging |
| 1819 |
andi611/CS-Tacotron-Pytorch
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis... |
|
Emerging |
| 1820 |
AndroidCodility/SpeechToText
Android application to text through which you can provide speech input to... |
|
Emerging |
| 1821 |
HelloChatterbox/py_responsivevoice
unoficial python api for responsive voice |
|
Emerging |
| 1822 |
GloomyGrave/Sinsy-NG
(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis... |
|
Emerging |
| 1823 |
OpenVoiceOS/ovos-tts-plugin-beepspeak
experiment adding new r2d2 tts engine for mycroft |
|
Emerging |
| 1824 |
leduckhai/wav2graph
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech |
|
Emerging |
| 1825 |
QuantiusBenignus/BlahST
Input text from speech in any Linux window, the lean, fast and accurate way,... |
|
Emerging |
| 1826 |
SpeechColab/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform... |
|
Emerging |
| 1827 |
alam025/ai-voice-assistant-appointment-booking
Enterprise-grade AI voice assistant for automated appointment scheduling... |
|
Emerging |
| 1828 |
Kyubyong/specAugment
Tensor2tensor experiment with SpecAugment |
|
Emerging |
| 1829 |
AA-Factory/aafactory-prototype
⚡ AI Avatar Factory is an interface for creating and managing AI avatars. ⚡ |
|
Emerging |
| 1830 |
xingchensong/Speech-Transformer-tf2.0
transformer for ASR-systerm (via tensorflow2.0) |
|
Emerging |
| 1831 |
asiff00/Training-TTS
Train and finutune text-to-speech models for Bengali and many other languages! |
|
Emerging |
| 1832 |
AI-TOOLKIT/VoiceBridge
VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit |
|
Emerging |
| 1833 |
funway/audible-epub3-maker
Generate audiobooks from plain EPUB files in EPUB 3 Media Overlays format... |
|
Emerging |
| 1834 |
iceychris/LibreASR
:speech_balloon: An On-Premises, Streaming Speech Recognition System |
|
Emerging |
| 1835 |
instavar/qwen3-tts-lora-finetuning
Qwen3‑TTS LoRA fine‑tuning tools (companion repo) for custom voice adaptation |
|
Emerging |
| 1836 |
ondrejklejch/learning_to_adapt
Coordinate-wise meta-learner for speaker adaptation of ASR models. |
|
Emerging |
| 1837 |
fcjr/ltts
Quick CLI for local text-to-speech using Qwen3-TTS or Kokoro TTS. |
|
Emerging |
| 1838 |
Harsh-0-7/PDF-Reader
PDF reader with read aloud feature |
|
Emerging |
| 1839 |
siddhant-vij/Health-Fitness-Tracker
Health & fitness app with natural language processing, custom... |
|
Emerging |
| 1840 |
gkrsv/split_audio
A rough and ready Python utility which splits audio files based on silence... |
|
Emerging |
| 1841 |
scarletcho/prep4kaldi
Data preparation code for building Kaldi ASR system |
|
Emerging |
| 1842 |
ayshrv/memento-app
Android App which serves as an AI assistant for human memory |
|
Emerging |
| 1843 |
krestaino/prankstr
📞 Prank your friends with text-to-speech phone calls powered by Twilio and... |
|
Emerging |
| 1844 |
sskorol/vosk-api-gpu
Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC |
|
Emerging |
| 1845 |
bedriyan/speaky
Voice-to-text for macOS, powered by on-device AI. Press a hotkey, speak, and... |
|
Emerging |
| 1846 |
jbmiller10/transcribrr
Transcribrr is a python desktop gui application that uses transcribes ... |
|
Emerging |
| 1847 |
tochilkinva/tg_bot_stt_tts
Telegram bot with voice message recognition and generation. Speech to Text... |
|
Emerging |
| 1848 |
naeruru/mimiuchi
a free, customizable, osc capable speech-to-text interface for relaying text... |
|
Emerging |
| 1849 |
JSON2Video/json2video-php-sdk
Video automation with PHP: add watermarks, resize videos, create slideshows,... |
|
Emerging |
| 1850 |
kroko-ai/kroko-onnx
Kroko ASR - Speech-to-text |
|
Emerging |
| 1851 |
aiola-lab/drax
Drax: Speech Recognition with Discrete Flow Matching |
|
Emerging |
| 1852 |
taresh18/orpheus-streaming
Orpheus TTS Server with streaming support (TTFB ~160ms) |
|
Emerging |
| 1853 |
HawkAaron/RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction... |
|
Emerging |
| 1854 |
amadeomano/persian-tts
🔊 A simple human-based text-to-speach synthesiser and ReactNative app for... |
|
Emerging |
| 1855 |
kaiaai/kaia.js
Kaia.ai platform's JS client library |
|
Emerging |
| 1856 |
rxlabz/sytody
a Flutter "speech to todo" app example |
|
Emerging |
| 1857 |
ericc-ch/edge-tts
Use Microsoft Edge's online text-to-speech service from JS code directly! |
|
Emerging |
| 1858 |
hutchresearch/latex2speech
TeX2Speech is an application that turns LaTeX documents into spoken audio. |
|
Emerging |
| 1859 |
BraceYourselfGames/UE-BYGTextToSpeech
A plugin that uses the Windows Speech API to speak text in Unreal Engine 4. |
|
Emerging |
| 1860 |
sexfrance/RecaptchaV2-Solver
A Python-based solution for solving Google's reCAPTCHA v2 challenges... |
|
Emerging |
| 1861 |
UFOAlastor/AI-Waifu-Project-LaIN
一个拥有长期记忆, 表情动作, 语音对话/打断/声纹识别, FunctionCall, 多模型支持的AI Waifu客户端. |
|
Emerging |
| 1862 |
AsaoluElijah/say-it
A mobile web application that helps you convert spoken words to... |
|
Emerging |
| 1863 |
Ronik22/Voice-Controlled-Email
A python-based voice-controlled email application for visually impaired persons. |
|
Emerging |
| 1864 |
ng-web-apis/speech
A library for using Web Speech API with Angular |
|
Emerging |
| 1865 |
zalo/OpenAI-Voice
A simple proof of concept for voice-to-voice interaction. |
|
Emerging |
| 1866 |
dokterbob/macos-speech-server
Local, fast and efficient Speech to Text (STT) and Text to Speech (TTS) on... |
|
Emerging |
| 1867 |
lcraver/ProxiTalk
This is the repo for ProxiTalk OS. ProxiTalk is a custom operating system... |
|
Emerging |
| 1868 |
aidayang/LatentSync-OneClick
免费视频对口型软件LatentSync一键启动整合包 |
|
Emerging |
| 1869 |
bhashini-ai/bhashini-api-examples
Sample programs for calling Bhashini.ai REST/WebSocket APIs - TTS, STT/ASR,... |
|
Emerging |
| 1870 |
mozilla/deepspeech-playbook
A crash course for training speech recognition models using DeepSpeech. |
|
Emerging |
| 1871 |
Fooftilly/kokoro-extension
Send text from browser to Kokoro-FastAPI for TTS generation |
|
Emerging |
| 1872 |
Better-Player/espeakng-sys
Rust bindings to eSpeak NG |
|
Emerging |
| 1873 |
cristofima/AI-Tech-Interview-Preparation
An AI-powered technical interview preparation platform that generates... |
|
Emerging |
| 1874 |
karrarkazuya/ArabicTTS
ArabicTTS (TextToSpeech) Android library with a sample |
|
Emerging |
| 1875 |
HCI-LAB-UGSPEECHDATA/speech_data_ghana_ug
The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani,... |
|
Emerging |
| 1876 |
hcy71o/SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker... |
|
Emerging |
| 1877 |
Frida7771/PyVoice
A Python-based speech processing tool that supports both speech-to-text... |
|
Emerging |
| 1878 |
speechsuper/SpeechSuper-API-Samples
Deep learning based speech and pronunciation assessment API for 8 languages. |
|
Emerging |
| 1879 |
botbahlul/whisper_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using... |
|
Emerging |
| 1880 |
IBM/text-to-speech-code-pattern
WARNING: This repository is no longer maintained |
|
Emerging |
| 1881 |
wannaphong/KhanomTan-TTS-v1.0
KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that... |
|
Emerging |
| 1882 |
sciforce/phones-las
Articulatory features estimation using Listen Attend and Spell architecture. |
|
Emerging |
| 1883 |
sayak-brm/espeakng-python
An eSpeak NG TTS binding for Python3. |
|
Emerging |
| 1884 |
henry-richard7/Natural-Text-to-Speech
This python program uses https://naturaltts.com API to convert given text to... |
|
Emerging |
| 1885 |
manhph2211/ViSR
This repo builds an end-to-end deep learning application that supports... |
|
Emerging |
| 1886 |
AkishinoShiame/Chinese-Speech-Emotion-Datasets
Datasets of A Deep Convolutional Neural Network Based Virtual Elderly... |
|
Emerging |
| 1887 |
jenswittmann/CurlyFramework
Tiny Framework for accessibility and sustainability, not only for MODX or Kirby CMS. |
|
Emerging |
| 1888 |
tmanderson/ivona-node
Ivona Cloud (via Amazon services) client library for Node |
|
Emerging |
| 1889 |
HnDK0/NoveLA
Free Android reader for web novels, light novels, ranobe & EPUB. 25+... |
|
Emerging |
| 1890 |
npuichigo/ttsflow
tensorflow speech synthesis c++ inference for voicenet |
|
Emerging |
| 1891 |
andi611/ZeroSpeech-TTS-without-T
A Pytorch implementation for the ZeroSpeech 2019 challenge. |
|
Emerging |
| 1892 |
askrella/speech-rest-api
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain) |
|
Emerging |
| 1893 |
alan-ai/alan-sdk-reactnative
The Self-Coding System for Your App — Alan AI SDK for React Native |
|
Emerging |
| 1894 |
nexmo-community/voice-azure-speechtotext-py
Sample Code for Realtime Transcription using Nexmo, Microsoft Azure Speech... |
|
Emerging |
| 1895 |
i4Ds/whisper-prep
Data preparation utility for the finetuning of OpenAI's Whisper model. |
|
Emerging |
| 1896 |
Deepak5j/PyTranscriber
Speech to Text |
|
Emerging |
| 1897 |
persiandataset/PersianSpeech
Persian ASR dataset |
|
Emerging |
| 1898 |
asmith26/speech2caret
Use your speech to write to the current caret position! |
|
Emerging |
| 1899 |
masonthemaker/saidwell
Open Source Voice AI Dashboard |
|
Emerging |
| 1900 |
Kalebu/image-to-sound-python-
A python project for converting an Image into audible sound using OCR and... |
|
Emerging |