All Voice AI Tools
8,165 tools ranked by quality score · Page 2 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 101 | daanzu/kaldi-active-grammar<br>Python Kaldi speech recognition with grammars that can be set... | | Established |
| 102 | roryeckel/wyoming_openai<br>OpenAI-Compatible Proxy Middleware for the Wyoming Protocol | | Established |
| 103 | kishanrajput23/Jarvis-Desktop-Voice-Assistant<br>A Python-based desktop voice assistant capable of executing system-level... | | Established |
| 104 | sandrohanea/whisper.net<br>Whisper.net. Speech to text made simple using Whisper Models | | Established |
| 105 | ChetanXpro/nodejs-whisper<br>NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as... | | Established |
| 106 | royshil/obs-localvocal<br>OBS plugin for local speech recognition and captioning using AI | | Established |
| 107 | NVIDIA-AI-Blueprints/pdf-to-podcast<br>Transform PDFs into AI podcasts for engaging on-the-go audio content. | | Established |
| 108 | nazdridoy/kokoro-tts<br>A CLI text-to-speech tool using the Kokoro model, supporting multiple... | | Established |
| 109 | PyThaiNLP/PyThaiTTS<br>Open Source Thai Text-to-speech library in Python | | Established |
| 110 | zuoban/tts<br>TTS service | | Established |
| 111 | githubharald/CTCWordBeamSearch<br>Connectionist Temporal Classification (CTC) decoder with dictionary and... | | Established |
| 112 | charleprr/redditube<br>A video generator from Reddit posts and comments | | Established |
| 113 | Picovoice/web-voice-processor<br>A library for real-time voice processing in web browsers | | Established |
| 114 | snakers4/silero-models<br>Silero Models: pre-trained text-to-speech models made embarrassingly simple | | Established |
| 115 | deepgram/deepgram-python-sdk<br>Official Python SDK for Deepgram. | | Established |
| 116 | Wikidepia/g2p-id<br>Indonesian Grapheme-to-Phoneme (IPA notation) | | Established |
| 117 | sdkcarlos/artyom.js<br>A voice control - voice commands - speech recognition and speech synthesis... | | Established |
| 118 | JamesBrill/react-speech-recognition<br>💬 Speech recognition for your React app | | Established |
| 119 | lugia19/elevenlabslib<br>Full Python wrapper for the ElevenLabs API. | | Established |
| 120 | OpenVoiceOS/ovos-tts-server<br>Simple Flask server to host OpenVoiceOS TTS plugins as a service | | Established |
| 121 | yandexdataschool/speech_course<br>YSDA course in Speech Processing. | | Established |
| 122 | mkiol/dsnote<br>Speech Note Linux app. Note taking, reading and translating with offline... | | Established |
| 123 | morganney/tts-react<br>Convert text to speech using React. | | Established |
| 124 | Vonage/vonage-ruby-sdk<br>Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech,... | | Established |
| 125 | PyThaiNLP/pythaiasr<br>Python Thai Automatic Speech Recognition | | Established |
| 126 | daswer123/xtts-api-server<br>A simple FastAPI Server to run XTTSv2 | | Established |
| 127 | revdotcom/revai-node-sdk<br>Node.js SDK for the Rev AI API | | Established |
| 128 | TensorSpeech/TensorFlowTTS<br>TensorFlowTTS: Real-Time State-of-the-art... | | Established |
| 129 | istupakov/onnx-asr<br>A lightweight Python package for Automatic Speech Recognition using ONNX models | | Established |
| 130 | MycroftAI/mycroft-precise<br>A lightweight, simple-to-use, RNN wake word listener | | Established |
| 131 | Spr-Aachen/Easy-Voice-Toolkit<br>A user-friendly audio toolkit for voice recognition, voice transcription,... | | Established |
| 132 | itsmevictor/clean-transcribe<br>A simple CLI to transcribe YouTube videos or local audio/video files and... | | Established |
| 133 | OpenVoiceOS/ovos-tts-plugin-espeakNG<br>espeakNG plugin | | Established |
| 134 | n1teshy/yapper-tts<br>Offline text to speech and free SOTA LLM APIs to let your programs speak to you | | Established |
| 135 | Ailln/cn2an<br>📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more) | | Established |
| 136 | shivammehta25/Matcha-TTS<br>[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching | | Established |
| 137 | mdiller/MangoByte<br>A Discord bot that provides the ability to play Dota hero response clips, do... | | Established |
| 138 | CorentinJ/Real-Time-Voice-Cloning<br>Clone a voice in 5 seconds to generate arbitrary speech in real-time | | Established |
| 139 | deepgram/deepgram-js-sdk<br>Official JavaScript SDK for Deepgram. | | Established |
| 140 | ken107/read-aloud<br>An awesome browser extension that reads aloud webpage content with one click | | Established |
| 141 | phuc-nt/my-translator<br>Real-time speech translation: macOS & Windows, free TTS, no server, your... | | Established |
| 142 | mybigday/whisper.rn<br>React Native binding of whisper.cpp. | | Established |
| 143 | kstonekuan/tambourine-voice<br>Your personal voice interface for any app. Speak naturally and your words... | | Established |
| 144 | pilot51/voicenotify<br>Android app that speaks notifications | | Established |
| 145 | linto-ai/WebVoiceSDK<br>Building blocks for voice-enabled applications in the browser | | Established |
| 146 | p0n1/epub_to_audiobook<br>EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included | | Established |
| 147 | coqui-ai/TTS<br>🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research... | | Established |
| 148 | Enemyx-net/VibeVoice-ComfyUI<br>A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech... | | Established |
| 149 | aichaos/rivescript-python<br>A RiveScript interpreter for Python. RiveScript is a scripting language for... | | Established |
| 150 | tabahi/bournemouth-forced-aligner<br>Extract phoneme-level timestamps from speech audio. | | Established |
| 151 | linto-ai/whisper-timestamped<br>Multilingual Automatic Speech Recognition with word-level timestamps and confidence | | Established |
| 152 | thevickypedia/Jarvis<br>Fully Functional Voice Based Natural Language UI | | Established |
| 153 | babysor/MockingBird<br>🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time | | Established |
| 154 | vivekuppal/transcribe<br>Transcribe is a real-time transcription, conversation, language learning... | | Established |
| 155 | DigitalPhonetics/IMS-Toucan<br>Controllable and fast Text-to-Speech for over 7000 languages! | | Established |
| 156 | gooofy/py-kaldi-asr<br>Some simple wrappers around kaldi-asr intended to make using kaldi's... | | Established |
| 157 | gabrielmittag/NISQA<br>NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment | | Established |
| 158 | davidacm/NVDA-IBMTTS-Driver<br>This project is aimed at developing and maintaining the NVDA IBMTTS driver.... | | Established |
| 159 | richardr1126/openreader<br>An open-source read-along document reader server with high-quality TTS... | | Established |
| 160 | dictation-toolbox/dragonfly<br>Speech recognition framework allowing powerful Python-based scripting and... | | Established |
| 161 | altunenes/parakeet-rs<br>Very fast speech-to-text, diarization, streaming (even on CPU) with NVIDIA... | | Established |
| 162 | alphacep/vosk<br>VOSK Speech Recognition Toolkit | | Established |
| 163 | moonstar-x/discord-tts-bot<br>A Text-to-Speech bot for Discord. | | Established |
| 164 | argmaxinc/WhisperKit<br>On-device Speech Recognition for Apple Silicon | | Established |
| 165 | fishaudio/fish-audio-python<br>The official Python library for the Fish Audio API. | | Established |
| 166 | r9y9/nnmnkwii<br>Library to build speech synthesis systems designed for easy and fast prototyping. | | Established |
| 167 | fishaudio/Bert-VITS2<br>VITS2 backbone with multilingual BERT | | Established |
| 168 | MainRo/deepspeech-server<br>A testing server for a speech to text service based on coqui.ai | | Established |
| 169 | ManimCommunity/manim-voiceover<br>Manim plugin for all things voiceover | | Established |
| 170 | wenet-e2e/wenet<br>Production First and Production Ready End-to-End Speech Recognition Toolkit | | Established |
| 171 | kurianbenoy/whisper_normalizer<br>A Python package for the Whisper normalizer | | Established |
| 172 | capacitor-community/text-to-speech<br>⚡️ Capacitor plugin for synthesizing speech from text. | | Established |
| 173 | FirezTheGreat/1SHOT<br>All my works - https://github.com/FirezTheGreat (latest music commands/djs... | | Established |
| 174 | kalliope-project/kalliope<br>Kalliope is a framework that will help you to create your own personal assistant. | | Established |
| 175 | jim60105/docker-whisperX<br>Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level... | | Established |
| 176 | dectalk/dectalk<br>Modern builds for the 90s/00s DECtalk text-to-speech application. | | Established |
| 177 | Picovoice/speech-to-text-benchmark<br>Speech-to-text benchmark framework | | Established |
| 178 | nttcslab-sp/kaldiio<br>A pure Python module for reading and writing Kaldi ark files | | Established |
| 179 | i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7<br>Taiwanese language tools | | Established |
| 180 | dlutton/flutter_tts<br>Flutter Text to Speech package | | Established |
| 181 | petercunha/tts<br>A simple text-to-speech tool. Converts your text to speech... | | Established |
| 182 | alphacep/vosk-android-demo<br>Offline speech recognition for Android with Vosk library. | | Established |
| 183 | pnlpal/dictionariez<br>📚 A customizable dictionary extension that supports double-click lookups in... | | Established |
| 184 | ai-ng/swift<br>Fast voice assistant powered by Groq, Cartesia, and Vercel. | | Established |
| 185 | wq2012/SimpleDER<br>A lightweight library to compute Diarization Error Rate (DER). | | Established |
| 186 | asterics/Asterics-AAC<br>Free, easy-to-use AAC app with offline support, flexible input options,... | | Established |
| 187 | openctp/openctp<br>openctp provides channels for CTP stock options, Zhongtai Securities XTP, Huaxin Securities Qidian TORA, Orient Securities OST, East Money Securities EMT, Interactive Brokers TWS, Esunny TAP, Liangtou QDP, and others... | | Established |
| 188 | sfortis/openai_tts<br>Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine... | | Established |
| 189 | BryceWG/BiBi-Keyboard<br>BiBi Keyboard (说点啥): a Kotlin-based LLM and ASR voice-input keyboard app for Android. An LLM ASR... | | Established |
| 190 | R3gm/SoniTranslate<br>Synchronized Translation for Videos. Video dubbing | | Established |
| 191 | midas-research/audino<br>Open source audio annotation tool for humans | | Established |
| 192 | hkchengrex/MMAudio<br>[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality... | | Established |
| 193 | OpenMOSS/MOSS-TTSD<br>MOSS-TTSD is a spoken dialogue generation model designed for expressive... | | Established |
| 194 | yeyupiaoling/PaddlePaddle-DeepSpeech<br>Speech recognition implemented with PaddlePaddle, for Chinese speech. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards. | | Established |
| 195 | pykaldi/pykaldi<br>A Python wrapper for Kaldi | | Established |
| 196 | sindresorhus/awesome-whisper<br>🔊 Awesome list for Whisper, an open-source AI-powered speech recognition... | | Established |
| 197 | sidharthrajaram/StyleTTS2<br>🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and... | | Established |
| 198 | agentvoiceresponse/avr-infra<br>The AVR Infrastructure project is designed to launch the Agent Voice... | | Established |
| 199 | pot-app/pot-desktop<br>🌈 Cross-platform software for selected-text translation and OCR (text recognition). | | Established |
| 200 | yeyupiaoling/Whisper-Finetune<br>Fine-tune the Whisper speech recognition model to support training without... | | Established |