All Voice AI Tools
8,165 tools ranked by quality score · Page 4 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 301 |
taigrr/elevenlabs
ElevenLabs Artificial Voice Synthesis Client |
|
Established |
| 302 |
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project. |
|
Established |
| 303 |
deepgram-starters/node-transcription
Get started using Deepgram's Transcription with this Node demo app |
|
Established |
| 304 |
Agents365-ai/video-podcast-maker
AI-powered video podcast creation skill for coding agents. Supports Bilibili... |
|
Established |
| 305 |
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language |
|
Established |
| 306 |
aedocw/epub2tts
Turn an epub or text file into an audiobook |
|
Established |
| 307 |
BolajiAyodeji/chat-with-siri
🤖 A text-to-speech chatbot built using Nextjs, OpenAI, and ElevenLabs. |
|
Established |
| 308 |
BoltzmannEntropy/MimikaStudio
MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic... |
|
Established |
| 309 |
deepgram-starters/node-voice-agent
Get started using Deepgram's Voice Agent with this Node demo app |
|
Established |
| 310 |
yanorei32/discord-tts
TTS Discord Bot [VOICEROID, VOICEVOX, AivisSpeech, kttsproject, WinRT, and... |
|
Established |
| 311 |
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统 |
|
Established |
| 312 |
PaciStardust/HOSCY
Companion for OSC and Communication |
|
Established |
| 313 |
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit. |
|
Established |
| 314 |
Macoron/whisper.unity
Running speech to text model (whisper.cpp) in Unity3d on your local machine. |
|
Established |
| 315 |
echogarden-project/echogarden
Cross-platform speech toolset, used from the command-line or as a Node.js... |
|
Established |
| 316 |
ciffelia/koe
Discord 読み上げ Bot |
|
Established |
| 317 |
primepake/wav2lip_288x288
Wav2Lip version 288 and pipeline to train |
|
Established |
| 318 |
Weilbyte/tiktok-tts
Generate TikTok Text-to-Speech voices in your browser |
|
Established |
| 319 |
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,... |
|
Established |
| 320 |
TuananhCR/Dia-Finetuning-Vietnamese
TTS Dia finetuning for Vietnamese |
|
Established |
| 321 |
adrianlyjak/obsidian-aloud-tts
Obsidian TTS Plugin |
|
Established |
| 322 |
deepgram-devs/nextjs-text-to-speech
Get started using Deepgram's Text-to-Speech with this Next.js demo app |
|
Established |
| 323 |
PrzemyslawSwiderski/python-gradle-plugin
Gradle plugin to run Python projects. |
|
Established |
| 324 |
jonatasgrosman/huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools |
|
Established |
| 325 |
FENRlR/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch |
|
Established |
| 326 |
mathigatti/midi2voice
Singing synthesis from MIDI file |
|
Established |
| 327 |
HeyWillow/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive... |
|
Established |
| 328 |
robdmac/talkito
TalkiTo lets developers interact with AI systems through speech across... |
|
Established |
| 329 |
scarletcho/KoLM
Korean text normalization and language preparation package for LM in... |
|
Established |
| 330 |
misyaguziya/VRCT
VRCT(VRChat Chatbox Translator & Transcription) |
|
Established |
| 331 |
reazon-research/ReazonSpeech
Massive open Japanese speech corpus |
|
Established |
| 332 |
yeyupiaoling/YeAudio
Python的音频工具 |
|
Established |
| 333 |
mlalma/KokoroTestApp
Test application for Kokoro TTS model |
|
Established |
| 334 |
OpenVoiceOS/ovos-tts-plugin-cotovia
galician tts plugin for OVOS |
|
Established |
| 335 |
soniqo/speech-swift
AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and... |
|
Established |
| 336 |
Thiagohgl/ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation. |
|
Established |
| 337 |
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端 |
|
Established |
| 338 |
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper... |
|
Established |
| 339 |
ArdaGnsrn/elevenlabs-laravel
This is an Open Source PHP Laravel package for ElevenLabs Text to Speech API. |
|
Established |
| 340 |
asiff00/On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech... |
|
Established |
| 341 |
alphacep/awesome-russian-speech
Russian speech technology links |
|
Established |
| 342 |
h5p/h5p-speak-the-words
Create questions answered through speech |
|
Established |
| 343 |
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX |
|
Established |
| 344 |
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS... |
|
Established |
| 345 |
pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files... |
|
Established |
| 346 |
stefantaubert/pinyin-to-ipa
Command-line interface and Python library to transcribe pinyin to IPA. The... |
|
Established |
| 347 |
JSchmie/ScrAIbe-WebUI
WebUI for ScAIbe |
|
Established |
| 348 |
manyeyes/ManySpeech
AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment... |
|
Established |
| 349 |
voicegain/platform
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.) |
|
Established |
| 350 |
mgonzs13/audio_common
A PortAudio based audio_common with text to speech for ROS 2 |
|
Established |
| 351 |
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model |
|
Established |
| 352 |
react-native-voice/voice
:microphone: React Native Voice Recognition library for iOS and Android... |
|
Established |
| 353 |
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla... |
|
Established |
| 354 |
readium/speech
💬 A TypeScript library for implementing read aloud on the Web |
|
Established |
| 355 |
Sharrnah/whispering-ui
Native UI for the Whispering Tiger project -... |
|
Established |
| 356 |
canopyai/Orpheus-TTS
Towards Human-Sounding Speech |
|
Established |
| 357 |
pannous/tensorflow-speech-recognition
🎙Speech recognition using the tensorflow deep learning framework,... |
|
Established |
| 358 |
dangvansam/viet-tts
VietTTS: An Open-Source Vietnamese Text to Speech |
|
Established |
| 359 |
RageAgainstThePixel/com.rest.elevenlabs
A non-official Eleven Labs voice synthesis client for Unity (UPM) |
|
Established |
| 360 |
MasuRii/opencode-smart-voice-notify
🔊 Smart voice notification plugin for OpenCode with multiple TTS engines... |
|
Established |
| 361 |
athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine |
|
Established |
| 362 |
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model |
|
Established |
| 363 |
pnnbao97/Kani-TTS-Vie
Fast Vietnamese TTS. 370M params, 3-second inference. |
|
Established |
| 364 |
bambocher/pocketsphinx-python
Python interface to CMU Sphinxbase and Pocketsphinx libraries |
|
Established |
| 365 |
HA6Bots/Automatic-Youtube-Reddit-Text-To-Speech-Video-Generator-and-Uploader
A series of 3 programs that will automatically receive scripts from Reddit,... |
|
Established |
| 366 |
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural... |
|
Established |
| 367 |
alexa-pi/AlexaPi
Alexa client for all your devices! # No active development. PRs welcome #... |
|
Established |
| 368 |
vannu07/jarvis
🤖 Jarvis - AI Voice Assistant with Face Recognition | Hacktoberfest 2025... |
|
Established |
| 369 |
spring-media/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based... |
|
Established |
| 370 |
TheStageAI/TheWhisper
Optimized Whisper models for streaming and on-device use |
|
Established |
| 371 |
WhiteMagic2014/tts-edge-java
java sdk for Edge Read Aloud |
|
Established |
| 372 |
whitphx/streamlit-stt-app
Real time web based Speech-to-Text app with Streamlit |
|
Established |
| 373 |
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary |
|
Established |
| 374 |
yuvraj108c/ComfyUI-Whisper
Transcribe audio and add subtitles to videos using Whisper in ComfyUI |
|
Established |
| 375 |
mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper. |
|
Established |
| 376 |
codeforequity-at/botium-speech-processing
Botium Speech Processing |
|
Established |
| 377 |
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with... |
|
Established |
| 378 |
zai-org/GLM-ASR
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters |
|
Established |
| 379 |
xiangyuecn/Recorder
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid... |
|
Established |
| 380 |
ekwek1/soprano
Soprano: Instant, Ultra-Realistic Text-to-Speech |
|
Established |
| 381 |
BolisettySujith/J.A.R.V.I.S
A voice assistant 🗣️ which can be used to interact with your computer 💻 and... |
|
Established |
| 382 |
ArkanDash/Multi-Model-RVC-Inference
RVC Inference with multiple model and huggingface support |
|
Established |
| 383 |
XDcobra/react-native-sherpa-onnx
React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing... |
|
Established |
| 384 |
MycroftAI/adapt
Adapt Intent Parser |
|
Established |
| 385 |
at16k/at16k
Trained models for automatic speech recognition (ASR). A library to quickly... |
|
Established |
| 386 |
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN &... |
|
Established |
| 387 |
ftyers/commonvoice-utils
Linguistic processing for Common Voice |
|
Established |
| 388 |
soobinseo/Transformer-TTS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network" |
|
Established |
| 389 |
drethage/speech-denoising-wavenet
A neural network for end-to-end speech denoising |
|
Established |
| 390 |
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing |
|
Established |
| 391 |
marytts/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system... |
|
Established |
| 392 |
NVIDIA/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP |
|
Established |
| 393 |
srvk/eesen
The official repository of the Eesen project |
|
Established |
| 394 |
doctoroyy/edge-tts-as-a-service
This is a simple HTTP service that uses the Edge-TTS library to generate... |
|
Established |
| 395 |
pierreaubert/spinorama
A library to display and compare spinorama (speakers measurements) graphs. |
|
Established |
| 396 |
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search |
|
Established |
| 397 |
totalvoice/totalvoice-node
Client em NodeJS para API da Totalvoice |
|
Established |
| 398 |
AdolfVonKleist/Phonetisaurus
Phonetisaurus G2P |
|
Established |
| 399 |
AI4Bharat/Chitralekha
Chitralekha - A video transcreation platform for Indic languages, supporting... |
|
Established |
| 400 |
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine |
|
Established |