The Voice AI Directory
Quality-scored directory of 8,165 voice ai tools, updated daily. Every tool scored on maintenance, adoption, maturity, and community signals.
Voice AI covers text-to-speech synthesis, speech recognition, voice cloning, voice agents, and audio processing.
24
70–100
497
50–69
2,908
30–49
4,736
10–29
Top tools by quality score
| # | Tool | Score |
|---|---|---|
| 1 |
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement,... |
|
| 2 |
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs,... |
|
| 3 |
TalAter/annyang
💬 Speech recognition for your site |
|
| 4 |
espnet/espnet
End-to-End Speech Processing Toolkit |
|
| 5 |
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)... |
|
| 6 |
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
|
| 7 |
elevenlabs/elevenlabs-python
The official Python SDK for the ElevenLabs API. |
|
| 8 |
rapidaai/voice-ai
Rapida is an open-source, end-to-end voice AI orchestration platform for... |
|
| 9 |
DrewThomasson/ebook2audiobook
Generate audiobooks from e-books, voice cloning & 1158+ languages! |
|
| 10 |
OpenBMB/VoxCPM
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and... |
|
| 11 |
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model,... |
|
| 12 |
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a... |
|
| 13 |
salute-developers/GigaAM
Foundational Model for Speech Recognition Tasks |
|
| 14 |
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than... |
|
| 15 |
met4citizen/TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using full-body... |
|
| 16 |
ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++ |
|
| 17 |
jianchang512/pyvideotrans
Translate the video from one language to another and embed dubbing & subtitles. |
|
| 18 |
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python |
|
| 19 |
KoljaB/RealtimeTTS
Converts text to speech in realtime |
|
| 20 |
cmusphinx/pocketsphinx
A small speech recognizer |
|
Browse by category
.NET TTS Libraries
224 tools
General Purpose Voice Assistants
212 tools
Lightweight TTS Libraries
210 tools
Automatic Speech Recognition
192 tools
Web Speech API TTS
171 tools
Web Speech API Libraries
170 tools
Speech-To-Text Converters
154 tools
Android Speech Apps
131 tools
Keyword Speech Recognition
126 tools
End-to-End ASR Frameworks
117 tools
iOS Speech Frameworks
106 tools
Local Voice Assistants
103 tools
Self-Hosted TTS Servers
100 tools
Voice Controlled Robotics
95 tools
Python Voice Assistants
94 tools
Speech Recognition APIs
90 tools
Discord TTS Bots
88 tools
Voice Agent Applications
85 tools
Lightweight TTS Runtimes
85 tools
Audio Transcription Apps
81 tools
AI Video Generation
80 tools
Voice Chatbot Applications
79 tools
Google TTS Libraries
79 tools
Kokoro TTS Ecosystem
78 tools
Voice Cloning Tools
77 tools
Tacotron TTS Models
77 tools
Neural Vocoder Implementations
77 tools
FastSpeech TTS Models
74 tools
OpenAI TTS Applications
73 tools
Speech Corpora Datasets
72 tools
Coqui TTS Applications
71 tools
Browser TTS Extensions
71 tools
Kaldi ASR Ecosystem
69 tools
Java TTS Libraries
68 tools
eBook to Audiobook Conversion
67 tools
Text To Speech Frameworks
66 tools
CTC ASR Implementations
65 tools
Voice Command Assistants
65 tools
Speech Emotion Recognition
64 tools
Qwen3 TTS Applications
64 tools
Audio Transcription Tools
63 tools
Local Voice Dictation
63 tools
Voice ChatGPT Interfaces
63 tools
Edge TTS Implementations
62 tools
Whisper Subtitle Generation
62 tools
Go TTS Libraries
62 tools
Educational Voice Apps
60 tools
Content-to-Podcast Converters
59 tools
Android Voice Assistants
59 tools
Voice Cloning Synthesis
58 tools
Wake Word Detection
58 tools
Voice AI Learning Collections
57 tools
Vue Speech Recognition
57 tools
Voice Controlled Desktop Automation
57 tools
TTS Model Fine-Tuning
56 tools
Speech AI Coursework
55 tools
AI Avatar Platforms
54 tools
Whisper Transcription Apps
54 tools
Telegram Voice Transcription
54 tools
AI Tutoring Platforms
54 tools
Meeting Transcription Summarizers
54 tools
FunASR Speech Recognition
53 tools
Zero-Shot Voice Synthesis
53 tools
Assistive Vision AI
53 tools
Multimodal Medical Assistants
53 tools
Speaker Diarization Embedding
52 tools
Speech Translation Apps
51 tools
Wav2Vec2 ASR Models
51 tools
Text To Speech Conversion
49 tools
Gradio TTS WebUIs
48 tools
Deepgram Starter Projects
46 tools
Rust TTS Libraries
46 tools
Sign Language Translation
46 tools
eSpeak-NG Ecosystem
45 tools
Real-Time Voice Translation
44 tools
Vosk ASR Implementations
43 tools
Video Dubbing Tools
43 tools
ElevenLabs Integrations
43 tools
Piper TTS Ecosystem
43 tools
AI-Powered eReaders
42 tools
Video Transcription Extraction
40 tools
AWS Polly TTS
40 tools
PDF to Audio Conversion
39 tools
Twitch Chat TTS
39 tools
React Speech Recognition
37 tools
React Native Voice Libraries
36 tools
TTS Dataset Creation
36 tools
VITS TTS Implementations
34 tools
Sign Language Recognition
34 tools
Audio Noise Reduction
33 tools
System TTS Wrappers
33 tools
Voice Assistant Applications
33 tools
Whisper Fine-Tuning
33 tools
Conformer ASR Implementations
31 tools
Speech To Text Transcription
31 tools
Parakeet ASR Implementations
30 tools
Voice Dictation Typing
30 tools
Cross-Platform TTS Frameworks
30 tools
Grapheme-to-Phoneme Conversion
29 tools
Whisper Framework Ports
27 tools
ComfyUI TTS Nodes
27 tools
ASR Evaluation Metrics
27 tools
Live Meeting Translation
27 tools
Live Caption Generation
27 tools
Voice Enabled Coding Assistants
27 tools
Streamlit TTS Apps
26 tools
Text To Speech Tts
25 tools
SMS Voice Integrations
25 tools
Rust Speech Recognition
25 tools
Embedded TTS Systems
25 tools
PHP TTS Libraries
25 tools
Whisper Diarization
24 tools
Tts
24 tools
Anki TTS Integration
24 tools
Audio Source Separation
23 tools
Interactive AI Avatars
23 tools
News Audio Bulletins
22 tools
Voice Assistant Devices
21 tools
Voice AI SDKs
21 tools
Whisper Speech Transcription
21 tools
Yandex SpeechKit Tools
21 tools
Image-to-Speech Synthesis
20 tools
OpenClaw Voice Assistants
19 tools
Web-Based TTS Apps
19 tools
Text To Speech
18 tools
Voice Ai Agents
18 tools
Home Assistant TTS
18 tools
Speech Recognition Datasets
17 tools
Audio Music Learning
17 tools
Text Normalization Engines
17 tools
Stt
17 tools
Multilingual Speech Datasets
17 tools
Face Recognition Systems
17 tools
Ukrainian Voice AI
16 tools
IBM Watson Speech
16 tools
Voice Ai Assistants
16 tools
AI Interview Simulators
16 tools
Clipboard Text-to-Speech
15 tools
Voice Assistant Frameworks
14 tools
Personal Assistant Rag
14 tools
Persian Speech AI
14 tools
Wav2Vec2 Speech Recognition
13 tools
Conversational Chatbot Applications
13 tools
Virtual Assistants Nlp
13 tools
Voice Assistant Projects
12 tools
Audio Event Classification
12 tools
Lip Reading Synthesis
11 tools
Ai Podcast Generation
10 tools
Voice Interactive Games
10 tools
Uncategorized
9 tools
Government Procurement Docs
9 tools
Voice Controlled Calculators
9 tools
Multimodal Vision Language
7 tools
Data Annotation Tools
6 tools
Text To Video Generation
6 tools
Bioacoustic Species Classification
6 tools
Conversational Rag Agents
6 tools
Youtube Transcript Summarization
6 tools
Text Translation Tools
6 tools
Comfyui Extensions
5 tools
Speech Synthesis Diffusion
5 tools
Facial Attribute Classification
5 tools
Image Caption Generation
5 tools
Video Content Intelligence
5 tools
Chatgpt Api Tutorials
5 tools
Stable Diffusion Tools
5 tools
Deepfake Detection Systems
4 tools
Voice Controlled News Apps
4 tools
Llm Scaling Architecture
3 tools
Text Scanning Ocr
3 tools
Text Emotion Recognition
3 tools
Unity Ml Inference
3 tools
Flutter Ai Chat Apps
3 tools
Multi Modal Ai Assistants
3 tools
Machine Translation Systems
3 tools
Ai Virtual Companions
3 tools
Talking Head Generation
3 tools
Audio Classification Transformers
3 tools
Gemini Api Applications
3 tools
Ai Image Generation Platforms
3 tools
Assistive Vision Navigation
3 tools
Natural Language Task Scheduling
3 tools
Streamlit Chatbot Apps
3 tools
Agentic Ai Orchestration
3 tools
Joke Telling Apps
3 tools
Claude Skill Orchestration
3 tools
Meeting Transcription Automation
3 tools
Youtube Video Summarization
3 tools
Text To Speech Mcp
2 tools
Ai Assistant Platforms
2 tools
Discord Ai Chatbots
2 tools
Ai Chatbot Interfaces
2 tools
Respiratory Disease Detection
2 tools
Next Word Prediction
2 tools
Ai Children Storytelling
2 tools
Rust Nlp Bindings
2 tools
Llm Fine Tuning
2 tools
Clip Vision Language
2 tools
Personal Knowledge Management
2 tools
Youtube Video Intelligence
2 tools
Go Nlp Libraries
2 tools
Lyric Generation Ai
2 tools
Llm Sdk Packages
2 tools
Natural Language Command Generation
2 tools
Embedding Model Tuning
2 tools
Vs Code Ai Workflows
2 tools
Ai Translation Tools
2 tools
Llm Learning Resources
2 tools
Telegram Llm Bots
2 tools
Chatbot Nlp Frameworks
2 tools
Telemedicine Consultation Platforms
2 tools
Healthcare Ai Diagnostics
2 tools
Alzheimer Disease Detection
2 tools
Chatbot Development Frameworks
2 tools
Voice To Voice Chatbots
2 tools
Spell Checking Correction
2 tools
Ai Workflow Automation
1 tools
Text Embedding Runtimes
1 tools
Mediapipe Implementations
1 tools
Vision Language Models
1 tools
Neural Machine Translation
1 tools
Indic Language Translation
1 tools
Gpt Implementation Tutorials
1 tools
Multi Agent Orchestration
1 tools
Gemini Prompt Workbenches
1 tools
Speculative Decoding Algorithms
1 tools
Vibe Coding Frameworks
1 tools
Vietnamese Nlp Tools
1 tools
Llm Inference Serving
1 tools
Document Qa Chatbots
1 tools
Ai Terminal Agents
1 tools
Nlp Task Libraries
1 tools
Chatbot Frameworks
1 tools
Vibe Coding Framework
1 tools
Ai Note Taking Apps
1 tools
Llm Docker Deployments
1 tools
Nlp Dataset Collections
1 tools
Stress Detection Ml
1 tools
Fullstack Ai Assistants
1 tools
Temporal Expression Parsing
1 tools
Graph Database Rag
1 tools
Ai Interview Coaching
1 tools
Health App Development
1 tools
Openclaw Skill Integrations
1 tools
Hand Gesture Control
1 tools
Ml Benchmarking Frameworks
1 tools
Model Compression Optimization
1 tools
Viral Clip Generation
1 tools
Text Tokenization Libraries
1 tools
Ocr Document Extraction
1 tools
Discord Ai Bots
1 tools
Edge Camera Ml
1 tools
Reading Comprehension Qa
1 tools
Go Ml Bindings
1 tools
Facial Recognition Apps
1 tools
Musical Instrument Datasets
1 tools
Llm Translation Tools
1 tools
Edge Device Ml Frameworks
1 tools
Sacred Text Nlp
1 tools
Tokenization Libraries
1 tools
Ai Content Writing
1 tools
Voice Agent
1 tools
Ai Powered Studying
1 tools
Flashcard Generation
1 tools
Federated Learning Frameworks
1 tools
Clinical Llm Tools
1 tools
Smart Home Automation
1 tools
Semantic Kernel Tools
1 tools
Healthcare Ai Applications
1 tools
Langgraph Agent Implementations
1 tools
Word Lookup Games
1 tools
Ai Skill Integrations
1 tools
Clinical Ai Agents
1 tools
Dotnet Nlp Libraries
1 tools
Android Vision Ml
1 tools
Ml Learning Resources
1 tools
Spotify Music Recommendation
1 tools
Eye Gaze Tracking
1 tools
Nlu Game Applications
1 tools
Agent Development Frameworks
1 tools
Deepseek Deployment Tools
1 tools
Diffusion Model Frameworks
1 tools
Vibe Coding Workflows
1 tools
Variational Autoencoders Nlp
1 tools
Clinical Decision Support
1 tools
Transformer Implementation Education
1 tools
Prompt Engineering Guides
1 tools
Multimodal Rag Systems
1 tools
Aws Bedrock Applications
1 tools
Generative Ai Education
1 tools
Sentiment Analysis Applications
1 tools
Multimodal Vision Language Models
1 tools
Lexical Semantic Resources
1 tools
Conversational Ai Apps
1 tools
Image Classification Demos
1 tools
Recipe Recommendation Systems
1 tools
Text Visualization Graphs
1 tools
Turkish Ai Education Resources
1 tools
Ai Text Humanization
1 tools
Neural Architecture Text Classification
1 tools
Ai Debate Arenas
1 tools
Mojo Ml Frameworks
1 tools
Restaurant Ordering Chatbots
1 tools
Rust Onnx Runtime
1 tools
Pdf Document Chatbots
1 tools
Godot Game Ai
1 tools
Multimodal Streamlit Apps
1 tools
3D Vision Transformers
1 tools
Mental Health Risk Detection
1 tools
Llm Fine Tuning Frameworks
1 tools
Gpt Cli Interfaces
1 tools
Music Genre Classification
1 tools
Korean Text Processing
1 tools
Dotnet Openai Integrations
1 tools
Langchain Prompt Templates
1 tools
Conversational Ai Chatbots
1 tools
Gan Image Generation
1 tools
Multi Disease Risk Assessment
1 tools
Quantum Machine Learning
1 tools
Ai Video Creation
1 tools
Multimodal Fusion Transformers
1 tools
Face Recognition Embeddings
1 tools
Ai Music Production
1 tools
Ollama Chat Interfaces
1 tools
Ai Powered Saas Startups
1 tools
Gemini Interactive Agents
1 tools
Llm Implementation Tutorials
1 tools
Crop Yield Prediction
1 tools
Covid 19 Prediction Ml
1 tools
Nutrition Ai Apps
1 tools
Local Llm Orchestration
1 tools
Developer Portfolio Projects
1 tools
Document Intelligence Extraction
1 tools
Ai Investment Analysis
1 tools
Snake Game Ai
1 tools
Music Generation Transformers
1 tools
Portuguese Nlp Tools
1 tools
Toxic Comment Detection
1 tools
Twitter Sentiment Analysis
1 tools
Multi Pdf Qa Systems
1 tools
Content Generation Automation
1 tools
Machine Translation Transformers
1 tools
Javascript Ml Libraries
1 tools