The Voice AI Directory

Quality-scored directory of 8,165 voice AI tools, updated daily. Every tool is scored on maintenance, adoption, maturity, and community signals.

Voice AI covers text-to-speech synthesis, speech recognition, voice cloning, voice agents, and audio processing.

Verified: 24 tools (score 70–100)

Established: 497 tools (score 50–69)

Emerging: 2,908 tools (score 30–49)

Experimental: 4,736 tools (score 10–29)
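
The tier bands above amount to a simple lookup from score to tier. The following is a minimal sketch of that lookup, not the directory's actual scoring code: the names `TIERS`, `TIER_COUNTS`, and `tier_for_score` are hypothetical, and the handling of scores below 10 (returning None) is an assumption, since the page lists no tier for them.

```python
from typing import Optional

# Score bands as listed in the tier summary: (tier, low, high), both inclusive.
TIERS = [
    ("Verified", 70, 100),
    ("Established", 50, 69),
    ("Emerging", 30, 49),
    ("Experimental", 10, 29),
]

# Tool counts per tier, as listed in the tier summary.
TIER_COUNTS = {
    "Verified": 24,
    "Established": 497,
    "Emerging": 2908,
    "Experimental": 4736,
}


def tier_for_score(score: int) -> Optional[str]:
    """Return the tier whose band contains `score`, or None if no band does."""
    for name, low, high in TIERS:
        if low <= score <= high:
            return name
    # Assumption: scores outside 10-100 belong to no listed tier.
    return None


# The per-tier counts add up to the directory total of 8,165 tools.
assert sum(TIER_COUNTS.values()) == 8165
```

For example, a tool scored 88 would land in the Verified band and a tool scored 69 in the Established band under this reading of the ranges.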

Top tools by quality score

1. k2-fsa/sherpa-onnx (score 88): Speech-to-text, text-to-speech, speaker diarization, speech enhancement,...
2. Uberi/speech_recognition (score 85): Speech recognition module for Python, supporting several engines and APIs,...
3. TalAter/annyang (score 84): 💬 Speech recognition for your site
4. espnet/espnet (score 83): End-to-End Speech Processing Toolkit
5. Blaizzy/mlx-audio (score 80): A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)...
6. m-bain/whisperX (score 80): WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
7. elevenlabs/elevenlabs-python (score 79): The official Python SDK for the ElevenLabs API.
8. rapidaai/voice-ai (score 76): Rapida is an open-source, end-to-end voice AI orchestration platform for...
9. DrewThomasson/ebook2audiobook (score 76): Generate audiobooks from e-books, voice cloning & 1158+ languages!
10. OpenBMB/VoxCPM (score 75): VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and...
11. PaddlePaddle/PaddleSpeech (score 74): Easy-to-use Speech Toolkit including Self-Supervised Learning model,...
12. jdepoix/youtube-transcript-api (score 73): This is a python API which allows you to get the transcript/subtitles for a...
13. salute-developers/GigaAM (score 73): Foundational Model for Speech Recognition Tasks
14. espeak-ng/espeak-ng (score 73): eSpeak NG is an open source speech synthesizer that supports more than...
15. met4citizen/TalkingHead (score 73): Talking Head (3D): A JavaScript class for real-time lip-sync using full-body...
16. ggml-org/whisper.cpp (score 72): Port of OpenAI's Whisper model in C/C++
17. jianchang512/pyvideotrans (score 72): Translate the video from one language to another and embed dubbing & subtitles.
18. nateshmbhat/pyttsx3 (score 72): Offline Text To Speech synthesis for python
19. KoljaB/RealtimeTTS (score 71): Converts text to speech in realtime
20. cmusphinx/pocketsphinx (score 71): A small speech recognizer

Browse by category

.NET TTS Libraries

224 tools

General Purpose Voice Assistants

212 tools

Lightweight TTS Libraries

210 tools

Automatic Speech Recognition

192 tools

Web Speech API TTS

171 tools

Web Speech API Libraries

170 tools

Speech-To-Text Converters

154 tools

Android Speech Apps

131 tools

Keyword Speech Recognition

126 tools

End-to-End ASR Frameworks

117 tools

iOS Speech Frameworks

106 tools

Local Voice Assistants

103 tools

Self-Hosted TTS Servers

100 tools

Voice Controlled Robotics

95 tools

Python Voice Assistants

94 tools

Speech Recognition APIs

90 tools

Discord TTS Bots

88 tools

Voice Agent Applications

85 tools

Lightweight TTS Runtimes

85 tools

Audio Transcription Apps

81 tools

AI Video Generation

80 tools

Voice Chatbot Applications

79 tools

Google TTS Libraries

79 tools

Kokoro TTS Ecosystem

78 tools

Voice Cloning Tools

77 tools

Tacotron TTS Models

77 tools

Neural Vocoder Implementations

77 tools

FastSpeech TTS Models

74 tools

OpenAI TTS Applications

73 tools

Speech Corpora Datasets

72 tools

Coqui TTS Applications

71 tools

Browser TTS Extensions

71 tools

Kaldi ASR Ecosystem

69 tools

Java TTS Libraries

68 tools

eBook to Audiobook Conversion

67 tools

Text To Speech Frameworks

66 tools

CTC ASR Implementations

65 tools

Voice Command Assistants

65 tools

Speech Emotion Recognition

64 tools

Qwen3 TTS Applications

64 tools

Audio Transcription Tools

63 tools

Local Voice Dictation

63 tools

Voice ChatGPT Interfaces

63 tools

Edge TTS Implementations

62 tools

Whisper Subtitle Generation

62 tools

Go TTS Libraries

62 tools

Educational Voice Apps

60 tools

Content-to-Podcast Converters

59 tools

Android Voice Assistants

59 tools

Voice Cloning Synthesis

58 tools

Wake Word Detection

58 tools

Voice AI Learning Collections

57 tools

Vue Speech Recognition

57 tools

Voice Controlled Desktop Automation

57 tools

TTS Model Fine-Tuning

56 tools

Speech AI Coursework

55 tools

AI Avatar Platforms

54 tools

Whisper Transcription Apps

54 tools

Telegram Voice Transcription

54 tools

AI Tutoring Platforms

54 tools

Meeting Transcription Summarizers

54 tools

FunASR Speech Recognition

53 tools

Zero-Shot Voice Synthesis

53 tools

Assistive Vision AI

53 tools

Multimodal Medical Assistants

53 tools

Speaker Diarization Embedding

52 tools

Speech Translation Apps

51 tools

Wav2Vec2 ASR Models

51 tools

Text To Speech Conversion

49 tools

Gradio TTS WebUIs

48 tools

Deepgram Starter Projects

46 tools

Rust TTS Libraries

46 tools

Sign Language Translation

46 tools

eSpeak-NG Ecosystem

45 tools

Real-Time Voice Translation

44 tools

Vosk ASR Implementations

43 tools

Video Dubbing Tools

43 tools

ElevenLabs Integrations

43 tools

Piper TTS Ecosystem

43 tools

AI-Powered eReaders

42 tools

Video Transcription Extraction

40 tools

AWS Polly TTS

40 tools

PDF to Audio Conversion

39 tools

Twitch Chat TTS

39 tools

React Speech Recognition

37 tools

React Native Voice Libraries

36 tools

TTS Dataset Creation

36 tools

VITS TTS Implementations

34 tools

Sign Language Recognition

34 tools

Audio Noise Reduction

33 tools

System TTS Wrappers

33 tools

Voice Assistant Applications

33 tools

Whisper Fine-Tuning

33 tools

Conformer ASR Implementations

31 tools

Speech To Text Transcription

31 tools

Parakeet ASR Implementations

30 tools

Voice Dictation Typing

30 tools

Cross-Platform TTS Frameworks

30 tools

Grapheme-to-Phoneme Conversion

29 tools

Whisper Framework Ports

27 tools

ComfyUI TTS Nodes

27 tools

ASR Evaluation Metrics

27 tools

Live Meeting Translation

27 tools

Live Caption Generation

27 tools

Voice Enabled Coding Assistants

27 tools

Streamlit TTS Apps

26 tools

Text To Speech TTS

25 tools

SMS Voice Integrations

25 tools

Rust Speech Recognition

25 tools

Embedded TTS Systems

25 tools

PHP TTS Libraries

25 tools

Whisper Diarization

24 tools

TTS

24 tools

Anki TTS Integration

24 tools

Audio Source Separation

23 tools

Interactive AI Avatars

23 tools

News Audio Bulletins

22 tools

Voice Assistant Devices

21 tools

Voice AI SDKs

21 tools

Whisper Speech Transcription

21 tools

Yandex SpeechKit Tools

21 tools

Image-to-Speech Synthesis

20 tools

OpenClaw Voice Assistants

19 tools

Web-Based TTS Apps

19 tools

Text To Speech

18 tools

Voice AI Agents

18 tools

Home Assistant TTS

18 tools

Speech Recognition Datasets

17 tools

Audio Music Learning

17 tools

Text Normalization Engines

17 tools

STT

17 tools

Multilingual Speech Datasets

17 tools

Face Recognition Systems

17 tools

Ukrainian Voice AI

16 tools

IBM Watson Speech

16 tools

Voice AI Assistants

16 tools

AI Interview Simulators

16 tools

Clipboard Text-to-Speech

15 tools

Voice Assistant Frameworks

14 tools

Personal Assistant RAG

14 tools

Persian Speech AI

14 tools

Wav2Vec2 Speech Recognition

13 tools

Conversational Chatbot Applications

13 tools

Virtual Assistants NLP

13 tools

Voice Assistant Projects

12 tools

Audio Event Classification

12 tools

Lip Reading Synthesis

11 tools

AI Podcast Generation

10 tools

Voice Interactive Games

10 tools

Uncategorized

9 tools

Government Procurement Docs

9 tools

Voice Controlled Calculators

9 tools

Multimodal Vision Language

7 tools

Data Annotation Tools

6 tools

Text To Video Generation

6 tools

Bioacoustic Species Classification

6 tools

Conversational RAG Agents

6 tools

YouTube Transcript Summarization

6 tools

Text Translation Tools

6 tools

ComfyUI Extensions

5 tools

Speech Synthesis Diffusion

5 tools

Facial Attribute Classification

5 tools

Image Caption Generation

5 tools

Video Content Intelligence

5 tools

ChatGPT API Tutorials

5 tools

Stable Diffusion Tools

5 tools

Deepfake Detection Systems

4 tools

Voice Controlled News Apps

4 tools

LLM Scaling Architecture

3 tools

Text Scanning OCR

3 tools

Text Emotion Recognition

3 tools

Unity ML Inference

3 tools

Flutter AI Chat Apps

3 tools

Multi Modal AI Assistants

3 tools

Machine Translation Systems

3 tools

AI Virtual Companions

3 tools

Talking Head Generation

3 tools

Audio Classification Transformers

3 tools

Gemini API Applications

3 tools

AI Image Generation Platforms

3 tools

Assistive Vision Navigation

3 tools

Natural Language Task Scheduling

3 tools

Streamlit Chatbot Apps

3 tools

Agentic AI Orchestration

3 tools

Joke Telling Apps

3 tools

Claude Skill Orchestration

3 tools

Meeting Transcription Automation

3 tools

YouTube Video Summarization

3 tools

Text To Speech MCP

2 tools

AI Assistant Platforms

2 tools

Discord AI Chatbots

2 tools

AI Chatbot Interfaces

2 tools

Respiratory Disease Detection

2 tools

Next Word Prediction

2 tools

AI Children Storytelling

2 tools

Rust NLP Bindings

2 tools

LLM Fine Tuning

2 tools

CLIP Vision Language

2 tools

Personal Knowledge Management

2 tools

YouTube Video Intelligence

2 tools

Go NLP Libraries

2 tools

Lyric Generation AI

2 tools

LLM SDK Packages

2 tools

Natural Language Command Generation

2 tools

Embedding Model Tuning

2 tools

VS Code AI Workflows

2 tools

AI Translation Tools

2 tools

LLM Learning Resources

2 tools

Telegram LLM Bots

2 tools

Chatbot NLP Frameworks

2 tools

Telemedicine Consultation Platforms

2 tools

Healthcare AI Diagnostics

2 tools

Alzheimer Disease Detection

2 tools

Chatbot Development Frameworks

2 tools

Voice To Voice Chatbots

2 tools

Spell Checking Correction

2 tools

AI Workflow Automation

1 tool

Text Embedding Runtimes

1 tool

MediaPipe Implementations

1 tool

Vision Language Models

1 tool

Neural Machine Translation

1 tool

Indic Language Translation

1 tool

GPT Implementation Tutorials

1 tool

Multi Agent Orchestration

1 tool

Gemini Prompt Workbenches

1 tool

Speculative Decoding Algorithms

1 tool

Vibe Coding Frameworks

1 tool

Vietnamese NLP Tools

1 tool

LLM Inference Serving

1 tool

Document QA Chatbots

1 tool

AI Terminal Agents

1 tool

NLP Task Libraries

1 tool

Chatbot Frameworks

1 tool

Vibe Coding Framework

1 tool

AI Note Taking Apps

1 tool

LLM Docker Deployments

1 tool

NLP Dataset Collections

1 tool

Stress Detection ML

1 tool

Fullstack AI Assistants

1 tool

Temporal Expression Parsing

1 tool

Graph Database RAG

1 tool

AI Interview Coaching

1 tool

Health App Development

1 tool

OpenClaw Skill Integrations

1 tool

Hand Gesture Control

1 tool

ML Benchmarking Frameworks

1 tool

Model Compression Optimization

1 tool

Viral Clip Generation

1 tool

Text Tokenization Libraries

1 tool

OCR Document Extraction

1 tool

Discord AI Bots

1 tool

Edge Camera ML

1 tool

Reading Comprehension QA

1 tool

Go ML Bindings

1 tool

Facial Recognition Apps

1 tool

Musical Instrument Datasets

1 tool

LLM Translation Tools

1 tool

Edge Device ML Frameworks

1 tool

Sacred Text NLP

1 tool

Tokenization Libraries

1 tool

AI Content Writing

1 tool

Voice Agent

1 tool

AI Powered Studying

1 tool

Flashcard Generation

1 tool

Federated Learning Frameworks

1 tool

Clinical LLM Tools

1 tool

Smart Home Automation

1 tool

Semantic Kernel Tools

1 tool

Healthcare AI Applications

1 tool

LangGraph Agent Implementations

1 tool

Word Lookup Games

1 tool

AI Skill Integrations

1 tool

Clinical AI Agents

1 tool

.NET NLP Libraries

1 tool

Android Vision ML

1 tool

ML Learning Resources

1 tool

Spotify Music Recommendation

1 tool

Eye Gaze Tracking

1 tool

NLU Game Applications

1 tool

Agent Development Frameworks

1 tool

DeepSeek Deployment Tools

1 tool

Diffusion Model Frameworks

1 tool

Vibe Coding Workflows

1 tool

Variational Autoencoders NLP

1 tool

Clinical Decision Support

1 tool

Transformer Implementation Education

1 tool

Prompt Engineering Guides

1 tool

Multimodal RAG Systems

1 tool

AWS Bedrock Applications

1 tool

Generative AI Education

1 tool

Sentiment Analysis Applications

1 tool

Multimodal Vision Language Models

1 tool

Lexical Semantic Resources

1 tool

Conversational AI Apps

1 tool

Image Classification Demos

1 tool

Recipe Recommendation Systems

1 tool

Text Visualization Graphs

1 tool

Turkish AI Education Resources

1 tool

AI Text Humanization

1 tool

Neural Architecture Text Classification

1 tool

AI Debate Arenas

1 tool

Mojo ML Frameworks

1 tool

Restaurant Ordering Chatbots

1 tool

Rust ONNX Runtime

1 tool

PDF Document Chatbots

1 tool

Godot Game AI

1 tool

Multimodal Streamlit Apps

1 tool

3D Vision Transformers

1 tool

Mental Health Risk Detection

1 tool

LLM Fine Tuning Frameworks

1 tool

GPT CLI Interfaces

1 tool

Music Genre Classification

1 tool

Korean Text Processing

1 tool

.NET OpenAI Integrations

1 tool

LangChain Prompt Templates

1 tool

Conversational AI Chatbots

1 tool

GAN Image Generation

1 tool

Multi Disease Risk Assessment

1 tool

Quantum Machine Learning

1 tool

AI Video Creation

1 tool

Multimodal Fusion Transformers

1 tool

Face Recognition Embeddings

1 tool

AI Music Production

1 tool

Ollama Chat Interfaces

1 tool

AI Powered SaaS Startups

1 tool

Gemini Interactive Agents

1 tool

LLM Implementation Tutorials

1 tool

Crop Yield Prediction

1 tool

COVID-19 Prediction ML

1 tool

Nutrition AI Apps

1 tool

Local LLM Orchestration

1 tool

Developer Portfolio Projects

1 tool

Document Intelligence Extraction

1 tool

AI Investment Analysis

1 tool

Snake Game AI

1 tool

Music Generation Transformers

1 tool

Portuguese NLP Tools

1 tool

Toxic Comment Detection

1 tool

Twitter Sentiment Analysis

1 tool

Multi PDF QA Systems

1 tool

Content Generation Automation

1 tool

Machine Translation Transformers

1 tool

JavaScript ML Libraries

1 tool