All Voice AI Tools

8,165 tools ranked by quality score

Showing 1–100 of 8,165
# Tool Score Tier
1 k2-fsa/sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement,...

88
Verified
2 Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs,...

85
Verified
3 TalAter/annyang

💬 Speech recognition for your site

84
Verified
4 espnet/espnet

End-to-End Speech Processing Toolkit

83
Verified
5 Blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)...

80
Verified
6 m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

80
Verified
7 elevenlabs/elevenlabs-python

The official Python SDK for the ElevenLabs API.

79
Verified
8 rapidaai/voice-ai

Rapida is an open-source, end-to-end voice AI orchestration platform for...

76
Verified
9 DrewThomasson/ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1158+ languages!

76
Verified
10 OpenBMB/VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and...

75
Verified
11 PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model,...

74
Verified
12 jdepoix/youtube-transcript-api

This is a python API which allows you to get the transcript/subtitles for a...

73
Verified
13 salute-developers/GigaAM

Foundational Model for Speech Recognition Tasks

73
Verified
14 espeak-ng/espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than...

73
Verified
15 met4citizen/TalkingHead

Talking Head (3D): A JavaScript class for real-time lip-sync using full-body...

73
Verified
16 ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

72
Verified
17 jianchang512/pyvideotrans

Translate the video from one language to another and embed dubbing & subtitles.

72
Verified
18 nateshmbhat/pyttsx3

Offline Text To Speech synthesis for python

72
Verified
19 KoljaB/RealtimeTTS

Converts text to speech in realtime

71
Verified
20 cmusphinx/pocketsphinx

A small speech recognizer

71
Verified
21 alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers...

71
Verified
22 FluidInference/FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text,...

71
Verified
23 devnen/Chatterbox-TTS-Server

Self-host the powerful Chatterbox TTS model. This server offers a...

70
Verified
24 pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU...

70
Verified
25 descriptinc/descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz,...

69
Established
26 mozilla-ai/document-to-podcast

Blueprint by Mozilla.ai for generating podcasts from documents using local AI

69
Established
27 lucidrains/HS-TasNet

Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation...

69
Established
28 readest/readest

Readest is a modern, feature-rich ebook reader designed for avid readers...

69
Established
29 livekit/livekit

End-to-end realtime stack for connecting humans and AI

69
Established
30 EDCD/EDDI

Companion application for Elite Dangerous

69
Established
31 k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

69
Established
32 IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

69
Established
33 pndurette/gTTS

Python library and CLI tool to interface with Google Translate's text-to-speech API

68
Established
34 Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

68
Established
35 diodiogod/TTS-Audio-Suite

A ComfyUI custom node integration for multi-engine multi-language...

68
Established
36 collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

68
Established
37 EDDiscovery/EDDiscovery

Captain's log and 3D star map for Elite Dangerous

68
Established
38 kxxt/aspeak

A simple text-to-speech client for Azure TTS API.

68
Established
39 Picovoice/rhino

On-device Speech-to-Intent engine powered by deep learning

67
Established
40 Vonage/vonage-php-sdk-core

Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech,...

67
Established
41 meizhong986/WhisperJAV

ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD....

67
Established
42 Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

67
Established
43 thewh1teagle/kokoro-onnx

TTS with kokoro and onnx runtime

67
Established
44 cboard-org/cboard

Augmentative and Alternative Communication (AAC) system with text-to-speech...

67
Established
45 jamiepine/voicebox

The open-source voice synthesis studio

67
Established
46 huggingface/speech-to-speech

Build local voice agents with open-source models

67
Established
47 Picovoice/porcupine

On-device wake word detection powered by deep learning

67
Established
48 rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT...

66
Established
49 mbailey/voicemode

Natural (2-way) voice conversations with Claude Code

66
Established
50 speechmatics/speechmatics-python

Python library and CLI for Speechmatics

66
Established
51 thewh1teagle/sherpa-rs

Rust bindings to https://github.com/k2-fsa/sherpa-onnx

66
Established
52 lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model,...

65
Established
53 SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

65
Established
54 RHVoice/RHVoice

a free and open source speech synthesizer for Russian and other languages

65
Established
55 foyoux/pygtrans

Google Translate, with APIKEY support; translates 100,000 entries in one go

65
Established
56 software-mansion/react-native-executorch

Declarative way to run AI models in React Native on device, powered by ExecuTorch.

65
Established
57 travisvn/chatterbox-tts-api

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling...

65
Established
58 Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on...

64
Established
59 Vonage/vonage-node-sdk

Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech,...

64
Established
60 shibing624/parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine....

64
Established
61 FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training...

64
Established
62 pion/mediadevices

Go implementation of the MediaDevices API.

64
Established
63 jatinkrmalik/vocalinux

Free, open-source, 100% offline voice dictation for Linux. Speak and type...

64
Established
64 compulim/web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services for both speech-to-text and...

64
Established
65 vilassn/whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

64
Established
66 index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

63
Established
67 yeyupiaoling/MASR

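A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition; currently supports Conformer, Squeezeformer, DeepSpeech2...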

63
Established
68 herimor/voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and...

63
Established
69 rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio,...

63
Established
70 yeyupiaoling/PPASR

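End-to-end Chinese speech recognition built on PaddlePaddle, from getting started to hands-on practice, with very simple starter examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Confor...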

63
Established
71 khanld/chunkformer

ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription

63
Established
72 santinic/audiblez

Generate audiobooks from e-books

63
Established
73 ccoreilly/vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly...

63
Established
74 denizsafak/abogen

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

62
Established
75 thewh1teagle/phonikud

Hebrew grapheme to phoneme (G2P)

62
Established
76 jamsch/expo-speech-recognition

Speech Recognition for React Native Expo projects

62
Established
77 tsmdt/whisply

💬 Fast, cross-platform CLI and GUI for batch transcription, translation,...

62
Established
78 TensorSpeech/TensorFlowASR

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in...

62
Established
79 k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using...

62
Established
80 supertone-inc/supertonic

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

62
Established
81 Rei-x/discord-speech-recognition

Speech to text extension for discord.js

62
Established
82 tensorflow/lingvo

Lingvo

62
Established
83 playht/pyht

PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API

62
Established
84 kahne/fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

62
Established
85 FelippeChemello/podcast-maker

Fully automated video maker using motion graphics and text-to-speech...

62
Established
86 fishaudio/fish-speech

SOTA Open Source TTS

62
Established
87 amicalhq/amical

🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no...

62
Established
88 modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA...

62
Established
89 ieasybooks/tafrigh

Transcribe text and create SRT and VTT files using Whisper models and wit.ai technology.

61
Established
90 githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path,...

61
Established
91 Azure-Samples/Cognitive-Speech-TTS

Microsoft Text-to-Speech API sample code in several languages, part of...

61
Established
92 gunthercox/chatterbot-voice

An example of verbal communication using ChatterBot

61
Established
93 gradio-app/fastrtc

The python library for real-time communication

61
Established
94 pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

61
Established
95 travisvn/edge-tts-universal

Use Microsoft Edge's online text-to-speech service in Node.js, browsers, or...

61
Established
96 analyticsinmotion/werpy

🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error...

61
Established
97 speechbrain/speechbrain

A PyTorch-based Speech Toolkit

61
Established
98 dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

61
Established
99 janvarev/Irene-Voice-Assistant

Irene - a Russian voice assistant that works offline. Supports skills...

61
Established
100 fgnt/meeteval

MeetEval - A meeting transcription evaluation toolkit

61
Established