All Voice AI Tools

8,165 tools ranked by quality score · Page 2 of 82

Showing 101–200 of 8,165
# Tool Score Tier
101 daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set...

61
Established
102 roryeckel/wyoming_openai

OpenAI-Compatible Proxy Middleware for the Wyoming Protocol

61
Established
103 kishanrajput23/Jarvis-Desktop-Voice-Assistant

A python based desktop voice assistant capable of executing system-level...

61
Established
104 sandrohanea/whisper.net

Whisper.net. Speech to text made simple using Whisper Models

61
Established
105 ChetanXpro/nodejs-whisper

NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as...

61
Established
106 royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI

61
Established
107 NVIDIA-AI-Blueprints/pdf-to-podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

61
Established
108 nazdridoy/kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple...

61
Established
109 PyThaiNLP/PyThaiTTS

Open Source Thai Text-to-speech library in Python

61
Established
110 zuoban/tts

tts 服务

61
Established
111 githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and...

61
Established
112 charleprr/redditube

A video generator from Reddit posts and comments

61
Established
113 Picovoice/web-voice-processor

A library for real-time voice processing in web browsers

60
Established
114 snakers4/silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

60
Established
115 deepgram/deepgram-python-sdk

Official Python SDK for Deepgram.

60
Established
116 Wikidepia/g2p-id

Indonesian Grapheme-to-Phoneme (IPA notation)

60
Established
117 sdkcarlos/artyom.js

A voice control - voice commands - speech recognition and speech synthesis...

60
Established
118 JamesBrill/react-speech-recognition

💬Speech recognition for your React app

60
Established
119 lugia19/elevenlabslib

Full python wrapper for the elevenlabs API.

60
Established
120 OpenVoiceOS/ovos-tts-server

simple flask server to host OpenVoiceOS tts plugins as a service

60
Established
121 yandexdataschool/speech_course

YSDA course in Speech Processing.

60
Established
122 mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline...

60
Established
123 morganney/tts-react

Convert text to speech using React.

60
Established
124 Vonage/vonage-ruby-sdk

Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech,...

60
Established
125 PyThaiNLP/pythaiasr

Python Thai Automatic Speech Recognition

60
Established
126 daswer123/xtts-api-server

A simple FastAPI Server to run XTTSv2

60
Established
127 revdotcom/revai-node-sdk

Node.js SDK for the Rev AI API

60
Established
128 TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art...

60
Established
129 istupakov/onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

60
Established
130 MycroftAI/mycroft-precise

A lightweight, simple-to-use, RNN wake word listener

60
Established
131 Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription,...

60
Established
132 itsmevictor/clean-transcribe

A simple CLI to transcribe Youtube videos or local audio/video files and...

59
Established
133 OpenVoiceOS/ovos-tts-plugin-espeakNG

espeakNG plugin

59
Established
134 n1teshy/yapper-tts

offline text to speech and free SOTA LLM APIs to let your programs speak to you

59
Established
135 Ailln/cn2an

📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)

59
Established
136 shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

59
Established
137 mdiller/MangoByte

A discord bot that provides the ability to play dota hero response clips, do...

59
Established
138 CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

59
Established
139 deepgram/deepgram-js-sdk

Official JavaScript SDK for Deepgram.

59
Established
140 ken107/read-aloud

An awesome browser extension that reads aloud webpage content with one click

59
Established
141 phuc-nt/my-translator

Real-time speech translation — macOS & Windows, free TTS, no server, your...

59
Established
142 mybigday/whisper.rn

React Native binding of whisper.cpp.

59
Established
143 kstonekuan/tambourine-voice

Your personal voice interface for any app. Speak naturally and your words...

59
Established
144 pilot51/voicenotify

Android app that speaks notifications

59
Established
145 linto-ai/WebVoiceSDK

Buildings block for voice-enabled applications in the browser

59
Established
146 p0n1/epub_to_audiobook

EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included

59
Established
147 coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research...

59
Established
148 Enemyx-net/VibeVoice-ComfyUI

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech...

58
Established
149 aichaos/rivescript-python

A RiveScript interpreter for Python. RiveScript is a scripting language for...

58
Established
150 tabahi/bournemouth-forced-aligner

Extract phoneme-level timestamps from speeh audio.

58
Established
151 linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

58
Established
152 thevickypedia/Jarvis

Fully Functional Voice Based Natural Language UI

58
Established
153 babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

58
Established
154 vivekuppal/transcribe

Transcribe is a real time transcription, conversation, Language learning...

58
Established
155 DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

58
Established
156 gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's...

58
Established
157 gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

58
Established
158 davidacm/NVDA-IBMTTS-Driver

This project is aimed at developing and maintaining the NVDA IBMTTS driver....

58
Established
159 richardr1126/openreader

An open-source read-along document reader server with high-quality TTS...

58
Established
160 dictation-toolbox/dragonfly

Speech recognition framework allowing powerful Python-based scripting and...

58
Established
161 altunenes/parakeet-rs

very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA...

58
Established
162 alphacep/vosk

VOSK Speech Recognition Toolkit

58
Established
163 moonstar-x/discord-tts-bot

A Text-to-Speech bot for Discord.

58
Established
164 argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

58
Established
165 fishaudio/fish-audio-python

The official Python library for the Fish Audio API.

58
Established
166 r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

58
Established
167 fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

58
Established
168 MainRo/deepspeech-server

A testing server for a speech to text service based on coqui.ai

58
Established
169 ManimCommunity/manim-voiceover

Manim plugin for all things voiceover

58
Established
170 wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

57
Established
171 kurianbenoy/whisper_normalizer

A python package for whisper normalizer

57
Established
172 capacitor-community/text-to-speech

⚡️ Capacitor plugin for synthesizing speech from text.

57
Established
173 FirezTheGreat/1SHOT

All my works - https://github.com/FirezTheGreat (latest music commands/djs...

57
Established
174 kalliope-project/kalliope

Kalliope is a framework that will help you to create your own personal assistant.

57
Established
175 jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level...

57
Established
176 dectalk/dectalk

Modern builds for the 90s/00s DECtalk text-to-speech application.

57
Established
177 Picovoice/speech-to-text-benchmark

speech to text benchmark framework

57
Established
178 nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

57
Established
179 i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7

臺灣言語工具

57
Established
180 dlutton/flutter_tts

Flutter Text to Speech package

57
Established
181 petercunha/tts

:pencil: :sound: A simple text-to-speech tool. Converts your text to speech...

57
Established
182 alphacep/vosk-android-demo

Offline speech recognition for Android with Vosk library.

57
Established
183 pnlpal/dictionariez

📚 A customizable dictionary extension that supports double-click lookups in...

57
Established
184 ai-ng/swift

Fast voice assistant powered by Groq, Cartesia, and Vercel.

57
Established
185 wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).

57
Established
186 asterics/Asterics-AAC

Free, easy-to-use AAC app with offline support, flexible input options,...

57
Established
187 openctp/openctp

openctp提供CTP股票期权、中泰证券XTP、华鑫证券奇点TORA、东方证券OST、东方财富证券EMT、盈透证券TWS、易盛TAP、量投QDP等各通道...

57
Established
188 sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine...

57
Established
189 BryceWG/BiBi-Keyboard

说点啥(BiBi Keyboard):一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR...

57
Established
190 R3gm/SoniTranslate

Synchronized Translation for Videos. Video dubbing

57
Established
191 midas-research/audino

Open source audio annotation tool for humans

57
Established
192 hkchengrex/MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality...

57
Established
193 OpenMOSS/MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model designed for expressive...

57
Established
194 yeyupiaoling/PaddlePaddle-DeepSpeech

基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。

57
Established
195 pykaldi/pykaldi

A Python wrapper for Kaldi

57
Established
196 sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition...

56
Established
197 sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and...

56
Established
198 agentvoiceresponse/avr-infra

The AVR Infrastructure project is designed to launch the Agent Voice...

56
Established
199 pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

56
Established
200 yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without...

56
Established