All Voice AI Tools
8,165 tools ranked by quality score · Page 3 of 82
| # | Tool | Description | Score | Tier |
|---|---|---|---|---|
| 201 | jianchang512/stt | Voice Recognition to Text Tool / An offline, local audio/video-to-subtitle tool that outputs JSON, SRT subtitles, and plain text | | Established |
| 202 | Migushthe2nd/MsEdgeTTS | A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud... | | Established |
| 203 | MatteoFasulo/Whisper-TikTok | From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read... | | Established |
| 204 | vox-serve/vox-serve | A Streaming-Native Serving Engine for TTS/STS Models | | Established |
| 205 | aahl/zai-tts | 🗣️ ZAI/GLM TTS to OpenAI Speech API, a free speech synthesis API with voice cloning support, based on Zhipu TTS | | Established |
| 206 | Femoon/tts-azure-web | TTS Azure Web is an Azure text-to-speech (TTS) web app that can be deployed with one click, locally or in the cloud, using your Azure Key. TTS... | | Established |
| 207 | RVC-Boss/GPT-SoVITS | 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) | | Established |
| 208 | ahmetoner/whisper-asr-webservice | OpenAI Whisper ASR Webservice API | | Established |
| 209 | rwth-i6/rasr | The RWTH ASR Toolkit. | | Established |
| 210 | MahmoudAshraf97/whisper-diarization | Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper | | Established |
| 211 | AbdullahHendy/live-translation | Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM... | | Established |
| 212 | ThioJoe/Auto-Synced-Translated-Dubs | Automatically translates the text of a video based on a subtitle file, and... | | Established |
| 213 | yuga-hashimoto/openclaw-assistant | OpenClaw voice assistant app for Android - Wake word activation & system... | | Established |
| 214 | namastexlabs/murmurai | 🎙️ Drop-in replacement for paid transcription APIs. Self-hosted,... | | Established |
| 215 | lobehub/lobe-tts | 🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser | | Established |
| 216 | GitYCC/g2pW | Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022) | | Established |
| 217 | xinjli/allosaurus | Allosaurus is a pretrained universal phone recognizer for more than 2000 languages | | Established |
| 218 | marty1885/paroli | Streaming TTS based on Piper with optional RK3588 NPU support | | Established |
| 219 | alphacep/vosk-unity-asr | Automatic Speech Recognition in Unity using Vosk library | | Established |
| 220 | haoheliu/voicefixer | General Speech Restoration | | Established |
| 221 | Stypox/dicio-android | Dicio assistant app for Android | | Established |
| 222 | justinsalamon/scaper | A library for soundscape synthesis and augmentation | | Established |
| 223 | SahilAggarwal2004/react-text-to-speech | An easy-to-use React.js library that leverages the Web Speech API to convert... | | Established |
| 224 | bshall/Tacotron | A PyTorch implementation of Location-Relative Attention Mechanisms For... | | Established |
| 225 | sooftware/conformer | [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented... | | Established |
| 226 | RageAgainstThePixel/ElevenLabs-DotNet | A Non-Official ElevenLabs RESTful API Client for dotnet | | Established |
| 227 | dimonier/tg2obsidian | This bot pulls new messages from a Telegram chat or group and puts them into... | | Established |
| 228 | antirek/voicer | AGI-server voice recognizer for #Asterisk | | Established |
| 229 | peteonrails/voxtype | Voice-to-text with push-to-talk for Wayland compositors | | Established |
| 230 | sccn/eegprep | EEGPrep is an automated preprocessing tool for human EEG data built on a... | | Established |
| 231 | astorfi/speechpy | :speech_balloon: SpeechPy - A Library for Speech Processing and Recognition:... | | Established |
| 232 | dputhier/pygtftk | A python package and a set of shell commands to handle GTF files | | Established |
| 233 | deepgram/deepgram-dotnet-sdk | Official .NET SDK for Deepgram. | | Established |
| 234 | arcosoph/nanowakeword | A lightweight, open-source, and intelligent wake word detection engine.... | | Established |
| 235 | karashiiro/TextToTalk | Chat TTS plugin for Dalamud. Has support for triggers/exclusions, several... | | Established |
| 236 | readbeyond/aeneas | aeneas is a Python/C library and a set of tools to automagically synchronize... | | Established |
| 237 | innovatorved/whisper.api | This project provides an API with user level access support to transcribe... | | Established |
| 238 | deepgram/deepgram-rust-sdk | Community Rust SDK for Deepgram. | | Established |
| 239 | AlexxIT/YandexStation | Control Yandex.Station and other smart home devices with Alice from... | | Established |
| 240 | JackismyShephard/ultimate-rvc | An app for creating audio-based content such as song covers and speech using... | | Established |
| 241 | High-Logic/Genie-TTS | GPT-SoVITS ONNX Inference Engine & Model Converter | | Established |
| 242 | krillinai/KrillinAI | Video translation and dubbing tool powered by LLMs. The video translator... | | Established |
| 243 | flashlight/wav2letter | Facebook AI Research's Automatic Speech Recognition Toolkit | | Established |
| 244 | FireRedTeam/FireRedASR | Open-source industrial-grade ASR models supporting Mandarin, Chinese... | | Established |
| 245 | machinelearningZH/audio-transcription | Transcribe any audio or video file. Edit and view your transcripts in a... | | Established |
| 246 | OpenMOSS/MOSS-TTS | MOSS-TTS Family is an open-source speech and sound generation model family... | | Established |
| 247 | remsky/Kokoro-FastAPI | Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX... | | Established |
| 248 | Saurav-Paul/AI-virtual-assistant-python | Command line virtual assistant for competitive programming | | Established |
| 249 | Lyrcaxis/KokoroSharp | Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker,... | | Established |
| 250 | wannaphong/ttsmms | TTS with The Massively Multilingual Speech (MMS) project | | Established |
| 251 | hugobloem/wyoming-microsoft-tts | Wyoming protocol server for Microsoft Azure text-to-speech | | Established |
| 252 | Aivis-Project/AivisSpeech-Engine | AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine | | Established |
| 253 | TrevorS/voxtral-mini-realtime-rs | Streaming speech recognition running natively and in the browser. A pure... | | Established |
| 254 | linto-ai/linto-stt | An automatic speech recognition API | | Established |
| 255 | swlegion/tts | Table Top Simulator Mod for Star Wars: Legion | | Established |
| 256 | mbsantiago/whombat | Audio Annotation Tool for ML development | | Established |
| 257 | codename0og/codename-rvc-fork-4 | Codename's rvc fork version 4, based on Applio. | | Established |
| 258 | double22a/speech_dataset | The dataset of Speech Recognition | | Established |
| 259 | ttop32/MouseTooltipTranslator | Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator,... | | Established |
| 260 | mlalma/kokoro-ios | Kokoro TTS for iOS and macOSX | | Established |
| 261 | MattyB95/Jabberjay | 🦜 Synthetic Voice Detection | | Established |
| 262 | Aivis-Project/aivmlib | Aivis Voice Model File (.aivm/.aivmx) Utility Library | | Established |
| 263 | DevEmperor/Dictate | A powerful Whisper AI keyboard for reliable speech transcription | | Established |
| 264 | hs-CN/msedge-tts | This library is a wrapper of MSEdge Read aloud function API. You can use it... | | Established |
| 265 | VolcanicArts/VRCOSC | A modular node-programming language, program creator, animation system,... | | Established |
| 266 | evancohen/sonus | :speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword... | | Established |
| 267 | stepfun-ai/Step-Audio-EditX | A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model... | | Established |
| 268 | shivammehta25/Neural-HMM | Neural HMMs are all you need (for high-quality attention-free TTS) | | Established |
| 269 | jtCodes/lyrictor | Browser-based lyric video editor built for complex timelines with hundreds... | | Established |
| 270 | Blaizzy/mlx-audio-swift | A modular Swift SDK for audio processing with MLX on Apple Silicon | | Established |
| 271 | mgonzs13/whisper_ros | Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2 | | Established |
| 272 | ArkanDash/Advanced-RVC-Inference | Advanced RVC Inference for quicker and effortless model downloads | | Established |
| 273 | stemrollerapp/stemroller | Isolate vocals, drums, bass, and other instrumental stems from any song | | Established |
| 274 | lucasnewman/f5-tts-mlx | Implementation of F5-TTS in MLX | | Established |
| 275 | ynop/audiomate | Python library for handling audio datasets. | | Established |
| 276 | HumeAI/hume-typescript-sdk | Add Hume AI to any TypeScript project | | Established |
| 277 | Oaklight/asr2clip | Handy CLI tool to convert your speech to clipboard text | | Established |
| 278 | mateogon/pdf-narrator | Convert your PDFs and EPUBs into audiobooks effortlessly. Features... | | Established |
| 279 | met4citizen/HeadTTS | HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for... | | Established |
| 280 | jpreprocess/jpreprocess | Japanese text preprocessor for Text-to-Speech applications (OpenJTalk... | | Established |
| 281 | funnyzak/tts-now | Cross-platform text-to-speech assistant built on cloud speech synthesis APIs (Alibaba Cloud, iFlytek, etc.). Supports quick single-text synthesis and batch synthesis. Supports Windows, macOS, and Linux. | | Established |
| 282 | netease-youdao/EmotiVoice | EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine | | Established |
| 283 | Softcatala/open-dubbing | Open dubbing is an AI dubbing system which uses machine learning models to... | | Established |
| 284 | LokerL/tts-vue | 🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite. | | Established |
| 285 | EddyVerbruggen/nativescript-speech-recognition | :speech_balloon: Speech to text, using the awesome engines readily available... | | Established |
| 286 | chinokikiss/GSV-TTS-Lite | GSV-TTS-Lite A high-performance inference engine specifically designed for... | | Established |
| 287 | emnikhil/Sign-Language-To-Text-Conversion | Sign Language to Text Conversion is a real-time system that uses a camera to... | | Established |
| 288 | jpreprocess/jbonsai | Voice synthesis library for Text-to-Speech applications (Currently HTS... | | Established |
| 289 | Lex-au/Orpheus-FastAPI | High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,... | | Established |
| 290 | alphacep/vosk-server | WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi... | | Established |
| 291 | hgneng/ekho | Chinese text-to-speech engine | | Established |
| 292 | thewh1teagle/pyannote-rs | pyannote audio diarization in rust | | Established |
| 293 | jianchang512/ChatTTS-ui | A simple local web interface that uses ChatTTS to synthesize speech from text and also exposes an external API. A simple native web interface... | | Established |
| 294 | Henry-23/VideoChat | Real-time interactive digital human with customizable appearance and voice, voice cloning support, and conversation latency as low as 3 s. Real-time voice interactive digital human,... | | Established |
| 295 | drmfinlay/tts-util-app | TTS Util - Text-to-speech utility Android app for synthesising text into... | | Established |
| 296 | IhorShevchuk/piper-app | The original Piper, now on iOS and macOS | | Established |
| 297 | LibreSpark/LibreTTS | TTS: text-to-speech / text-to-speech front end, compatible with OpenAI, EdgeTTS, and other interfaces | | Established |
| 298 | wxxxcxx/ms-ra-forwarder | Free online text-to-speech API | | Established |
| 299 | Notely-Voice/NotelyVoice | A 100% private AI voice transcription app that converts speech to text in... | | Established |
| 300 | rzru/nightingale | Machine learning powered Karaoke app (with scores!) | | Established |