All Voice AI Tools

8,165 tools ranked by quality score · Page 3 of 82

Showing 201–300 of 8,165
# Tool Score Tier
201 jianchang512/stt

Voice Recognition to Text Tool / An offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats

56
Established
202 Migushthe2nd/MsEdgeTTS

A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud...

56
Established
203 MatteoFasulo/Whisper-TikTok

From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read...

56
Established
204 vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models

56
Established
205 aahl/zai-tts

🗣️ ZAI/GLM TTS to OpenAI Speech API, a free speech synthesis API with voice cloning support, based on Zhipu TTS

56
Established
206 Femoon/tts-azure-web

TTS Azure Web is an Azure text-to-speech (TTS) web application that can be deployed with one click, locally or in the cloud, using your Azure Key. TTS...

56
Established
207 RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

56
Established
208 ahmetoner/whisper-asr-webservice

OpenAI Whisper ASR Webservice API

56
Established
209 rwth-i6/rasr

The RWTH ASR Toolkit.

56
Established
210 MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

56
Established
211 AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM...

56
Established
212 ThioJoe/Auto-Synced-Translated-Dubs

Automatically translates the text of a video based on a subtitle file, and...

56
Established
213 yuga-hashimoto/openclaw-assistant

OpenClaw voice assistant app for Android - Wake word activation & system...

56
Established
214 namastexlabs/murmurai

🎙️ Drop-in replacement for paid transcription APIs. Self-hosted,...

56
Established
215 lobehub/lobe-tts

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

56
Established
216 GitYCC/g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)

56
Established
217 xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

56
Established
218 marty1885/paroli

Streaming TTS based on Piper with optional RK3588 NPU support

56
Established
219 alphacep/vosk-unity-asr

Automatic Speech Recognition in Unity using Vosk library

56
Established
220 haoheliu/voicefixer

General Speech Restoration

56
Established
221 Stypox/dicio-android

Dicio assistant app for Android

56
Established
222 justinsalamon/scaper

A library for soundscape synthesis and augmentation

56
Established
223 SahilAggarwal2004/react-text-to-speech

An easy-to-use React.js library that leverages the Web Speech API to convert...

56
Established
224 bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For...

55
Established
225 sooftware/conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented...

55
Established
226 RageAgainstThePixel/ElevenLabs-DotNet

A Non-Official ElevenLabs RESTful API Client for dotnet

55
Established
227 dimonier/tg2obsidian

This bot pulls new messages from a Telegram chat or group and puts them into...

55
Established
228 antirek/voicer

AGI-server voice recognizer for #Asterisk

55
Established
229 peteonrails/voxtype

Voice-to-text with push-to-talk for Wayland compositors

55
Established
230 sccn/eegprep

EEGPrep is an automated preprocessing tool for human EEG data built on a...

55
Established
231 astorfi/speechpy

💬 SpeechPy - A Library for Speech Processing and Recognition:...

55
Established
232 dputhier/pygtftk

A python package and a set of shell commands to handle GTF files

55
Established
233 deepgram/deepgram-dotnet-sdk

Official .NET SDK for Deepgram.

55
Established
234 arcosoph/nanowakeword

A lightweight, open-source, and intelligent wake word detection engine....

55
Established
235 karashiiro/TextToTalk

Chat TTS plugin for Dalamud. Has support for triggers/exclusions, several...

55
Established
236 readbeyond/aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize...

55
Established
237 innovatorved/whisper.api

This project provides an API with user level access support to transcribe...

55
Established
238 deepgram/deepgram-rust-sdk

Community Rust SDK for Deepgram.

55
Established
239 AlexxIT/YandexStation

Control Yandex.Station and other smart home devices with Alice from...

55
Established
240 JackismyShephard/ultimate-rvc

An app for creating audio-based content such as song covers and speech using...

55
Established
241 High-Logic/Genie-TTS

GPT-SoVITS ONNX Inference Engine & Model Converter

55
Established
242 krillinai/KrillinAI

Video translation and dubbing tool powered by LLMs. The video translator...

55
Established
243 flashlight/wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

55
Established
244 FireRedTeam/FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese...

55
Established
245 machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a...

55
Established
246 OpenMOSS/MOSS-TTS

MOSS‑TTS Family is an open‑source speech and sound generation model family...

55
Established
247 remsky/Kokoro-FastAPI

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX...

55
Established
248 Saurav-Paul/AI-virtual-assistant-python

Command line virtual assistant for competitive programming

55
Established
249 Lyrcaxis/KokoroSharp

Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker,...

55
Established
250 wannaphong/ttsmms

TTS with The Massively Multilingual Speech (MMS) project

54
Established
251 hugobloem/wyoming-microsoft-tts

Wyoming protocol server for Microsoft Azure text-to-speech

54
Established
252 Aivis-Project/AivisSpeech-Engine

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

54
Established
253 TrevorS/voxtral-mini-realtime-rs

Streaming speech recognition running natively and in the browser. A pure...

54
Established
254 linto-ai/linto-stt

An automatic speech recognition API

54
Established
255 swlegion/tts

Table Top Simulator Mod for Star Wars: Legion

54
Established
256 mbsantiago/whombat

Audio Annotation Tool for ML development

54
Established
257 codename0og/codename-rvc-fork-4

Codename's rvc fork version 4, based on Applio.

54
Established
258 double22a/speech_dataset

The dataset of Speech Recognition

54
Established
259 ttop32/MouseTooltipTranslator

Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator,...

54
Established
260 mlalma/kokoro-ios

Kokoro TTS for iOS and macOSX

54
Established
261 MattyB95/Jabberjay

🦜 Synthetic Voice Detection

54
Established
262 Aivis-Project/aivmlib

Aivis Voice Model File (.aivm/.aivmx) Utility Library

54
Established
263 DevEmperor/Dictate

A powerful Whisper AI keyboard for reliable speech transcription

54
Established
264 hs-CN/msedge-tts

This library is a wrapper of MSEdge Read aloud function API. You can use it...

54
Established
265 VolcanicArts/VRCOSC

A modular node-programming language, program creator, animation system,...

54
Established
266 evancohen/sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword...

54
Established
267 stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model...

54
Established
268 shivammehta25/Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

54
Established
269 jtCodes/lyrictor

Browser-based lyric video editor built for complex timelines with hundreds...

54
Established
270 Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

54
Established
271 mgonzs13/whisper_ros

Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2

54
Established
272 ArkanDash/Advanced-RVC-Inference

Advanced RVC Inference for quicker and effortless model downloads

54
Established
273 stemrollerapp/stemroller

Isolate vocals, drums, bass, and other instrumental stems from any song

54
Established
274 lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

54
Established
275 ynop/audiomate

Python library for handling audio datasets.

54
Established
276 HumeAI/hume-typescript-sdk

Add Hume AI to any TypeScript project

54
Established
277 Oaklight/asr2clip

Handy CLI tool to convert your speech to clipboard text

54
Established
278 mateogon/pdf-narrator

Convert your PDFs and EPUBs into audiobooks effortlessly. Features...

54
Established
279 met4citizen/HeadTTS

HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for...

54
Established
280 jpreprocess/jpreprocess

Japanese text preprocessor for Text-to-Speech applications (OpenJTalk...

54
Established
281 funnyzak/tts-now

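A cross-platform text-to-speech assistant built on cloud speech synthesis APIs (Alibaba Cloud, iFlytek, etc.). Supports quick single-text synthesis and batch synthesis. Runs on Windows, macOS, and Linux.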

54
Established
282 netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

54
Established
283 Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to...

54
Established
284 LokerL/tts-vue

🎤 A Microsoft speech synthesis tool built with Electron + Vue + ElementPlus + Vite.

54
Established
285 EddyVerbruggen/nativescript-speech-recognition

💬 Speech to text, using the awesome engines readily available...

54
Established
286 chinokikiss/GSV-TTS-Lite

GSV-TTS-Lite A high-performance inference engine specifically designed for...

54
Established
287 emnikhil/Sign-Language-To-Text-Conversion

Sign Language to Text Conversion is a real-time system that uses a camera to...

53
Established
288 jpreprocess/jbonsai

Voice synthesis library for Text-to-Speech applications (Currently HTS...

53
Established
289 Lex-au/Orpheus-FastAPI

High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,...

53
Established
290 alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi...

53
Established
291 hgneng/ekho

Chinese text-to-speech engine

53
Established
292 thewh1teagle/pyannote-rs

pyannote audio diarization in rust

53
Established
293 jianchang512/ChatTTS-ui

A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use...

53
Established
294 Henry-23/VideoChat

Real-time interactive digital human with customizable appearance and voice, voice cloning support, and conversation latency as low as 3 s...

53
Established
295 drmfinlay/tts-util-app

TTS Util — Text-to-speech utility Android app for synthesising text into...

53
Established
296 IhorShevchuk/piper-app

The original Piper, now on iOS and macOS

53
Established
297 LibreSpark/LibreTTS

TTS: a text-to-speech frontend compatible with OpenAI, EdgeTTS, and other interfaces

53
Established
298 wxxxcxx/ms-ra-forwarder

Free online text-to-speech API

53
Established
299 Notely-Voice/NotelyVoice

A 100% private AI voice transcription app that converts speech to text in...

53
Established
300 rzru/nightingale

Machine learning powered Karaoke app (with scores!)

53
Established