All Voice AI Tools
8,165 tools ranked by quality score · Page 14 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1301 |
rafaballerini/AssistentePessoal
Assistente pessoal virtual desenvolvida com Python 🤖 |
|
Emerging |
| 1302 |
repodiac/german_transliterate
Python module to clean and transliterate (i.e. normalize) German text... |
|
Emerging |
| 1303 |
lancejames221b/jarvis-voice
OpenJarvis — Real-time AI voice assistant for Discord. Talk to the same... |
|
Emerging |
| 1304 |
ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 ,... |
|
Emerging |
| 1305 |
atomicoo/PTTS-WebAPP
Parallel TTS web demo based on Flask + Vue (Vuetify). 基于 Flask + Vue 的语音合成单网页演示项目。 |
|
Emerging |
| 1306 |
Skeli010/GaryTTS
强大免费的本地文本转语音软件 |
|
Emerging |
| 1307 |
puff-dayo/Kokoro-82M-Android
A minimal Android demo app for Kokoro-TTS |
|
Emerging |
| 1308 |
NateRickard/Xamarin.Cognitive.Speech
A client library that makes it easy to work with the Microsoft Cognitive... |
|
Emerging |
| 1309 |
sksalahuddin2828/AI_Personal_Digital_Assistant
AI Personal Voice Assistant Project (Male - Female version) |
|
Emerging |
| 1310 |
Youdef20/voxtral.c
🔊 Streamline audio processing with Voxtral.c, a pure C implementation for... |
|
Emerging |
| 1311 |
aahl/qwen-tts2api
🗣️ Qwen TTS to OpenAI Speech API |
|
Emerging |
| 1312 |
wq2012/SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家 |
|
Emerging |
| 1313 |
tikhonp/yandex-speechkit-lib-python
Python SDK for Yandex Speechkit API. |
|
Emerging |
| 1314 |
BlinkTagInc/gtfs-tts
Review GTFS stop pronunciations to determine which stops need a tts_stop_name value. |
|
Emerging |
| 1315 |
scart97/thunder-speech
A Hackable speech recognition library. |
|
Emerging |
| 1316 |
showlab/whisperVideo
Find out who said what in the video. |
|
Emerging |
| 1317 |
PyThaiNLP/tts-thai
Thai TTS |
|
Emerging |
| 1318 |
googlecreativelab/obvi
A Polymer 3+ webcomponent / button for doing speech recognition |
|
Emerging |
| 1319 |
twilio-labs/sample-autopilot-voice-ivr
Voice-Powered IVR Chatbot with Autopilot |
|
Emerging |
| 1320 |
ErcinDedeoglu/WhisperDock
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid... |
|
Emerging |
| 1321 |
SteTR/Emost-Bot
Discord Music Bot using Voice Recognition to receive commands. |
|
Emerging |
| 1322 |
kamiazya/ngx-speech-recognition
Angular 5+ speech recognition service (based on browser implementation such... |
|
Emerging |
| 1323 |
jordicor/santa-claus-is-calling
A magical Christmas experience where Santa Claus (AI with Santa's voice)... |
|
Emerging |
| 1324 |
hcy71o/AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation... |
|
Emerging |
| 1325 |
nipponjo/tts_arabic
🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python... |
|
Emerging |
| 1326 |
everydaycodings/MimicMania
MimicMania is a web application that allows you to generate speech and clone... |
|
Emerging |
| 1327 |
linagora-labs/ssak
SSAK contains helpers and tools to process data and train/infer ASR models. |
|
Emerging |
| 1328 |
kristofferv98/VoiceProcessingToolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for... |
|
Emerging |
| 1329 |
ringger/transcribe-critic
Multi-source transcript merging inspired by textual criticism — LLM... |
|
Emerging |
| 1330 |
WilleIshere/SimplerKokoro
A Python package that makes it easy to use the Kokoro voice synthesis library. |
|
Emerging |
| 1331 |
huckiyang/Voice2Series-Reprogramming
ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time... |
|
Emerging |
| 1332 |
AkojimaSLP/Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR |
|
Emerging |
| 1333 |
gittyeric/FAlexa
Create your own verbal commands that fuzzily map to custom Javascript /... |
|
Emerging |
| 1334 |
book000/audio-transcriber-docker
Automatically transcribe the audio of video / audio files using Speech Recognition. |
|
Emerging |
| 1335 |
jing332/tts-server-go
微软TTS服务转发,以便在阅读APP中通过网络导入方式收听微软TTS / Edge大声朗读 |
|
Emerging |
| 1336 |
Saganaki22/ComfyUI-Step_Audio_EditX_TTS
ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice... |
|
Emerging |
| 1337 |
gianpaj/sexyvoice
Voice Cloning, Voice Call and Text to Speech platform. Perfect for content... |
|
Emerging |
| 1338 |
CoffeeVampir3/audiocraft-webui
Quick webui for audiocraft |
|
Emerging |
| 1339 |
seven-io/net-client
Official .NET API Client for seven |
|
Emerging |
| 1340 |
nabz0r/mac-local-translator
Local translation app for Mac using speech recognition and offline translation |
|
Emerging |
| 1341 |
mostafa-kermaninia/speech-processing-toolkit
A comprehensive machine learning pipeline for robust Speaker Identification... |
|
Emerging |
| 1342 |
sotelo/parrot
RNN-based generative models for speech. |
|
Emerging |
| 1343 |
TeamAudio/reaspeech
Speech recognition for REAPER |
|
Emerging |
| 1344 |
bishop-ai/bishop-ai
Voice and text virtual assistant |
|
Emerging |
| 1345 |
Lastorder-DC/chatreader-kor
채팅 읽어주는 로봇 |
|
Emerging |
| 1346 |
spokestack/spokestack-ios
Spokestack: give your iOS app a voice interface! |
|
Emerging |
| 1347 |
HenestrosaDev/audiotext
A desktop application that transcribes audio from files, microphone input or... |
|
Emerging |
| 1348 |
jianchang512/fireredasr-ui
一个中文语音转文字项目,封装自FireRedASR |
|
Emerging |
| 1349 |
WangHelin1997/SSR-Speech
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis |
|
Emerging |
| 1350 |
COBACOBAINI/vibe
Transcribe audio and video offline with OpenAI Whisper on your device,... |
|
Emerging |
| 1351 |
hubendubler/gTTS.js
A Promise based Node.js/TypeScript port of the gTTS Google-Text-To-Speech... |
|
Emerging |
| 1352 |
FontaineRiant/wrAIter
AI writing assistant with voiced narrator and characters and an illustrator |
|
Emerging |
| 1353 |
JasonLovesDoggo/Flow
Native MacOS dictation that captures audio, transcribes speech, and formats... |
|
Emerging |
| 1354 |
DeeepMaker/subtitle-to-audio
A python script to generate .wav audio files for .srt subtitle files |
|
Emerging |
| 1355 |
alsrb0607/KoreanSTT
kospeech를 활용한 한국어 음성 인식 모델 개발 |
|
Emerging |
| 1356 |
MikeyParton/react-speech-kit
React hooks for Speech Recognition and Speech Synthesis |
|
Emerging |
| 1357 |
botbahlul/pyvosklivesubtitle
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23... |
|
Emerging |
| 1358 |
botbahlul/VOSK-Powered-Live-Subtitle-V3
ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free... |
|
Emerging |
| 1359 |
OwenEdwards/videojs-speak-descriptions-track
A Video.js 7 middleware that uses browser speech synthesis to speak... |
|
Emerging |
| 1360 |
Johnson145/voxtral_wyoming
Offline Speech-to-Text (STT) service using Mistral's Voxtral model with... |
|
Emerging |
| 1361 |
gdoudeng/react-native-baidu-asr
The react-native Baidu voice library provides voice recognition, voice... |
|
Emerging |
| 1362 |
XimilalaXiang/DeLive
DeLive is a cross-platform desktop app that captures system audio output and... |
|
Emerging |
| 1363 |
OpenMOSS/MOSS-Audio-Tokenizer
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on... |
|
Emerging |
| 1364 |
georgezhao2010/apple_airplayer
Make your AirPlay devices as TTS speakers |
|
Emerging |
| 1365 |
totalvoice/totalvoice-php
Client em PHP para API da Totalvoice |
|
Emerging |
| 1366 |
MainRo/docker-deepspeech-server
A dockerfile to run deepspeech-server |
|
Emerging |
| 1367 |
aks-devs/mod_openai_asr
Freeswitch Speech-To-Text module |
|
Emerging |
| 1368 |
hhguo/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS |
|
Emerging |
| 1369 |
TartuNLP/text-to-speech-api
REST API for neural text-to-speech synthesis |
|
Emerging |
| 1370 |
finos/greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data |
|
Emerging |
| 1371 |
AIFSH/ComfyUI-FishSpeech
a custom comfyui node for fish-speech |
|
Emerging |
| 1372 |
OwenTyme/voice-zero
Collection of samples suitable for use with zero-shot text to speech engines. |
|
Emerging |
| 1373 |
revdotcom/reverb
Open source inference code for Rev's model |
|
Emerging |
| 1374 |
yxshee/speech-command-recognition
speech command recognition using CNNs, with preprocessing, model training,... |
|
Emerging |
| 1375 |
kapi2800/qwen3-tts-apple-silicon
Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning,... |
|
Emerging |
| 1376 |
kgnlp/allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to... |
|
Emerging |
| 1377 |
fqueis/pollinationsai
🔥 TypeScript SDK wrapper for Pollinations AI services |
|
Emerging |
| 1378 |
HectorPulido/chatbot-with-voice
Jarvis like chatbot with voice |
|
Emerging |
| 1379 |
rtzr/Awesome-Korean-Speech-Recognition
한국어 음성인식 STT API 리스트. 각 성능 벤치마크. |
|
Emerging |
| 1380 |
amitdev01/awesome-voice-ai
Awesome Voice Ai |
|
Emerging |
| 1381 |
petewarden/spchcat
Speech recognition tool to convert audio to text transcripts, for Linux and... |
|
Emerging |
| 1382 |
tuan3w/cnn_vocoder
A fast cnn-based vocoder |
|
Emerging |
| 1383 |
alamparelli/mcp-claude-say
Voice interaction for Claude Code - Talk to Claude and hear responses using... |
|
Emerging |
| 1384 |
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation |
|
Emerging |
| 1385 |
forfrt/SteerMoE
SteerMoE: Efficient Audio-Language Models with Preserved Reasoning Capabilities |
|
Emerging |
| 1386 |
Edw590/VISOR---Android-Version-Assistant
V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory! |
|
Emerging |
| 1387 |
mobassir94/comprehensive-bangla-tts
Aiming to achieve ultimate Multilingual TTS pipeline with main focus on... |
|
Emerging |
| 1388 |
dpm76/QuickRouteMap
Simple route guidance application. |
|
Emerging |
| 1389 |
18F/dol-whd-14c
The 14(c) system will become a modern, digital-first service. Applicants... |
|
Emerging |
| 1390 |
priyanujgogoi-28/flowery-tts
Wrapper of Flowery Text to Speech API for Dart |
|
Emerging |
| 1391 |
Yuan-ManX/audio-development-tools
Audio Development Tools (ADT) is a project for advancing sound, speech, and... |
|
Emerging |
| 1392 |
solaoi/lycoris
Real-time speech recognition & AI-powered note-taking app for macOS with... |
|
Emerging |
| 1393 |
arpy8/ESP32_Voice_Assistant
This project combines embedded system and AI inference to create an... |
|
Emerging |
| 1394 |
dsfsi/dsfsi-datasets
Official DSFSI Public Datasets Registry - Comprehensive catalog of 50+... |
|
Emerging |
| 1395 |
TheMorpheus407/OpenAI-Audiobook-Generator
This project is a web-based application that converts text into audio,... |
|
Emerging |
| 1396 |
TartuNLP/text-to-speech-worker
Estonian multi-speaker neural text-to-speech worker that processes requests... |
|
Emerging |
| 1397 |
Pranjalya/tts-tortoise-gradio
A Gradio setup for Tortoise TTS. |
|
Emerging |
| 1398 |
ardha27/AI-Waifu-Vtuber
AI Vtuber for Streaming on Youtube/Twitch |
|
Emerging |
| 1399 |
yeahhe365/PageTalk
一个简洁且优秀的描述是:这是一款在任何网页上实现无缝语音转文字的 Chrome 扩展,使用先进的 ASR API。 |
|
Emerging |
| 1400 |
JoelShine/Jarvis-v2.0
This is a major update of my project JARVIS-The-Ultimate-Project. You can... |
|
Emerging |