All Voice AI Tools
8,165 tools ranked by quality score · Page 15 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1401 |
tihu-nlp/tihu
Persian Text-To-Speech |
|
Emerging |
| 1402 |
markokosticdev/cloud_text_to_speech_flutter
Single interface to Google, Microsoft, and Amazon Text-To-Speech. |
|
Emerging |
| 1403 |
orange2ai/youtube-subtitle-translator
🌐 Real-time YouTube subtitle translator browser extension. Translate... |
|
Emerging |
| 1404 |
rudrankriyam/Glosik
Sample project for F5-TTS using MLX Swift |
|
Emerging |
| 1405 |
lucko515/speech-recognition-neural-network
This is the end-to-end Speech Recognition neural network, deployed in Keras.... |
|
Emerging |
| 1406 |
cameronking4/VapiBlocks
Vapi Blocks is a library of components & api snips to copy and paste into... |
|
Emerging |
| 1407 |
Lunarien/Lunariens-Mental-Math-Trainer
Mental math trainer made in C#. |
|
Emerging |
| 1408 |
holm-aune-bachelor2018/ctc
Speech recognition with CTC in Keras with Tensorflow backend |
|
Emerging |
| 1409 |
AryanVBW/AiVoiceClonerPRO
Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into... |
|
Emerging |
| 1410 |
Emotional-Text-to-Speech/hmm-for-emo-tts
:computer: A repository with comprehensive instructions for using the... |
|
Emerging |
| 1411 |
declare-lab/speech-adapters
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient... |
|
Emerging |
| 1412 |
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and... |
|
Emerging |
| 1413 |
Kini218/speech-to-text
Speech to text script on python |
|
Emerging |
| 1414 |
alias454/YATSEE
YATSEE - Yet Another Tool for Speech Extraction & Enrichment |
|
Emerging |
| 1415 |
MHaggis/ASRGEN
ASR Configurator, Essentials and Atomic Testing |
|
Emerging |
| 1416 |
nl8590687/ASRT_SDK_Python3
ASRT语音识别系统的Python版SDK |
|
Emerging |
| 1417 |
1038lab/ComfyUI-SparkTTS
ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an... |
|
Emerging |
| 1418 |
Dostoyewski/django_voice_bot
Package for django onpage support bot with speech recognition and voice commands |
|
Emerging |
| 1419 |
iBrammm/qwen-asr
🎙️ Implement fast, dependency-free C inference for Qwen3-ASR speech-to-text... |
|
Emerging |
| 1420 |
yl4579/HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter... |
|
Emerging |
| 1421 |
titilambert/pynuance
Wrapper for Nuance Communications services |
|
Emerging |
| 1422 |
Andrewcpu/elevenlabs-api
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs... |
|
Emerging |
| 1423 |
Frikallo/parakeet.cpp
Ultra fast and portable Parakeet implementation for on-device inference in... |
|
Emerging |
| 1424 |
tktcorporation/discord-tts-bot
A discord bot to use tts in your voice channel. |
|
Emerging |
| 1425 |
janewu77/ela-extension
English Learner Assistant |
|
Emerging |
| 1426 |
1neReality/MITSUHA
World's First Multilingual Inexpensive Therapeutic Sophisticated... |
|
Emerging |
| 1427 |
bhattbhavesh91/wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on wav2vec 2.0 framework... |
|
Emerging |
| 1428 |
kokimame/joytan
Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel |
|
Emerging |
| 1429 |
serpapps/ai-voice-cloner
AI Voice Cloning Desktop Application that runs locally on your computer and... |
|
Emerging |
| 1430 |
ssssssilver/sherpa-ncnn-unity
在Unity环境下,借助sherpa-ncnn框架,实现实时并准确的中英双语语音识别功能。 |
|
Emerging |
| 1431 |
Kaljurand/Arvutaja
An Android app for voice actions in Estonian and English |
|
Emerging |
| 1432 |
quangvu3/coqui-xtts
Coqui XTTS model with Vietnamese added |
|
Emerging |
| 1433 |
yzfly/awesome-voice-agents
A curated list of voice AI agent frameworks, tools, resources, and best practices |
|
Emerging |
| 1434 |
zhangzijie-pro/Speaker-Verification
Dual-model speech AI toolkit for speaker verification and speaker-aware... |
|
Emerging |
| 1435 |
pika-online/AESRC2020
a deep accent recognition network |
|
Emerging |
| 1436 |
zeropointnine/tts-audiobook-tool
Audiobook creation tool with support for multiple TTS models (Qwen3-TTS,... |
|
Emerging |
| 1437 |
Edw590/VISOR---A-Voice-Assistant
V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory! |
|
Emerging |
| 1438 |
CodeBySonu95/VoxSherpa-TTS
🎙️ VoxSherpa TTS Offline Neural Text-to-Speech Engine for Android ⚡... |
|
Emerging |
| 1439 |
renorari/VoiceJP-Discord
A discord-app can text-to-speech and speech-to-text |
|
Emerging |
| 1440 |
TETYYS/SAPI4
Web interface for Microsoft Sam & friends |
|
Emerging |
| 1441 |
mattmireles/kokoro-coreml
PyTorch → CoreML conversion pipeline for Kokoro TTS. Unlocks fast on-device... |
|
Emerging |
| 1442 |
mapluisch/OpenAI-Realtime-API-for-Unity
Implementation of OpenAI's Realtime API in Unity. Easily integrate... |
|
Emerging |
| 1443 |
shenbengit/TTSTool
科大讯飞离线语音,Text to Speech,TTS |
|
Emerging |
| 1444 |
aditya-an1l/RILearn
Reinventing Reading with a touch of Interactivity aided Learning |
|
Emerging |
| 1445 |
leprosus/golang-tts
Text-to-Speach golang package based in Amazon Polly service |
|
Emerging |
| 1446 |
cherts/mspeech
Program for speech recognition using the Google Speech API, voice commands,... |
|
Emerging |
| 1447 |
nithincvpoyyil/voice-listener
An reusable angular component for voice based input using web speech API |
|
Emerging |
| 1448 |
aboda-dirbas/whisperclip
🎤 Enhance your voice-to-text transcriptions with WhisperClip, prioritizing... |
|
Emerging |
| 1449 |
Renovamen/Speech-and-Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech... |
|
Emerging |
| 1450 |
antifield/vmt
Discord App for Transcribing & Translating Voice Messages |
|
Emerging |
| 1451 |
smaranjitghose/AIAudioTranscriber
A minimalistic web app to generate transciption for audio built using Python |
|
Emerging |
| 1452 |
N6UDP/SteamDiscordTTSBot
A steam chat to Discord TTS bridge |
|
Emerging |
| 1453 |
deepgram-starters/php-transcription
Get started using Deepgram's speech-to-text with this PHP demo app |
|
Emerging |
| 1454 |
doveg/whisper-real-time
A real time offline transcriber with gui, based on OpenAI whisper |
|
Emerging |
| 1455 |
rishikksh20/gmvae_tacotron
Gaussian Mixture VAE Tacotron |
|
Emerging |
| 1456 |
EndlessReform/fish-speech.rs
A Fish Speech implementation in Rust, with Candle.rs |
|
Emerging |
| 1457 |
gillesdemey/google-speech-v2
:speech_balloon: Reverse Engineering Google's Speech To Text API (v2) |
|
Emerging |
| 1458 |
mramshaw/Speech-Recognition
Speech recognition with Python |
|
Emerging |
| 1459 |
yapit-tts/yapit
Listen to anything. TTS for documents, papers, and web pages. |
|
Emerging |
| 1460 |
PhilippeRo/IBus-Speech-To-Text
A speech to text IBus engine using VOSK |
|
Emerging |
| 1461 |
rishikksh20/Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder |
|
Emerging |
| 1462 |
Alex-Tremayne/LaTeXt
Python package for converting LaTeX to text which can be read by text to... |
|
Emerging |
| 1463 |
Harshit-shrivastav/TikTok-TTS-Bot
A python TikTok Text to speech generator telegram bot. |
|
Emerging |
| 1464 |
jing332/tts-server-android
这是一个Android系统TTS应用,内置微软演示接口,可自定义HTTP请求,可导入其他本地TTS引擎,以及根据中文双引号的简单旁白/对话识别朗读... |
|
Emerging |
| 1465 |
saurabhdaware/bol
Slightly more consistent Text-to-speech for Web and a wrapper around speechSynthesis |
|
Emerging |
| 1466 |
danielclough/vibevoice-rs
Rust implementation of VibeVoice text-to-speech with voice cloning and... |
|
Emerging |
| 1467 |
ehtisham91/Django-Speech-to-text-Chat
This App allows users to convert their speech into text and send that text... |
|
Emerging |
| 1468 |
0xPD33/sonori
Sonori is a fully local STT app for Linux (Wayland). |
|
Emerging |
| 1469 |
gheyret/UQSpeechDataset
Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット |
|
Emerging |
| 1470 |
izwi-ai/izwi
On-device AI engine for transcription, TTS, and voice workflows. |
|
Emerging |
| 1471 |
Nighthawk42/mOrpheus
Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant. |
|
Emerging |
| 1472 |
aws-samples/sample-voicebot-nova-sonic
A sample implementation of real-time voice assistant using Amazon Nova 2... |
|
Emerging |
| 1473 |
dsi-icl/do-voice-interaction
The goal of this project is to provide a voice assistant to the Data... |
|
Emerging |
| 1474 |
kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End... |
|
Emerging |
| 1475 |
bgArray/ZhiYin
知音 - AI音频听觉功能集成软件。提供声乐技术识别分析、伴奏分离等伴奏多种工具。 |
|
Emerging |
| 1476 |
Labmem-Zhouyx/CDFSE_FastSpeech2
The Official Implementation of “Content-Dependent Fine-Grained Speaker... |
|
Emerging |
| 1477 |
speechly/speechly
Client libraries, examples and demos of Speechly API for the Web. |
|
Emerging |
| 1478 |
domesticatedviking/TextyMcSpeechy
Easily create Piper text-to-speech models in any voice. Make a... |
|
Emerging |
| 1479 |
thinh-vu/ur_audio_sub
Generate text captions for audio files & youtube video using OpenAI Whisper... |
|
Emerging |
| 1480 |
lucascamillomd/anki-tts
A free, open-source app for Anki text-to-speech in MacOS. |
|
Emerging |
| 1481 |
tugstugi/mongolian-speech-recognition
Mongolian speech recognition with PyTorch |
|
Emerging |
| 1482 |
loretoparisi/wave2vec-recognize-docker
Wave2vec 2.0 Recognize pipeline |
|
Emerging |
| 1483 |
Baidu-AIP/speech-tts-cors
百度语音 语音合成 跨域demo以及支持库 |
|
Emerging |
| 1484 |
HeyHeyChicken/NOVA-Python
NOVA is a customizable voice assistant made with Python. |
|
Emerging |
| 1485 |
mmpneo/curses
Speech to Text and KB input captions for OBS, VRChat, Twitch chat and Discord |
|
Emerging |
| 1486 |
Umbaji/NMTMD
Official repository for the Opensource Textdataset for NMT for local langues... |
|
Emerging |
| 1487 |
ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent
A lightweight voice companion, optimized for macOS. |
|
Emerging |
| 1488 |
p1an-lin-jung/teochew-g2p
这是一个潮州话文本端的处理工具和正字标准,主要为潮州方言的语音合成服务 |
|
Emerging |
| 1489 |
FR33TR1ST/VoiceAssistant
A VoiceAsistant with WhisperAI speech recognition |
|
Emerging |
| 1490 |
wwdok/faster-whisper-webui-cn
Cloned from https://huggingface.co/spaces/aadnk/faster-whisper-webui, and... |
|
Emerging |
| 1491 |
tsensei/OpenReels
Open-source AI pipeline that turns any topic into a fully rendered... |
|
Emerging |
| 1492 |
yui-mhcp/text_to_speech
(Multi Speaker) Text-To-Speech (TTS) project |
|
Emerging |
| 1493 |
ritazh/EchoML
🔉 A web app to play, visualize, and annotate your audio files for machine learning |
|
Emerging |
| 1494 |
ahaocd/davinci-voice-clone
DaVinci Subtitle Alignment + Voice Clone + AI Emotion Optimization | CosyVoice2 TTS |
|
Emerging |
| 1495 |
eellak/gsoc2021-audio-annotation-tool
Creation of a multi user audio first annotation tool - GSoC 2021 |
|
Emerging |
| 1496 |
small-cactus/Jarvis-ChatGPT-VoiceAssistant
Jarvis powered by GPT-3.5/GPT-4 |
|
Emerging |
| 1497 |
ibm-self-serve-assets/Watson-Speech
This collection demonstrates how to help you to quickly embed Watson Speech... |
|
Emerging |
| 1498 |
maum-ai/wavegrad2
Unofficial Pytorch Implementation of WaveGrad2 |
|
Emerging |
| 1499 |
carleeno/elevenlabs_tts
Custom TTS Integration using ElevenLabs API |
|
Emerging |
| 1500 |
awslabs/speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) |
|
Emerging |