All Voice AI Tools
8,165 tools ranked by quality score · Page 16 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1501 |
HurroWorld/text-to-audio2face
Web interface to convert text to speech and route it to an Audio2Face... |
|
Emerging |
| 1502 |
hwRG/End-to-End-TTS-Fine-Tune
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis. |
|
Emerging |
| 1503 |
qforge-dev/qspeak
qSpeak is a powerful voice transcription and AI assistant tool that helps... |
|
Emerging |
| 1504 |
definitio/ha-rhvoice
Home Assistant integration for RHVoice - a local text-to-speech engine. |
|
Emerging |
| 1505 |
jimbobbennett/SpeechToTextSamples
Sample code showing how to use the Azure Speech to Text service from Python 🗣 |
|
Emerging |
| 1506 |
henryhale/ttspeech
🔊 A fully basic voice synthesizer in vanillaJS |
|
Emerging |
| 1507 |
tianbot/rosecho
Tianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用 |
|
Emerging |
| 1508 |
inboxpraveen/Speech-Annotation-Tool
Review, correct, and export ASR transcripts at scale. Web-based ASR accuracy... |
|
Emerging |
| 1509 |
oscie57/tiktok-voice
Simple Python script to interact with the TikTok TTS API |
|
Emerging |
| 1510 |
RafalWilinski/serverless-medium-text-to-speech
🔊 Serverless-based, text-to-speech service for Medium articles |
|
Emerging |
| 1511 |
QiBowen2008/SuperTextToolBox
一个免费的文字处理工具箱 |
|
Emerging |
| 1512 |
SadeghKrmi/pertts-streamlit
Persian text-to-speech streamlit interface |
|
Emerging |
| 1513 |
Saganaki22/ComfyUI-KittenTTS
😻 A simple ComfyUI custom node for KittenTTS - an ultra-lightweight... |
|
Emerging |
| 1514 |
gladchinda/web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the... |
|
Emerging |
| 1515 |
MicheleYin/misaki-rs
Rust port of Misaki |
|
Emerging |
| 1516 |
HerbertHe/edge-tts-server
Server for edge-tts |
|
Emerging |
| 1517 |
jscrane/TTS
Arduino Text-to-Speech Library |
|
Emerging |
| 1518 |
kaushiknishchay/ComfyUI-Qwen3-ASR
ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports... |
|
Emerging |
| 1519 |
lucasnewman/vocos-mlx
Implementation of 'Vocos: Closing the gap between time-domain and... |
|
Emerging |
| 1520 |
IceFog72/pocket-tts-openapi
Fast, local, OpenAI-compatible TTS server with voice cloning support powered... |
|
Emerging |
| 1521 |
coqui-ai/STT-models
Open models for Coqui STT |
|
Emerging |
| 1522 |
soundhound/houndify-sdk-go
The official Houndify SDK for Go |
|
Emerging |
| 1523 |
satyam9090/Automatic-Indian-Sign-Language-Translator-ISL
I created an application which takes in live speech or audio recording as... |
|
Emerging |
| 1524 |
nerdaxic/glados-voice-assistant
DIY Voice Assistant based on the GLaDOS character from Portal video game... |
|
Emerging |
| 1525 |
saadbutt32/Conversion-of-Pakistan-Sign-Languag-into-Text-and-Speech-using-OpenPose-and-Machine-Learning
Real-time translation of Pakistan sign language into text and speech using... |
|
Emerging |
| 1526 |
naschorr/hawking
The retro text-to-speech bot for Discord |
|
Emerging |
| 1527 |
RoySheffer/im2wav
Official implementation of the pipeline presented in I hear your true... |
|
Emerging |
| 1528 |
AEmotionStudio/ComfyUI-FFMPEGA
Intelligent FFMPEG agent node for ComfyUI - transforms natural language... |
|
Emerging |
| 1529 |
akinsella/yt-transcript-rs
🎬️ A Rust library for accessing YouTube Video Infos & Transcripts |
|
Emerging |
| 1530 |
AndroidMaryTTS/AndroidMaryTTS
Android MARY TTS - an open-source, offline HMM-Based text-to-speech... |
|
Emerging |
| 1531 |
RapidAI/RapidTTS
A cross platform implementation of Text-to-Speech based on ONNXRuntime. |
|
Emerging |
| 1532 |
PhuocElec/zipformer-asr-api
REST-API implementation of ZipFormer for automatic speech recognition (ASR)... |
|
Emerging |
| 1533 |
moeru-ai/ortts
𖣘🔊 Simple and Easy-to-use local TTS inference server, Powered by ONNX Runtime |
|
Emerging |
| 1534 |
myuan19/voiceInput
Windows AI 语音输入🎙 — 按快捷键说话即输入,支持润色。摆脱打字限制,实现无拘束、高效率的表达。 |
|
Emerging |
| 1535 |
dmatekenya/Chichewa-Speech2Text
Automated Speech Recognition for Chichewa. |
|
Emerging |
| 1536 |
CoffeeMethod/KokoroGUI
An advanced TTS software, built for audiobooks, podcasts, videos, and more. |
|
Emerging |
| 1537 |
keonlee9420/Robust_Fine_Grained_Prosody_Control
PyTorch Implementation of Robust and fine-grained prosody control of... |
|
Emerging |
| 1538 |
skshadan/WhisCall
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo,... |
|
Emerging |
| 1539 |
speechio/BigCiDian
Pronunciation lexicon covering both English and Chinese languages for... |
|
Emerging |
| 1540 |
mapluisch/OpenAI-Text-To-Speech-for-Unity
Implementation of OpenAI's Text-To-Speech in Unity. Synthesize any text and... |
|
Emerging |
| 1541 |
rapidaai/rapida-go
Open-source Golang SDK for Rapida to build real-time, observable Voice AI... |
|
Emerging |
| 1542 |
robmsmt/ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR... |
|
Emerging |
| 1543 |
soupslurpr/Transcribro
Private and on-device speech recognition keyboard and service for Android. |
|
Emerging |
| 1544 |
Hritikraj8804/Autotube
🤖 Automated YouTube Shorts creation using n8n, AI script generation, and... |
|
Emerging |
| 1545 |
foamliu/Listen-Attend-Spell-v2
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR). |
|
Emerging |
| 1546 |
eellak/gsoc2019-sphinx
Creation of an online Greek mail dictation system, using Sphinx and... |
|
Emerging |
| 1547 |
FaceOnLive/Spleeter-Android-iOS
On-device, Offline Spleeter Solution For Mobile |
|
Emerging |
| 1548 |
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and... |
|
Emerging |
| 1549 |
zw76859420/ASR_Syllable
基于卷积神经网络的语音识别声学模型的研究 |
|
Emerging |
| 1550 |
pymike00/YouTube-Tutorials
:open_file_folder: Source Code for (some of) the Programming Tutorials from... |
|
Emerging |
| 1551 |
alan890104/sumi
Sumi — Free, open-source voice dictation for macOS. Local-first Whisper +... |
|
Emerging |
| 1552 |
hcy71o/SNAC
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine... |
|
Emerging |
| 1553 |
atakanakin/TutunSabri
He is not our hero. He is a silent guardian. A watchful protector. |
|
Emerging |
| 1554 |
Warma10032/easytts
打造最简单的TTS前端集合,最简单的有声小说制作工作流。基于正则规则对小说进行分句,基于RoBERTa对小说中的对话进行说话人识别,从而实现一键式生成多人... |
|
Emerging |
| 1555 |
zh217/torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch |
|
Emerging |
| 1556 |
tristan-mcinnis/Multimodal-voice-assistant
This project is a multi-modal AI voice assistant that uses LM Studio, OpenAI... |
|
Emerging |
| 1557 |
Igorcbraz/Calculadora
📐 Calculadora simples e intuitiva com suporte a comandos de voz e temas... |
|
Emerging |
| 1558 |
apluka34/Bud500
Bud500: A Comprehensive Vietnamese ASR Dataset |
|
Emerging |
| 1559 |
WeiChiaChang/happy-halloween
🗣 Say "happy halloween" to your browser 🎃 |
|
Emerging |
| 1560 |
markmiddo/synthia
AI-powered voice assistant that respects your privacy. Control your desktop,... |
|
Emerging |
| 1561 |
FedericaPaoli1/stm32-speech-recognition-and-traduction
stm32-speech-recognition-and-traduction is a project developed for the... |
|
Emerging |
| 1562 |
marytts/gradle-marytts-voicebuilding-plugin
A replacement for the legacy VoiceImportTools in MaryTTS |
|
Emerging |
| 1563 |
lokkelvin2/tacotron2-tts-GUI
Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom... |
|
Emerging |
| 1564 |
AcTePuKc/Kokoro-Local-Gui
Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included. |
|
Emerging |
| 1565 |
grebtsew/Text_To_Speech_Server_Node
A super simple speaking server node that receives requests and reads them... |
|
Emerging |
| 1566 |
Allan-Nava/fakeyou.go
A powerful golang sdk library for interacting with the FakeYouAPI easily |
|
Emerging |
| 1567 |
Jdreioe/Wingmate
A project to make people who cannot speak, speak! |
|
Emerging |
| 1568 |
vkosuri/dialogflow-lite
[Maintainer Required] A light-weight python library REST agent for Dialogflow |
|
Emerging |
| 1569 |
yeyupiaoling/VITS-Pytorch
本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,... |
|
Emerging |
| 1570 |
user3301/ssml_builder
:sound: a general SSML(Speech Synthesis Markup Language) builder |
|
Emerging |
| 1571 |
sunshine0523/MNNServer
A third-party MNN server supporting external calls, embedding model, TTS... |
|
Emerging |
| 1572 |
pschatzmann/arduino-espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than... |
|
Emerging |
| 1573 |
FlooferLand/ttvoice-mod
A Minecraft mod that lets you type to speak! |
|
Emerging |
| 1574 |
shahules786/mayavoz
Pytorch based speech enhancement toolkit. |
|
Emerging |
| 1575 |
daanzu/speech-training-recorder
Simple GUI application to help record audio dictated from given text... |
|
Emerging |
| 1576 |
maum-ai/nuwave2
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling... |
|
Emerging |
| 1577 |
ShadowForests/VoiceToSpeech
Live speech recognition to synthesized speech with hundreds of voices, TTS,... |
|
Emerging |
| 1578 |
sophiefy/StellaVoiceChanger
Deep-learning-based voice changer, supporting local inference. |
|
Emerging |
| 1579 |
weimeng23/speech-recognition-learning-resources
:white_check_mark: A list of speech recognition learning resources including... |
|
Emerging |
| 1580 |
felivalencia3/RealVoiceGPT
RealVoiceGPT is a web application that lets you have voice conversations... |
|
Emerging |
| 1581 |
itspyguru/Tkinter-Applications
A collection of small tkinter apps made by me |
|
Emerging |
| 1582 |
Adamiito0909/mlx-swift-audio
🎤 Enhance your apps with MLX Swift Audio, offering robust text-to-speech and... |
|
Emerging |
| 1583 |
reybahl/Assistant
A machine learning powered, voice-based virtual assistant for Raspberry Pi.... |
|
Emerging |
| 1584 |
smx-smx/KodiSharp
Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono |
|
Emerging |
| 1585 |
1ytic/pytorch-edit-distance
Levenshtein edit-distance on PyTorch and CUDA |
|
Emerging |
| 1586 |
MattePalte/Verbify-TTS
Simple and free Text-to-Speech (TTS) engine that reads for you any text on... |
|
Emerging |
| 1587 |
aks-devs/mod_google_asr
Freeswitch Speech-to-Text module |
|
Emerging |
| 1588 |
TeaPoly/Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena. |
|
Emerging |
| 1589 |
andi611/TTS-Tacotron-Pytorch
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative... |
|
Emerging |
| 1590 |
pviotti/sayit
A text-to-speech command line tool backed by Azure Cognitive Services. |
|
Emerging |
| 1591 |
hyeonsangjeon/computing-Korean-STT-error-rates
STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지 |
|
Emerging |
| 1592 |
LetovKai/call-translator
Real-time voice translator for video calls. Speak your language on Google... |
|
Emerging |
| 1593 |
TigreGotico/phoonnx
A Python library for multilingual phonemization and Text-to-Speech (TTS)... |
|
Emerging |
| 1594 |
shi-gg/Auditional-Text
The source code of the Auditional Text discord Boat |
|
Emerging |
| 1595 |
double22a/asr_nlp_paper_code
Papers of ASR, Tools of ASR |
|
Emerging |
| 1596 |
johunsang/octo-captures
화면 녹화의 모든 것 — Auto Zoom, 아바타, 음성 변조, BGM, 타임라인 편집을 지원하는 무료 오픈소스 macOS 앱.... |
|
Emerging |
| 1597 |
racai-ai/RobinASR
Romanian Automatic Speech Recognition from the ROBIN project |
|
Emerging |
| 1598 |
abus-aikorea/kara-audio
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports... |
|
Emerging |
| 1599 |
bnsantoso/sub-to-audio
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS... |
|
Emerging |
| 1600 |
dusty-nv/jetson-voice
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch... |
|
Emerging |