All Voice AI Tools
8,165 tools ranked by quality score · Page 11 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1001 |
shenasa-ai/speech2text
A Deep-Learning-Based Persian Speech Recognition System |
|
Emerging |
| 1002 |
maum-ai/assem-vc
Official Code for Assem-VC @ICASSP2022 |
|
Emerging |
| 1003 |
siva-sub/NekoSpeak
Private, offline AI Text-to-Speech for Android with Kokoro, KittenTTS,... |
|
Emerging |
| 1004 |
wangz-code/legado-edge-tts
edge大声朗读微软TTS服务, 在阅读legado中配置语音引擎方式收听微软TTS / Edge大声朗读, 如果没有 vps 部署可以看看阅读内置... |
|
Emerging |
| 1005 |
SynHub/syn-speech
Syn.Speech is a flexible speaker independent continuous speech recognition... |
|
Emerging |
| 1006 |
talin190/Qwen3-TTS-Daggr-UI
🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for... |
|
Emerging |
| 1007 |
husniadil/cc-hooks
Audio feedback plugin for Claude Code with TTS announcements, sound effects,... |
|
Emerging |
| 1008 |
jim-schwoebel/download_audioset
📁 This repo makes it easy to download the raw audio files from AudioSet... |
|
Emerging |
| 1009 |
DrDroidLab/voicesummary
Open Source AI Database for Voice Agent Transcripts | Call Analysis &... |
|
Emerging |
| 1010 |
OpenMOSS/MOSS-Speech
MOSS-Speech is a true speech-to-speech large language model without text guidance. |
|
Emerging |
| 1011 |
bookbot-kids/speech-recognizer-bahasa-indonesian
A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer... |
|
Emerging |
| 1012 |
cuinjune/text2video
A software tool that converts text to video for more engaging learning experience |
|
Emerging |
| 1013 |
yerfor/SyntaSpeech
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022;... |
|
Emerging |
| 1014 |
Pikurrot/whisper-gui
A simple GUI to use Whisper. |
|
Emerging |
| 1015 |
r0227n/flutter_whisper_kit
🎤 A Flutter plugin for running WhisperKit speech-to-text models on-device,... |
|
Emerging |
| 1016 |
drmfinlay/pyjsgf
JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python. |
|
Emerging |
| 1017 |
djmango/obsidian-transcription
Obsidian plugin to create high-quality transcriptions from markdown linked... |
|
Emerging |
| 1018 |
lucasnewman/f5-tts-swift
Implementation of F5-TTS in Swift using MLX |
|
Emerging |
| 1019 |
murf-ai/murf-python-sdk
Python sdk for Murf text to speech API |
|
Emerging |
| 1020 |
holgern/kokorog2p
A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS. |
|
Emerging |
| 1021 |
algolia/voice-overlay-android
🗣 An overlay that gets your user’s voice permission and input as text in a... |
|
Emerging |
| 1022 |
BernieTv/ElevenLabs-Clone
A self-hosted ElevenLabs clone for text-to-speech, voice conversion, and AI... |
|
Emerging |
| 1023 |
Candida18/Virtual-Assistance-For-The-Blind
The proposed Voice-based Email System uses AI (voice commands) that will... |
|
Emerging |
| 1024 |
nixonyh/UnityTTS
Text to Speech in Unity. |
|
Emerging |
| 1025 |
isaiahbjork/expo-kokoro-onnx
Run Kokoro TTS locally on device using Expo & ONNX Runtime |
|
Emerging |
| 1026 |
jpescada/TwitterPiBot
A Python based bot for Raspberry Pi that grabs tweets with a specific... |
|
Emerging |
| 1027 |
mozilla-ai/speech-to-text-finetune
Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language |
|
Emerging |
| 1028 |
rishikksh20/TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for... |
|
Emerging |
| 1029 |
zycv/awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting... |
|
Emerging |
| 1030 |
tonesto7/echo-speaks
Integrate your Amazon Echo devices into your Hubitat environment to create... |
|
Emerging |
| 1031 |
travisvn/edge-tts-extension
Chrome extension to generate free, high-quality text-to-speech using... |
|
Emerging |
| 1032 |
Amirrezahmi/Zozo-Assistant
Zozo Assistant is a voice-activated chatbot that performs tasks based on... |
|
Emerging |
| 1033 |
Berkeley-Speech-Group/sylber
Sylber: Syllabic Embedding Representation of Speech from Raw Audio |
|
Emerging |
| 1034 |
verbio-technologies/python-verbio-speech-center
Python integration with the Verbio Speech Center Cloud.... |
|
Emerging |
| 1035 |
kosich/rxjs-tts
RxJS wrapper for Text-to-Speech Web API |
|
Emerging |
| 1036 |
ttaoREtw/Tacotron-pytorch
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model |
|
Emerging |
| 1037 |
pufanyi/GenderRecognitionByVoice
NTU SC1015 Group Project - Gender Recognition by Voice |
|
Emerging |
| 1038 |
matteo-convertino/vosk-build-model
How to create your own model for vosk |
|
Emerging |
| 1039 |
hirofumi0810/asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition |
|
Emerging |
| 1040 |
apaar97/translate
Android app to translate text conversations, supporting 90+ languages with... |
|
Emerging |
| 1041 |
momysnow/Momy-Desk-Robot
Smart desktop robot. |
|
Emerging |
| 1042 |
CheshireCC/faster-whisper-GUI
faster_whisper GUI with PySide6 |
|
Emerging |
| 1043 |
m3hrdadfi/soxan
Wav2Vec for speech recognition, classification, and audio classification |
|
Emerging |
| 1044 |
Azure-Samples/sonic-brief
Sonic Brief Project is an Azure-based system that transcribes and... |
|
Emerging |
| 1045 |
JosefAlbers/e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX |
|
Emerging |
| 1046 |
seven-io/home-assistant
HACS supporting Home Assistant integration for seven |
|
Emerging |
| 1047 |
Aratako/MioTTS-Inference
Inference server for MioTTS, a lightweight and fast LLM-based TTS model. |
|
Emerging |
| 1048 |
resemble-ai/resemble-alexa
This is sample code for an Alexa skill that uses realistic voice cloning... |
|
Emerging |
| 1049 |
Justmalhar/open-audio
Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech... |
|
Emerging |
| 1050 |
n0th1ng-else/voice-to-text-bot
Telegram bot that converts Voice messages into text |
|
Emerging |
| 1051 |
vieledatengutedaten/better-teletask-extension
Browser extension that adds useful features like subtitles to HPI Tele-Task. |
|
Emerging |
| 1052 |
ycyy/faster-whisper-webui
a gradio webui for faster whisper |
|
Emerging |
| 1053 |
syntithenai/hermod
voice services stack from audio hardware through hotword, ASR, NLU, AI... |
|
Emerging |
| 1054 |
subho406/TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge... |
|
Emerging |
| 1055 |
iamjanvijay/rnnt_decoder_cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA. |
|
Emerging |
| 1056 |
bytectlgo/edge-tts
Edge TTS is a command-line tool based on Microsoft Edge's text-to-speech... |
|
Emerging |
| 1057 |
Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer |
|
Emerging |
| 1058 |
just-ai/aimybox-android-sdk
Voice assistant SDK for Android |
|
Emerging |
| 1059 |
am-sokolov/videodubber
The program for automatic dubbing any video file for a lot of languages. |
|
Emerging |
| 1060 |
nl8590687/ASRT_SDK_Java
ASRT Speech Recognition SDK for Java. 用于ASRT语音识别系统的Java SDK |
|
Emerging |
| 1061 |
ShaerWare/AI_Secretary_System
📞 Локальный AI-секретарь, тех. поддержка и менеджер по продажам с... |
|
Emerging |
| 1062 |
PABannier/bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech generation |
|
Emerging |
| 1063 |
botbahlul/vosk_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using... |
|
Emerging |
| 1064 |
huschen/kaggle_speech_recognition
Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow. |
|
Emerging |
| 1065 |
igorshmukler/kokoro-ruslan
Kokoro Language Model Training Script for Russian (Ruslan Corpus) |
|
Emerging |
| 1066 |
rajkishorbgp/JARVIS-AI-Assistant
JARVIS AI Assistant 🤖 A virtual assistant project inspired by Tony Stark's... |
|
Emerging |
| 1067 |
byhow/yanyu
A Text-to-Speech node package with pinyin audio library. |
|
Emerging |
| 1068 |
mobilepadawan/Speakit-JS
Elevate your web applications with the power of JavaScript speech synthesis. |
|
Emerging |
| 1069 |
bakaburg1/minutemaker
Generate meeting minutes starting from an audio recording or a transcripts... |
|
Emerging |
| 1070 |
BobRandomNumber/ComfyUI-DiaTTS
ComfyUI Dia safetensors implementation |
|
Emerging |
| 1071 |
huakunyang/SummerTTS
SummerTTS... |
|
Emerging |
| 1072 |
ryanleary/patter
speech-to-text in pytorch |
|
Emerging |
| 1073 |
beyondwords-io/wordpress-plugin
BeyondWords is the AI voice platform that brings frictionless audio... |
|
Emerging |
| 1074 |
keonlee9420/VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based... |
|
Emerging |
| 1075 |
caizexin/tf_multispeakerTTS_fc
the Tensorflow version of multi-speaker TTS training with feedback constraint |
|
Emerging |
| 1076 |
gladiaio/normalization
A lightweight library for normalizing speech transcripts before computing WER |
|
Emerging |
| 1077 |
asticode/go-astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library |
|
Emerging |
| 1078 |
andresayac/edge-tts-php
Edge TTS is a PHP package that allows access to the online text-to-speech... |
|
Emerging |
| 1079 |
jianchang512/zh_recogn
将音频或视频中的中文语音识别并导出为srt字幕,基于魔塔社区Paraformer模型 |
|
Emerging |
| 1080 |
mgonzs13/tts_ros
Text-to-Speech for ROS 2 |
|
Emerging |
| 1081 |
lukaszliniewicz/Pandrator
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos... |
|
Emerging |
| 1082 |
metame-ai/awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation,... |
|
Emerging |
| 1083 |
Kardbord/hfapigo
Unofficial (Golang) Go bindings for the Hugging Face Inference API |
|
Emerging |
| 1084 |
nodef/extra-amazontts
Generate speech audio from super long text through machine (via "Amazon... |
|
Emerging |
| 1085 |
Sgvkamalakar/Azure-Talking-Avatar
Explore the power of Azure Text-to-Speech with interactive talking avatar,... |
|
Emerging |
| 1086 |
agentvoiceresponse/avr-tts-elevenlabs
This repository demonstrates the integration between Agent Voice Response... |
|
Emerging |
| 1087 |
mgonzs13/piper_ros
piper Text-to-Speech for ROS 2 |
|
Emerging |
| 1088 |
hi-paris/Prosody-Control-French-TTS
An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control |
|
Emerging |
| 1089 |
Jakobovski/free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST. |
|
Emerging |
| 1090 |
meemalabs/laravel-text-to-speech
💬 A wrapper for popular TTS services to create a more simple & uniform API.... |
|
Emerging |
| 1091 |
cdimascio/watson-html5-speech-recognition
Speech Recognition for Browsers via Webkit, HTML5, and Watson |
|
Emerging |
| 1092 |
mush42/sonata
A cross-platform inference engine for neural TTS models. |
|
Emerging |
| 1093 |
bjoernkarmann/project_alias
Alias is a teachable “parasite” that is designed to give users more control... |
|
Emerging |
| 1094 |
agan-j/xiaoniu
小牛视频翻译 是一款支持本地视频翻译、字幕翻译和 YouTube 视频翻译下载的 AI... |
|
Emerging |
| 1095 |
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper |
|
Emerging |
| 1096 |
xxbb1234021/speech_recognition
中文语音识别 |
|
Emerging |
| 1097 |
garvys-org/rustfst
Rust re-implementation of OpenFST - library for constructing, combining,... |
|
Emerging |
| 1098 |
devnen/Kitten-TTS-Server
Self-host the ultra-lightweight Kitten TTS model with this enhanced API... |
|
Emerging |
| 1099 |
sdip15fa/safecantonese.ai.app
Free, open-source, offline, safe and secure AI Cantonese transcription, in... |
|
Emerging |
| 1100 |
algolia/voice-overlay-ios
🗣 An overlay that gets your user’s voice permission and input as text in a... |
|
Emerging |