All Voice AI Tools
8,165 tools ranked by quality score · Page 22 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2101 |
ElsebaiyMohamed/Modablag
This project presents a comprehensive study on video dubbing techniques and... |
|
Emerging |
| 2102 |
nidi3/swiss-wowbagger
Let yourself be insulted in swiss german. Schöner fluchen auf Berndeutsch. |
|
Emerging |
| 2103 |
jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised... |
|
Emerging |
| 2104 |
ayutaz/uCosyVoice
CosyVoice3 text-to-speech for Unity using ONNX inference. Supports zero-shot... |
|
Emerging |
| 2105 |
gokhaneraslan/XTTS_V2-finetuning
Training XTTS V2 and PEFT LORA Text-to-Speech (TTS) |
|
Emerging |
| 2106 |
crimson0829/RecordVoiceView
录音控件 for Android,支持实时语音转化为文字 |
|
Emerging |
| 2107 |
GuruCharan94/az-podcast-transcriber
A podcast transcription service built on Azure that transcribes any new... |
|
Emerging |
| 2108 |
d-kavinraja/MouthMap
MouthMap is a deep learning-based lip reading system that converts silent... |
|
Emerging |
| 2109 |
TejasQ/praise
Do stuff with your voice in the browser. |
|
Emerging |
| 2110 |
shervinemami/practice_speechrec_mappings
A game to help design a better character mapping and to learn the mapping... |
|
Emerging |
| 2111 |
StachePL/ExcelToAmazonPolly
Simple text-to-speech tool combining powers of Excel and Amazon Polly. |
|
Emerging |
| 2112 |
rudra00434/SoulPlayer
My own music application build with Django , Tailwind CSS and Spacy... |
|
Emerging |
| 2113 |
deeheber/text-to-speech-converter
A serverless application that converts blobs of text to speech in an audio file |
|
Emerging |
| 2114 |
Yuan-ManX/ComfyUI-ChatterboxTTS
ComfyUI-ChatterboxTTS is now available in ComfyUI, Chatterbox is the first... |
|
Emerging |
| 2115 |
techiaith/docker-huggingface-stt-cy
Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech... |
|
Emerging |
| 2116 |
heyseth/Piper_TTS
Use Piper TTS in Visual Studio Code |
|
Emerging |
| 2117 |
Malith-Rukshan/whisper-transcriber-bot
🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI... |
|
Emerging |
| 2118 |
hay/audio2text
Python command line utility wrappers for Whispercpp and other speech-to-text... |
|
Emerging |
| 2119 |
wulee510505/Text2Speach
一句代码搞定语音合成,文字转语音 |
|
Emerging |
| 2120 |
uzbekvoice/UzbekVoiceBot
Current and Live Telegram bot for collecting dataset |
|
Emerging |
| 2121 |
ducnt18121997/Viet-Text-Normalization
A Python library for text normalization, specifically designed for... |
|
Emerging |
| 2122 |
Jugendhackt/synthi-tts
Hackathon project to digitize your own voice and have it speak for you!... |
|
Emerging |
| 2123 |
playerony/TensorFlowTTS-ts
This project implements TensorflowTTS in Tensorflow.js using Typescript,... |
|
Emerging |
| 2124 |
poretsky/rulex
Russian pronunciation dictionary |
|
Emerging |
| 2125 |
Harshit-Raj-14/JARVIS-Python-Voice-Assistant
J.A.R.V.I.S - Python Smart AI Voice Assistant |
|
Emerging |
| 2126 |
momalekiii/VTT
Extract Speech/Text from Video |
|
Emerging |
| 2127 |
nishantnnb/spectrolipi
A tool designed to manage annotations for bioacoustics. |
|
Emerging |
| 2128 |
MitchellAW/Discord-Bot
My own Discord chat bot built in Python using the discord.py API. Has been... |
|
Emerging |
| 2129 |
theinlinaung2010/Azure_speech_to_test
Sample code for testing speech recognition (speech-to-text) of Burmese... |
|
Emerging |
| 2130 |
ismailperim/reportcast
Transform reports into podcasts with AI - Nobody reads your reports. But... |
|
Emerging |
| 2131 |
aflr-archive/apiaudio-python
api.audio Python SDK |
|
Emerging |
| 2132 |
cloudcommunity/Text-to-Speech-Engines
A list of different text to speech engines. |
|
Emerging |
| 2133 |
LWalone/fish-speech
🐟 Enhance communication with Fish Speech, a powerful multilingual... |
|
Emerging |
| 2134 |
MontrealAI/sign2text-v0
Sign Language to Text (A to Z) with Artificial Intelligence | Pre-Alpha Demo |
|
Emerging |
| 2135 |
neosun100/Step-Audio-R1.1
Step-Audio-R1.1: The First Audio Language Model with Test-Time Compute... |
|
Emerging |
| 2136 |
sahu-adarsh/intervyu
Practice job interviews with Neerja, an AI interviewer powered by Claude.... |
|
Emerging |
| 2137 |
jcsilva/docker-kaldi-android
Dockerfile for compiling Kaldi for Android. |
|
Emerging |
| 2138 |
parzibyte/conversor-imagen-a-texto-js
Extraer texto de imagen utilizando JavaScript y Tesseract.js |
|
Emerging |
| 2139 |
ThePlasmak/faster-whisper
An OpenClaw skill that uses faster-whisper (a faster implementation of the... |
|
Emerging |
| 2140 |
syb0rg/Khronos
The open source intelligent personal assistant |
|
Emerging |
| 2141 |
morfeusys/porfir
Голосовой ассистент Порфирьевич |
|
Emerging |
| 2142 |
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020:... |
|
Emerging |
| 2143 |
CodersCreative/faster-whisper-rs
a rust crate for easily implementing faster-whisper stt into your rust programs. |
|
Emerging |
| 2144 |
LinqLover/simple-openai-tts-playground
Try out the OpenAI Text to Speech API in your browser. |
|
Emerging |
| 2145 |
LearnedVector/Wav2Letter
Speech Recognition model based off of FAIR research paper built using Pytorch. |
|
Emerging |
| 2146 |
egorsmkv/tts_uk
High-fidelity speech synthesis for Ukrainian using modern neural networks. |
|
Emerging |
| 2147 |
ontypehq/mlx-swift-asr
On-device speech recognition for Apple Silicon, powered by MLX. |
|
Emerging |
| 2148 |
atosystem/SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model,... |
|
Emerging |
| 2149 |
rafalimadev/piper-tts-call
Python wrapper for Piper TTS with real-time CLI/GUI, global hotkeys, and... |
|
Emerging |
| 2150 |
NeoKazuya/qwen3-tts-enhanced
Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation... |
|
Emerging |
| 2151 |
Degon3399/XTTS_V2
This repository offers a framework for fine-tuning the XTTS_V2 model,... |
|
Emerging |
| 2152 |
aviaryan/Very-Fast-Dictation
Instant dictation app for Mac |
|
Emerging |
| 2153 |
mikex86/DeepSpeech-Java-Bindings
Java Bindings for the C++ library DeepSpeech |
|
Emerging |
| 2154 |
QuantiusBenignus/blurt
Gnome shell extension for accurate OFFLINE speech to text input in Linux... |
|
Emerging |
| 2155 |
MahtaFetrat/ManaTTS-Persian-Tacotron2-Model
Tacotron2 Persian Text-to-Speech Model trained on ManaTTS, the largest open... |
|
Emerging |
| 2156 |
daslearning-org/text-to-speech-offline
A lightweight cross-platform Text-To-Speech application which works on... |
|
Emerging |
| 2157 |
oleksandr-g-rock/speech2text
speech2text |
|
Emerging |
| 2158 |
Saganaki22/ComfyUI-KugelAudio
🗣️ ComfyUI nodes for KugelAudi- Open-source text-to-speech with voice... |
|
Emerging |
| 2159 |
winedarkmoon/ElevenGUI
A user-friendly interface for ElevenLabs' API with added audio transcription... |
|
Emerging |
| 2160 |
1038lab/ComfyUI-VoxCPMTTS
A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech)... |
|
Emerging |
| 2161 |
greg-kennedy/p5-NRL-TextToPhoneme
Perl implementation of the Naval Research Laboratory text-to-phoneme... |
|
Emerging |
| 2162 |
wildminder/ComfyUI-KaniTTS
ComfyUI node for modular, human‑like Kani TTS. Generate natural,... |
|
Emerging |
| 2163 |
mu-hashmi/personaplex-mlx
PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex... |
|
Emerging |
| 2164 |
tim-gromeyer/VoiceAssistant
Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your... |
|
Emerging |
| 2165 |
echonoshy/tingshu
Tingshu 听舒 | Bringing the author’s voice directly to you |
|
Emerging |
| 2166 |
llami-team/wake-me
AI-based React component library that detects clapping sounds or finger... |
|
Emerging |
| 2167 |
Robofied/Voicenet
Comprehensive Python library for speech and voice. |
|
Emerging |
| 2168 |
stefantaubert/mean-opinion-score
Python library for calculating the mean opinion score and 95% confidence... |
|
Emerging |
| 2169 |
kaloprojects/KALO-ESP32-Voice-Assistant
Code snippets showing how to record I2S audio and store as .wav file on... |
|
Emerging |
| 2170 |
fernicar/Parakeet_GUI_TINS_Edition
A desktop application built using the TINS paradigm for transcribing audio... |
|
Emerging |
| 2171 |
sydkwests/kwest-whisper-analysis
Conducted a comprehensive technical analysis of the Whisper model on... |
|
Emerging |
| 2172 |
Oct4Pie/persian-stt
A Text-To-Speech Model Developed Using 🐸STT |
|
Emerging |
| 2173 |
Ma-Dan/asr-decode
从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库 |
|
Emerging |
| 2174 |
wblgers/hmm_speech_recognition_demo
A demo for simple isolated Chinese speech word recognition using GMMHMM in Python |
|
Emerging |
| 2175 |
htn-l/htn-l.github.io
Takes in audio feed from lectures or meetings, performs speech to text... |
|
Emerging |
| 2176 |
supershaneski/openai-chatterbox
A sample Nuxt 3 application that listens to chatter in the background and... |
|
Emerging |
| 2177 |
tsengia/JSGFKit_Plus_Plus
A C++ library for parsing and manipulating JSGF grammar files. |
|
Emerging |
| 2178 |
bundlab/voice-stream
🎙️ Lightweight offline Python TTS engine. Thread-safe, CLI-ready, and... |
|
Emerging |
| 2179 |
MahtaFetrat/ManaTTS-Persian-Speech-Dataset
ManaTTS is the largest open Persian speech dataset with 114+ hours of... |
|
Emerging |
| 2180 |
sooftware/lightning-asr
Modular and extensible speech recognition library leveraging... |
|
Emerging |
| 2181 |
sayyedrizwan/TextConvertor
Convert Text into Voice(Speech) and Speech into Text.. |
|
Emerging |
| 2182 |
edouardpoitras/eva
Open source voice-enabled personal assistant |
|
Emerging |
| 2183 |
vigonotion/tts.astromech
Text to Astromech integration for Home Assistant (R2D2 Beep Boop Sounds) |
|
Emerging |
| 2184 |
notebook-nexus/chatterbox-tts-colab
Transform any text into natural-sounding speech, clone voices from audio... |
|
Emerging |
| 2185 |
smartgic/docker-mycroft
Mycroft AI Voice Assistant Docker images and docker-compose.yml files for... |
|
Emerging |
| 2186 |
amitpatil321/VoiceForm
Voice Controlled Form, Which can be filled, cleared, submitted using only... |
|
Emerging |
| 2187 |
maemreyo/omnivoice-server
OpenAI-compatible HTTP server for OmniVoice text-to-speech |
|
Emerging |
| 2188 |
cottongeeks/podscript
Generate podcast transcripts using language and speech-to-text models |
|
Emerging |
| 2189 |
Sundy1219/ctc_beam_search_lm
CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统 |
|
Emerging |
| 2190 |
shanghaimoon888/mod_vadasr
This is FreeSwitch module that can do VAD and ASR with IFLYTEK websocket api. |
|
Emerging |
| 2191 |
mahimairaja/openrtc-python
OpenRTC lets developers run multiple LiveKit voice agents in one Python... |
|
Emerging |
| 2192 |
DKMitt/speech-to-text-js
The Voice Note App's purpose is to experiment with the Web Speech API by... |
|
Emerging |
| 2193 |
Sri-Krishna-V/Elu
AI-powered Chrome extension that makes any web article accessible —... |
|
Emerging |
| 2194 |
vectominist/MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit. |
|
Emerging |
| 2195 |
lucko515/Speech-commands-recognition
Recognizing common speech commands using Keras and Tensorflow. |
|
Emerging |
| 2196 |
Zoomicon/SpeechLib
Library for Speech Synthesis and Recognition using Windows.Speech or... |
|
Emerging |
| 2197 |
GuangChen2333/FindUrVoicesPJSK
《世界计划 : 缤纷舞台》单角色语音数据集一键获取小工具 | 无需手动打标 | wav无压缩 | A simple tool for obtaining... |
|
Emerging |
| 2198 |
aks-devs/mod_google_tts
Freeswitch Text-To-Speech module |
|
Emerging |
| 2199 |
hmeutzner/kaldi-avsr
Kaldi-based audio-visual speech recognition |
|
Emerging |
| 2200 |
lissettecarlr/kuon
久远:一个开发中的大模型语音助手,当前关注易用性,简单上手,支持对话选择性记忆和Model Context Protocol (MCP)服务。... |
|
Emerging |