All Voice AI Tools
8,165 tools ranked by quality score · Page 53 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 5201 | CodewithSudeep/suma: SUMA is a personal voice assistant; its mind is backed up by the powerful... | | Experimental |
| 5202 | Edw590/AdvancedCommandsDetection: An advanced assistant commands detection engine (understands complex... | | Experimental |
| 5203 | umjammer/vavi-speech: 🗣 Java Text to Speech (JSAPI) engines (Google Cloud, Cocoa, AquesTalk (Yukkuri)) | | Experimental |
| 5204 | keonlee9420/Deep-Learning-TTS-Template: This is a template for the Non-autoregressive Deep Learning-Based TTS model... | | Experimental |
| 5205 | vpakarinen/multimodal-webui: Multimodal WebUI using Qwen's new omni model. | | Experimental |
| 5206 | kevobt/speech-to-text: Speech recognition framework using Keras | | Experimental |
| 5207 | egorsmkv/qirimtatar-tts-datasets: Open Source Crimean Tatar Text-to-Speech datasets | | Experimental |
| 5208 | aidayang/FunASR-OneClick: Real-time speech recognition build of FunASR; recognizes audio from the microphone and from sound played on the computer; voice typing software for PC | | Experimental |
| 5209 | Requiem4soul/TTS_SS14: TTS (Text To Speech) for SS14 | | Experimental |
| 5210 | asrajeh/deepspeech-arabic: End-to-End Arabic ASR using DeepSpeech engine | | Experimental |
| 5211 | Sikbiditoilehater/YTM_Immersion: 🎶 Transform YouTube Music's web interface into an Apple Music-style lyrics... | | Experimental |
| 5212 | xaionaro-go/speech: A Speech-To-Text (with translation) library and tools; currently based on... | | Experimental |
| 5213 | inferixon/InferAnki: Norwegian Language Learning Add-on for Anki with AI-powered features | | Experimental |
| 5214 | verdaniq/Trachytalk: A text-to-speech app to help patients who can't talk | | Experimental |
| 5215 | 0vulns/Parrot: Parrot is a real-time conversion translator written in JavaScript. | | Experimental |
| 5216 | Vivek0712/lit-translate-audio: Enjoy Literature Texts translated into your preferred language as text and... | | Experimental |
| 5217 | shubham0730/FreeScribe: A React web-based transcription & translation app that uses web workers to... | | Experimental |
| 5218 | labsensacional/ASMRDataset: Recordings and transcriptions of ASMR artists compiled for the purpose of... | | Experimental |
| 5219 | RakeshBabuGajula/real-time-voice-translator: A real-time voice translator web app built with Streamlit that captures live... | | Experimental |
| 5220 | speechpro/cloud-python: Python client for the speech recognition and synthesis API of the ЦРТ (Speech Technology Center) cloud | | Experimental |
| 5221 | Ioplanet00/voice-dementia-detection: Voice-based AI solution for early dementia screening (VoiceCare) | | Experimental |
| 5222 | Vuurvos1/twitchTTS: A Twitch tool that reads/highlights highlighted messages | | Experimental |
| 5223 | rahulv07/Decypher: Decypher is a subtitle file generator for videos. It is made using Python... | | Experimental |
| 5224 | botbahlul/android-autosrt: ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE... | | Experimental |
| 5225 | harvatechs/KuRL: Ultra-Fast Indic Text-to-Speech Engine with Zero-Shot Voice Cloning | | Experimental |
| 5226 | Trinx1/TinyStreamer: 🎤 Capture audio from your microphone, encode it in MP3, and stream it live... | | Experimental |
| 5227 | piyushchugeja/Voice-assistant-for-form-filling: This is a voice-controlled form filling application built using Python and... | | Experimental |
| 5228 | canb0y/PlaySubtitle-v1: Android APP for closed captioning with inbuilt ASR (VOSK Speech... | | Experimental |
| 5229 | Kabir5296/Kakatua-ASR: Official Training Module for IUT National ICT Fest 2024 Datathon:... | | Experimental |
| 5230 | 4nn0nym05/crown-tts-openai-fivem-script: Script that is using OpenAI API for text to speech. Mainly made for speech... | | Experimental |
| 5231 | mrglaster/PySpeechRecognizer: Recognizes speech from .wav file | | Experimental |
| 5232 | Amir-Mohseni/VoiceBridge: This repository provides a dockerized Speech-to-Speech application that... | | Experimental |
| 5233 | VyetGokyra/project_NLP_final: This is a group project in the vin program: Modality Balance for Multimodal... | | Experimental |
| 5234 | spokestack/react-native-spokestack-tray: React Native component for adding Spokestack to a React Native app | | Experimental |
| 5235 | chrarvi/automatic-speech-recognition: An automatic speech recognition transformer for converting Swedish voice to text. | | Experimental |
| 5236 | ZizhaoZheng-Charlie/DodoBotOffical: Developed a Discord bot integrating voice recognition, Spotify streaming,... | | Experimental |
| 5237 | emilykhidirova/speech-emotion-recognition: Speech emotion recognition using fine-tuned Wav2Vec2 | | Experimental |
| 5238 | Tim55667757/AudioGenerator: Voice-over of Russian and foreign-language texts via the OpenAI platform | | Experimental |
| 5239 | Zhanerd/HumanPose_Face_Analysis: Aims to provide inference (face, pose, OCR, TTS, etc.) for ONNX, TensorRT, and RKNN. | | Experimental |
| 5240 | vovandreevik/Speech-Recognition-Model: Web application that allows users to control car functions using voice commands | | Experimental |
| 5241 | noahvelasco/Flutter-ElevenLabs-Tutorial: DEV: Flutter + A.I. Text-To-Speech: A Simple Guide | | Experimental |
| 5242 | hemanth-07-11/Speech-to-text-convertor: This is a Speech to text converter app, developed by HEMANTH N that... | | Experimental |
| 5243 | czyzi0/the-mc-speech-dataset: Free speech dataset consisting of 24018 short audio clips of a single... | | Experimental |
| 5244 | Hiaggprkfkrkfk/ComfyUI-QwenTTS: 🎤 Enhance your voice projects with ComfyUI-QwenTTS, featuring custom nodes... | | Experimental |
| 5245 | harrisonwang/speech-recognizer: A Node.js SDK for Xunfei Speech Recognition (IAT) service, providing... | | Experimental |
| 5246 | Regaez/datastar-speech: A custom Datastar action plugin that leverages the Web Speech API in order... | | Experimental |
| 5247 | ishandeveloper/Speech_Recognition: Speech Recognition and Text-To-Speech implemented using Google... | | Experimental |
| 5248 | mgdicesare/lecture-notes-generator: Automated pipeline to transcribe lecture audio with Whisper, generate... | | Experimental |
| 5249 | ocatias/AutoMash: Automatically create YouTube mashups. Given videos and a text, AutoMash will... | | Experimental |
| 5250 | JoeS69/parrot: Convert text to speech offline with Parrot, a private AI tool that reads... | | Experimental |
| 5251 | dobby-seo/kosr: Transformer-based Korean speech recognition | | Experimental |
| 5252 | Kenjd/student-name-pronunciation-helper: A Shiny app to help teachers learn correct pronunciation of student names | | Experimental |
| 5253 | tassosblackg/Deep4Deep: CNN implementation for ASV | | Experimental |
| 5254 | Rohit909-creator/EfficientWordNet_Upgrade: EfficientWordNet enhances wakeword detection with noise-robust similarity... | | Experimental |
| 5255 | maschlr/summaree_bot: AI assistant to transcribe, translate and summarize voice messages and audio files | | Experimental |
| 5256 | AndrewFarley/OpenTX-Generate-Sounds-Amazon-Polly: A helper script to generate OpenTX sounds based on the language .csv | | Experimental |
| 5257 | CodersCreative/voice-assistant-py: A Python voice assistant made to be easy to set up, customize... | | Experimental |
| 5258 | code2k13/pipico_speech_recognition: This repository contains code and instructions to implement single word... | | Experimental |
| 5259 | zatomos/Speech-to-text_bot: A Discord bot for voice message transcription | | Experimental |
| 5260 | RKirlew/Scarlet-Virtual-Assistant-version-1.1: S.C.A.R.L.E.T (Sorta Crappy Assistant Robot Lazily Engineered Today) The... | | Experimental |
| 5261 | lingualogic/my-speech-listen-en: Example for Speech-Angular ListenService | | Experimental |
| 5262 | b0o/whispertool: 🗣️ voice recording and transcription tool built on whisper.cpp | | Experimental |
| 5263 | Dimas6690/runanywhere-expo-demo: 🤖 Showcase on-device AI capabilities with RunAnywhere SDK in React Native +... | | Experimental |
| 5264 | khalid-sha/arabic-ai-pronunciation: Guidelines and linguistic rules for improving Arabic pronunciation in AI... | | Experimental |
| 5265 | BABIN-JOE/FLUENT-EDGE: Fluent Edge is an intelligent, real-time web application designed to help... | | Experimental |
| 5266 | sprakhar77/AssistantYui: Yui is a helpful personal assistant with simple functionalities for daily... | | Experimental |
| 5267 | ttsaigit/tts-widget: Embeddable AI voice chat widget: add voice AI agents to any website with... | | Experimental |
| 5268 | 4350pChris/matrix-transcriptions: Transcribe those annoying voice messages. | | Experimental |
| 5269 | happytunesai/EZ-STT-Logger-GUI: Python GUI for real-time Speech-to-Text (STT) using local Whisper, OpenAI... | | Experimental |
| 5270 | Tharinda-Pamindu/Audio_Transcription: 🎙️ AI-powered Audio Transcription: Transcribe Sinhala & English audio using... | | Experimental |
| 5271 | pjayanthi/franken_whisper: Orchestrate Rust-based speech-to-text pipelines with adaptive routing,... | | Experimental |
| 5272 | FAIZAN-Bor/QuranCompanion: QuranCompanion helps learners practice Quran recitation with real-time AI... | | Experimental |
| 5273 | harshit-862000/Text-to-speech-generation-with-LLM-hugging-face: This project demonstrates how to generate speech from text using a... | | Experimental |
| 5274 | rwst/Lichess-by-Voice: Play casual chess on lichess.org via voice commands | | Experimental |
| 5275 | nayyhah/Decipher: A web-based tool to provide multilingual versions of videos hosted online. | | Experimental |
| 5276 | shujaatsunasra/mind-ai-voice-assistant: Voice-first AI productivity companion with natural speech, earphone nudges,... | | Experimental |
| 5277 | KunalGehlot/myWhisperer: Free, open-source voice-to-text desktop app powered by OpenAI Whisper and... | | Experimental |
| 5278 | tuzibr/Real_time_caption_translate: A real-time caption translation tool based on VOSK speech recognition and... | | Experimental |
| 5279 | Voxray-AI/Voxray: Real-time voice AI pipeline in Go. STT → LLM → TTS. Any provider, any transport. | | Experimental |
| 5280 | ShadowLp174/discord-stt: A Node.js module for speech to text transcription in Discord voice channels... | | Experimental |
| 5281 | v-aibha-v-jain/VA-task-executor: A desktop voice assistant system that can execute commands such as opening URLs and apps. | | Experimental |
| 5282 | NTT123/hifigan-tpu: Train HiFi-GAN on TPU | | Experimental |
| 5283 | IvanEvan/chinese-digital-speech-recognition: Chinese spoken-digit recognition: recognizes 8-digit spoken sequences of the kind used in voice CAPTCHAs | | Experimental |
| 5284 | BitsofJeremy/WeirDing: Audiobook narration engine powered by Qwen3-TTS. Upload documents, pick a... | | Experimental |
| 5285 | Shuhua-L/Expense-Tracker: AI-powered 🤖 application designed to simplify and enhance daily expense... | | Experimental |
| 5286 | i-Rony/F.R.I.D.A.Y: Simple AI assistant capable of Speech Recognition and minor tasks with... | | Experimental |
| 5287 | PiyushKhanna30/Virtual-Assistant: Virtual Assistant is made using WolframAlpha and Wikipedia. Here I have used... | | Experimental |
| 5288 | eja/tts-server: An Android app for text-to-speech via HTTP requests. | | Experimental |
| 5289 | PrashanthaTP/wav2mov: Speech to Facial Animation using GANs | | Experimental |
| 5290 | imgta/vialect: Streamline your video/audio intake by transforming multimedia content into... | | Experimental |
| 5291 | magicvoiceai/MagicVoice: MagicVoice | | Experimental |
| 5292 | k1ngjet3r/GA_test_automation: Google Assistant test automation, converting text to speech, speech to text,... | | Experimental |
| 5293 | glhr/speech: Text-to-Speech and Speech-to-Text methods for Python | | Experimental |
| 5294 | mklement0/speak.awf: An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak... | | Experimental |
| 5295 | Farmerok/Uvoxus-Voice-Assistant: Uvoxus Voice Assistant for Windows lets you control your PC using commands,... | | Experimental |
| 5296 | SoheilGtex/Voice-Cloning-SV2TTS-: Safe, production-ready starter for voice cloning via SV2TTS (RTVC wrapper).... | | Experimental |
| 5297 | Lostenergydrink/styletts2-dataset-toolkit: Complete Windows-optimized workflow for voice cloning with StyleTTS2.... | | Experimental |
| 5298 | dnacenta/voice-echo: Voice interface for Claude Code over the phone via Twilio | | Experimental |
| 5299 | negihimanshu015/EchoSign: EchoSign is a full-stack American Sign Language (ASL) learning platform. The... | | Experimental |
| 5300 | gongouveia/Whisper-Synthetic-ASR-Dataset-Generator: This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI... | | Experimental |