All Voice AI Tools
8,165 tools ranked by quality score · Page 44 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 4301 | BrunoHenrique00/ear: Ear is a desktop app that will help you transcribe what is playing on your computer! | | Experimental |
| 4302 | mahirgul/Google-Cloud-Text-To-Speech-PHP: Google Cloud Text To Speech Application - PHP | | Experimental |
| 4303 | BestSithInEU/cc-vox: Claude Code plugin that speaks a short summary aloud after every response | | Experimental |
| 4304 | dictto-app/dictto: Voice-to-text for Windows — hold a hotkey, speak, release. Clean text... | | Experimental |
| 4305 | bunyaminergen/awesome-speech-dataset: Awesome Speech Dataset, including download links and a brief explanation for... | | Experimental |
| 4306 | chirag127/ZenRead-AI-Content-Reader-Browser-Extension: Privacy-first browser extension providing clean reader mode, AI... | | Experimental |
| 4307 | Youtube-Transcript-Dev/Youtube-Transcript-API: YouTube Transcript API — Extract, transcribe, and translate YouTube videos... | | Experimental |
| 4308 | habitual69/speakify-api: Speakify is a simple API that generates audio and subtitles from text using... | | Experimental |
| 4309 | Bangla-Language-Processing/Bangla-Speech-Corpora: Bangla cleaned speech corpus, specially developed for Bangla Text to Speech | | Experimental |
| 4310 | LinkonBSMRSTU/Speech-To-Text-App-iOS: A simple iOS App that can convert speech/voice into text. Only English voice... | | Experimental |
| 4311 | klee-repos/dialogflow-voice-streaming: Intent mapping with real-time voice to text stream | | Experimental |
| 4312 | Parseval-Labs/SoundML: A high level DSP library in the OCaml language | | Experimental |
| 4313 | daaminashai/speech-assistant: Speech assistant for individuals suffering from aphasia | | Experimental |
| 4314 | conbitin/htk3.5-install: Installation steps of HTK 3.5 under Ubuntu | | Experimental |
| 4315 | tomasgoiba/diphone-synthesizer: Basic diphone-based concatenative speech synthesizer in English. | | Experimental |
| 4316 | R3ner/Barrel-Timer: Advanced voice-controlled cooldown tracker for League of Legends. Tracking... | | Experimental |
| 4317 | aws-samples/seq2seq-asr-misbehaves: Artifacts for the paper "Attentional Speech Recognition Models Misbehave on... | | Experimental |
| 4318 | kyopark2014/demo-robo-soulmate: A repository to prepare a demo for a dancing robot. | | Experimental |
| 4319 | anuran-roy/vosk-demo: A simple offline voice recognition system purely built on Python3, that... | | Experimental |
| 4320 | avikantz/Samaritan: Samaritan demo clone for iOS. | | Experimental |
| 4321 | Aalwattar/ParrotInk: Professional-grade, real-time voice-to-text for Windows. Stream your voice... | | Experimental |
| 4322 | celanthe/clarion: Your agents have things to say. Now they have a voice to say them with. | | Experimental |
| 4323 | p-jacobo2012240/AI-Real-Time-Recognition: TensorFlow app for real-time environment sketching using text-to-speech and GCP | | Experimental |
| 4324 | Mmesek/mUSh: Ultrastar Songs Creation/Management helper utils. | | Experimental |
| 4325 | Leonqn/speech-to-text-bot: Speech to text Telegram bot. It can convert voice and video note messages to... | | Experimental |
| 4326 | verbio-technologies/rust-verbio-speech-center: Rust integration with Verbio Speech Center Cloud https://www.speech-center.verbio.com | | Experimental |
| 4327 | palahsu/textspeech: A Python program that helps you to read your text in lady robot voice at... | | Experimental |
| 4328 | abhishekkr/lyrical-video-generator: lyrical-video-generator is supposed to help create lyrical videos from audio... | | Experimental |
| 4329 | miikkij/Speechos: Local-first speech AI benchmarking — compare STT, TTS, emotion & diarization... | | Experimental |
| 4330 | iChochy/mimo-tts-chat: MiMo TTS Chat | | Experimental |
| 4331 | ammarasmro/Kurdish-Language: Applications of NLP on the Kurdish language | | Experimental |
| 4332 | raym33/aiemoji: AI talking face | | Experimental |
| 4333 | DIY-Engineering/Advanced-STS-Local-AI-Assistant: This is a fully local AI Assistant that uses Silero VAD, Faster-Whisper, LM... | | Experimental |
| 4334 | SKLD-xm/speechy: A text-to-speech synthesizer based on C# that supports SSML | | Experimental |
| 4335 | box-community/sample-audio-skills: 🎼 Box Skills samples for processing audio files | | Experimental |
| 4336 | PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI: Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI... | | Experimental |
| 4337 | rvuyyuru2/supertonic-restapi: Supertonic FastAPI - High Performance OpenAI-Compatible TTS API | | Experimental |
| 4338 | senigami/audiobook-studio: Professional local-first AI production pipeline for long-form narration.... | | Experimental |
| 4339 | harmindersinghnijjar/streamlit-punjabi-ai: Punjabi AI, ChatGPT with translation and Punjabi TTS using Narakeet's API. | | Experimental |
| 4340 | x-phone/xbridge: Self-hosted voice gateway — WebSocket audio streaming and REST call control.... | | Experimental |
| 4341 | somosnlp/wav2vec2-spanish: Pre-train a Spanish Wav2Vec2 model using the Spanish portion of the Common... | | Experimental |
| 4342 | tozalia/pocket-tts-openapi-gpu: 🎤 Clone voices locally with Pocket TTS OpenAPI - GPU. Enjoy free,... | | Experimental |
| 4343 | victoryangzhijie/stt-server: Real-time speech-to-text WebSocket server with pluggable ASR backends,... | | Experimental |
| 4344 | spokestack/android-skeleton: A functionless Android app that demonstrates a basic integration with the... | | Experimental |
| 4345 | xujiaao/BezierSpline: Android - Smooth Bézier Spline Through Prescribed Points | | Experimental |
| 4346 | msalhab96/RNN-Transducer: PyTorch implementation of Sequence Transduction with Recurrent Neural... | | Experimental |
| 4347 | syedjahangirpeeran/Optical-Character-Recognition-and-TTS: Written in MATLAB, the project aims to convert hand written or printed text... | | Experimental |
| 4348 | artem-gorodetskii/long-form-voice-cloning: Audio samples from "Zero-Shot Long-Form Voice Cloning with Dynamic... | | Experimental |
| 4349 | Salama1429/speech-to-speech-translation: Cascaded speech-to-speech translation (STST), mapping from source speech in... | | Experimental |
| 4350 | rohansx/convox: Open-source voice AI orchestration platform for India. Build production... | | Experimental |
| 4351 | 1Finn2me/Novery: A modern Android novel reader with multi-source support, TTS, and offline reading | | Experimental |
| 4352 | gillan-krishna/meeting_notes: Hobby project to transcribe audio files from meetings to transcripts with a summary | | Experimental |
| 4353 | baharudin-yusup/salingsapa: A video call app to enable deaf people to communicate with normal people... | | Experimental |
| 4354 | raminnakhli/HMM-DNN-Speech-Recognition: This repository is a Python implementation of HMM-DNN model. | | Experimental |
| 4355 | Sneakyhydra/Sentinel: Voice Assistant using Whisper in python3 | | Experimental |
| 4356 | Shibli-Nomani/Open-Source-Models-with-Hugging-Face: Open Source Models With Hugging Face | | Experimental |
| 4357 | vorojar/VibeVoice: Open-source AI audiobook studio. A free, private alternative to ElevenLabs.... | | Experimental |
| 4358 | OpenLake/Speech-Analyser: An App to help you improve your English fluency 🎤 | | Experimental |
| 4359 | Prakash2403/asl-recognizer: Sign language recognition using Hidden Markov Models | | Experimental |
| 4360 | treychen-369/WallWhisper: 🏠 Turn any IP camera into a smart English tutor for your family. AI-powered,... | | Experimental |
| 4361 | prabormukherjee/Coursera_Helper_chatbot: A chatbot to help Coursera students with their difficulties. | | Experimental |
| 4362 | 10raw/Prescription-Generator: Android app to generate Doctor's Prescriptions faster using Deep Learning | | Experimental |
| 4363 | FernandoLpz/SpeechRecognition: This repository contains the implementation of an Automatic Speech... | | Experimental |
| 4364 | David-Antolick/REX_voice_assistant: Lightweight offline voice assistant for hands-free music control (YouTube... | | Experimental |
| 4365 | SinhaRepo/nexus-ai-assistant: A distributed AI voice assistant built on a Raspberry Pi Zero W and a Flask... | | Experimental |
| 4366 | manojsvgit/Voice_Based_Email_For_Blind: A Python-based application designed specifically for visually impaired... | | Experimental |
| 4367 | h4l0anne/adv-companion-app: [🏆 Harm Reduction Category Winner, Top 5 @RUHacks] Drug-Venture Companion | | Experimental |
| 4368 | EN10/Speech-to-Text-WaveNet: Speech to Text | | Experimental |
| 4369 | richardr1126/KittenTTS-FastAPI: High-performance KittenTTS API server with a built-in web UI,... | | Experimental |
| 4370 | Michaelrace/awesome-voice-agents: 🗣️ Explore a curated list of voice AI agents, frameworks, tools, and best... | | Experimental |
| 4371 | dalyanalytics/counselor: 👑 voice-powered code review tool for R developers | | Experimental |
| 4372 | AnuragGupta93/LocalEcho: **LocalEcho** is a fully local, open-source Text-to-Speech engine powered by... | | Experimental |
| 4373 | Maxborland/mindtype-app: MindType — Voice-to-text with AI-powered summaries. 100+ languages, works... | | Experimental |
| 4374 | IceDynamix/iceTTS: Twitch Chat TTS with no strings attached | | Experimental |
| 4375 | Ziggx5/TalkToText: Speech-to-text app built with Python and Vosk speech recognition engine | | Experimental |
| 4376 | erogol/TTS_tf: WIP TensorFlow implementation of https://github.com/mozilla/TTS | | Experimental |
| 4377 | gupta-v/Eva: Eva - Desktop Assistant: A Python-based desktop assistant designed for... | | Experimental |
| 4378 | meangrinch/spelling-bee: Spelling bee game with multiple difficulty tiers | | Experimental |
| 4379 | 371tti/Nelfie: A **standalone** Discord bot with LLM, VOICEVOX, and KaTeX support. | | Experimental |
| 4380 | ryanblab1903n8/piperplus: Piperin is an efficient TTS tool that instantly creates high-quality audio... | | Experimental |
| 4381 | angelinekeke/claude-awake-speak: Make your Claude Code talk: automatically reads Chinese content aloud, 8 official Microsoft voices to choose from, real-time switching, free with no API key required, cross-platform support | | Experimental |
| 4382 | lianabisuna/spelltacular: Random word spelling skills test/practice (Vue.js 2 & Vuetify) | | Experimental |
| 4383 | GrahamPellegrini/Machine-Learning-Noise-Cancellation: Bachelor Final Year Project exploring real-time speech denoising using... | | Experimental |
| 4384 | Jaffe2718/qwen3asr4j: Java binding for Qwen3 ASR | | Experimental |
| 4385 | analyticsinmotion/wake-word: Hands-free voice activation for VS Code, Cursor, and compatible editors.... | | Experimental |
| 4386 | YOUSSEF-BT/Ai-Summarizer: AI-powered summarizer for articles, PDFs, and Word documents with... | | Experimental |
| 4387 | joypix-ai/joypix: AI Talking Video Generator: Talking Photo (AI lip-sync) + AI Avatar... | | Experimental |
| 4388 | LiveTrad/livetrad-io-engine: A complete solution for Live meeting translations Extension+DesktopApp+Server | | Experimental |
| 4389 | Sariel2018/audio-srt-aligner: Dual-mode subtitle tool: transcript-aware alignment and audio-only auto... | | Experimental |
| 4390 | masasibata/t-one-rest-api: Production-ready REST API for Russian speech recognition using T-one model.... | | Experimental |
| 4391 | anatolykoptev/moonshine-whisper: Fast speech-to-text HTTP service powered by Moonshine + sherpa-onnx. Beats... | | Experimental |
| 4392 | ComputerCampaign/contentflow-ai: A powerful Python tool that integrates web image scraping and automatic blog generation. Supports XPath rule configuration, task ID management, Selenium dynamic loading, GitHub image hosting uploads,... | | Experimental |
| 4393 | ekdysis/Speech-POC: POC using Apple's Speech framework demonstrating real-time speech... | | Experimental |
| 4394 | Argo-Robot/wake_word_detection: Step-by-step guide to implement a wake-word detection system for Argo, an AI... | | Experimental |
| 4395 | ducnt18121997/Viet-Transformer-TTS: This is PyTorch Implementation of A Non-Autoregressive Transformer with... | | Experimental |
| 4396 | ZhanpengWang96/pytorch-speech2vec: PyTorch implementation of the paper Speech2Vec: A Sequence-to-Sequence... | | Experimental |
| 4397 | BinkyWong/speech-recognition: CentOS 7 based container for speech recognition | | Experimental |
| 4398 | PrarieComamile/speech-to-text: Convert your voice to text file with this program. | | Experimental |
| 4399 | davidsuragan/elevenlabs-alpha-v3: Use ElevenLabs Alpha v3 TTS model in Python with this repo. | | Experimental |
| 4400 | Nono-04/ChannelPoints-TTS: A simple TTS rewards script for Twitch channel points | | Experimental |