All Voice AI Tools
8,165 tools ranked by quality score · Page 45 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 4401 | inforkgodara/python-speech-to-text: A few lines of code that convert speech to text. | | Experimental |
| 4402 | boned-fruitwood759/whisperx-asr-with-fastapi: 🎤 Enable real-time speech recognition with WhisperX using FastAPI for... | | Experimental |
| 4403 | hutchpd/AI-Medical-Scribe: Local-first AI medical scribe running entirely in the browser using Chrome... | | Experimental |
| 4404 | Sergey004/silero_tts_rvc: A simple extension that allows an LLM to speak in any voice, literally, based... | | Experimental |
| 4405 | profdilley/markdown-speech-converter: This tool converts Markdown files into **speech-friendly plain text** files.... | | Experimental |
| 4406 | danielcorsano/reader-gui: Standalone app for creating audiobooks from ebooks using realistic AI voices... | | Experimental |
| 4407 | mohammadhasananisi/Google-Speech-Recognition: Persian-Speech-Recognition | | Experimental |
| 4408 | sindhura-pv/lip-reading: In this project, visual speech recognition has been attempted using 2 major... | | Experimental |
| 4409 | ss87021456/mfcc_ctc_speech: Applies MFCC features of the waveform to an LSTM + CTC loss architecture | | Experimental |
| 4410 | Dante9581/laravel-elevenlabs: 🎤 Integrate ElevenLabs Text-to-Speech and Speech-to-Text APIs seamlessly... | | Experimental |
| 4411 | JagratiVerma1408/ObjectDetectionApplication: Android app integrating a tflite model for object detection | | Experimental |
| 4412 | Bacdong/virtual-assistant-v1: Learning to build a virtual assistant with Python and Python library support. | | Experimental |
| 4413 | Manokero/face-recognition-and-tts-numbers: This project uses facial recognition to verify a person... | | Experimental |
| 4414 | andydowsen/voice-assistant: 🏳🌌♨ Simple voice assistant with minimal AI logic, includes streamlit web... | | Experimental |
| 4415 | swiss-ai-center/text-to-speech-service: Queries an API based on Edge-TTS and returns an audio file based on... | | Experimental |
| 4416 | akashchaudhary-git/android-azure-speech-openai: An integration of Azure Speech Service and Azure OpenAI in Android. This... | | Experimental |
| 4417 | NitinN77/ASL-To-Speech-Rpi: A Pi setup to recognize ASL signs using a pre-trained CNN model and speak it... | | Experimental |
| 4418 | yiwise/yiwise-asr-demo-java: Java client demo for the ASR service developed in-house by Hangzhou Yizhi Intelligent Technology Co., Ltd. | | Experimental |
| 4419 | aditeyabaral/natural-language-database-querying: A novel approach to data retrieval from tagged databases using only natural... | | Experimental |
| 4420 | astrologos/libri-scraper: The Public Audiobook Scraper downloads full audiobook MP3s from... | | Experimental |
| 4421 | imvladikon/wav2vec2-hebrew: Speech Recognition for Hebrew (using wav2vec2 models) | | Experimental |
| 4422 | icosane/alstroemeria: Create and translate subtitles for any video, complete with voiceover capabilities. | | Experimental |
| 4423 | DrewThomasson/ebook2audiobookEspeak: Easily create audiobooks with espeak in a Gradio GUI | | Experimental |
| 4424 | tuanio/e2e-asr-toolkit: E2E Speech Recognition Toolkit with Hydra and Pytorch Lightning | | Experimental |
| 4425 | vishal1patidar/TEXT-TO-SPEAK: 🔖Voices in 24 different languages. Add text🗨️ and listen👂 | | Experimental |
| 4426 | rupin/WrittenAudio: Written Audio uses Google Text to Speech engine and a configuration file to... | | Experimental |
| 4427 | techieinhouse/chatbot: Python ChatterBot using Flask and speech recognition from HTML5 | | Experimental |
| 4428 | BBC-Esq/Elegant-Audio-Transcriber: Extremely fast and accurate audio transcriber surpassing Whisper. Optimized... | | Experimental |
| 4429 | probablyagoodusername/vesper: Therapeutic audio pipeline. Faith meets science. Free, static, open source. | | Experimental |
| 4430 | collinsuen/Local-Whisper-STT-Windows11-ZH: Local GPU-Accelerated Chinese Speech-to-Text for Windows 11 (Whisper-based,... | | Experimental |
| 4431 | garconvacher/TextToSpeech_eBook: A test kit for eBook speech synthesis (EPUB + Kindle) | | Experimental |
| 4432 | Ponyu-dev/Unity-Sherpa-ONNX: Unity plugin for sherpa-onnx: offline TTS, ASR, and VAD with one-click setup | | Experimental |
| 4433 | atanu20/alan-ai-news-project: Here I build a conversational voice-controlled React news application using... | | Experimental |
| 4434 | ckull/SUKI: A Node.JS Discord bot | | Experimental |
| 4435 | YoRyan/obicaller: Talking caller ID for OBiTALK OBi200 and Raspberry Pi (or other Linux) | | Experimental |
| 4436 | djleamen/renamer: Utility to rename mp3 files based on speech content | | Experimental |
| 4437 | elvanselvano/streamlit-whisper: Empowering the visually impaired with equal financial access through... | | Experimental |
| 4438 | dongheehand/Tacotron-PyTorch: PyTorch implementation of Tacotron | | Experimental |
| 4439 | itscooleric/yap: Local-first speech I/O stack: privacy-preserving transcription, synthesis,... | | Experimental |
| 4440 | linseycurrie/NHS-Speech-Recognition-App: This was a group project created remotely over 7 days using Java, Spring,... | | Experimental |
| 4441 | Aprataksh/Python-Files: mic_py : Python 3 code for successful use of microphone on windows.... | | Experimental |
| 4442 | vault-42/AIND_DNN_Speech_Recognizer: End-to-end speech to text recognition | | Experimental |
| 4443 | Momotoculteur/Keyword-voice-recognition: Build keyword voice recognition using algorithms... | | Experimental |
| 4444 | Neil-001/audio-to-subtitle-translate: Easily convert speech to timed SRT subtitles and translated captions (Colab-ready) | | Experimental |
| 4445 | dcervantes/VoiceFlashcards: VoiceFlashcards is an innovative web app that helps users practice language... | | Experimental |
| 4446 | dpid/openclaw-voice-bridge: Hands-free voice interface for OpenClaw (Clawdbot). VAD-based PWA with... | | Experimental |
| 4447 | elizabethfuentes12/meta-ai-agent-sample-for-aws-agentcore: Voice AI agent for Ray-Ban Meta glasses using Amazon Bedrock AgentCore and... | | Experimental |
| 4448 | lmk123/cvox: Get spoken alerts when Claude Code needs permission or finishes a task, so... | | Experimental |
| 4449 | neurlang/whipstr: Whipstr ASR/STT System | | Experimental |
| 4450 | Epistates/rosellas: Automatic speech recognition (ASR) for Apple Silicon | | Experimental |
| 4451 | D34DC3N73R/ha-chatterbox-tts: Home Assistant TTS integration for Chatterbox-TTS-Server | | Experimental |
| 4452 | jagerzhang/FastTTS: A simple speech synthesis service based on edge-tts, supporting private deployment and seamless integration with the 源阅读 reading app. | | Experimental |
| 4453 | pstepanovum/Cadence: Open-source AI pronunciation coach with phoneme feedback, guided speaking... | | Experimental |
| 4454 | proger/uk: Phonograms and syntagms: processing tools | | Experimental |
| 4455 | umitkacar/transformer-asr-transcription: Real-time transformer-based ASR supporting 100+ languages - Google Cloud... | | Experimental |
| 4456 | MAXBAF1/SpoonEat: A mobile application for maintaining a balance in nutrition, with the... | | Experimental |
| 4457 | xi-Rick/captains-log: A voice transcription and logging web app built with TypeScript, Captain's... | | Experimental |
| 4458 | IDEA-Emdoor-Lab/UniTTS: A TTS Trained on Universal Audio. | | Experimental |
| 4459 | 1999AZZAR/Telegram-Bot-Playground: This repository is a playground for experimenting with several simple... | | Experimental |
| 4460 | LexicalStressDetection/lexical-stress-detection: Deep Learning model for lexical stress detection in spoken English | | Experimental |
| 4461 | asheghi/text-to-speech: Text to Speech | | Experimental |
| 4462 | atomiechen/funasr-client-ts: Really easy-to-use TypeScript client for FunASR runtime server. | | Experimental |
| 4463 | SVM0N/ttsweb: Convert PDFs/EPUBs to audiobooks with synchronized text highlighting using... | | Experimental |
| 4464 | vijethph/violet-speech: Violet is a Speech Assistant made using Python | | Experimental |
| 4465 | jinseok19/Intermediate_Level_Project_for_AI-X: 🤖AI+X leading-talent development intermediate-level project with KT & Sangmyung University🤖 | | Experimental |
| 4466 | florabtw/google-translate-tts: Node library for Google Translate TTS (Text-to-Speech) API | | Experimental |
| 4467 | TassAI/TASS-Android-UI: TASS Android UI is an open source Android application for using a remote... | | Experimental |
| 4468 | gheyret/uyghur-asr-transformer: Speech Recognition for Uyghur using Speech transformer | | Experimental |
| 4469 | HelgeSverre/glados: A web interface for GLaDOS text-to-speech with AI conversation capabilities | | Experimental |
| 4470 | jaypinho/transcript-accuracy: A Streamlit app to evaluate the accuracy of automatic speech recognition... | | Experimental |
| 4471 | baochuquan/ios-vad: iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN,... | | Experimental |
| 4472 | alorbach/open-video-transcribe: Open Video Transcribe - Open-source video transcription tool that emphasizes... | | Experimental |
| 4473 | kemsta/macloop: https://pypi.org/project/macloop/ | | Experimental |
| 4474 | SudharsanSaravanan/JARVIS: JARVIS (Just A Rather Very Intelligent System) is a voice-controlled,... | | Experimental |
| 4475 | bagustris/speech-recognition-course: Material for learning speech recognition, based on Microsoft teaching material on EdX | | Experimental |
| 4476 | smswg/FreeSwitch-Mod_FunAsr: FreeSWITCH... | | Experimental |
| 4477 | josharsh/terminal-voice: Voice input for the terminal. Speak, and it types. Local transcription,... | | Experimental |
| 4478 | SentimentalK/Reliquary: The best voice input, a Zero-Friction Bridge to Your AI Exobrain | | Experimental |
| 4479 | leminhnguyen/ai-speech-engineer-roadmap: A curated roadmap based on my 6 years of experience from zero to become a... | | Experimental |
| 4480 | rudhreeshkumaar/Speech-to-Text: Speech recognition and text transcription from file or microphone | | Experimental |
| 4481 | lane203m/SoundByte: U of R SSE Capstone Project; Recommending Music For Artists | | Experimental |
| 4482 | sahilmishra0012/prescription-generator: This project aims at generating the prescription dictated by the doctor in a... | | Experimental |
| 4483 | rahul6975/Helping-Voice: An Android application which completely works on voice input which helps... | | Experimental |
| 4484 | rapidaai/rapida-python: Open-source Python SDK for real-time Voice AI, voice agents, streaming... | | Experimental |
| 4485 | williamclavier/Multimodal-Classroom-Video-Recorder: A smart multimodal classroom video recording system that automatically... | | Experimental |
| 4486 | lcukerd/Blink-to-Text: Application converts eye blinks to text and hence helps paralysed people communicate. | | Experimental |
| 4487 | CrankZ/muyi: Local subtitle generation and translation with GPU acceleration support | | Experimental |
| 4488 | Winnie-Fred/text-to-speech: Text-to-speech web-based application using Django and Google Translate... | | Experimental |
| 4489 | xDoritox/Voice-Clone-Studio: 🔊 Clone and design voices easily with Voice Clone Studio, a web UI powered... | | Experimental |
| 4490 | PrathuashaKB/ASR-Using-Deep-Learning: Automatic Speech Recognition is a technique that processes human speech into... | | Experimental |
| 4491 | kiy0ni/auto-video-editor: A Python (Tkinter) tool that automatically generates highlights and... | | Experimental |
| 4492 | upstash/radio-hackernews: Audio Recap of Top Hackernews Stories | | Experimental |
| 4493 | joaoalvarenga/voice-assistant: An open-source Alexa-like complete voice assistant system, from speech... | | Experimental |
| 4494 | Mildemelwe/Japanese-Tacotron-2-notebook: Training notebook for Japanese TTS model with Tacotron 2 | | Experimental |
| 4495 | salehsargolzaee/Audio-Signal-Processing-and-Feature-Extraction: Feature extraction from audio signal (explained in Persian) | | Experimental |
| 4496 | Sajith171111/whisper: 🗣️ Transcribe your voice to text easily on macOS. Just hold **Fn**, speak,... | | Experimental |
| 4497 | ilya16/isp-tts: A simple TTS model developed for the Speech Synthesis and Voice Cloning... | | Experimental |
| 4498 | nsourlos/end-to-end_deepfake_colab: Create deepfake video by just uploading the original video and specifying... | | Experimental |
| 4499 | Muhib-Mehdi/ASL-Recognition-System: The ASL Recognition System is a real‑time American Sign Language (ASL)... | | Experimental |
| 4500 | sebinbenjamin/wav2vec_demo: A Python tool for transcribing speech from audio files using the Wav2Vec 2.0... | | Experimental |