All Voice AI Tools
8,165 tools ranked by quality score · Page 47 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 4601 | dunkbing/text2audio: Simple TTS tool made with Fresh | | Experimental |
| 4602 | ZarredFelicite/parakeet-transcriber: An audio transcription tool using NVIDIA Parakeet, available as a CLI or... | | Experimental |
| 4603 | sonhm3029/Realtime-Vietnamese-ASR-React-Native-and-Whisper: This project implements end-to-end real-time Vietnamese speech recognition... | | Experimental |
| 4604 | ShunsukeHayashi/byteplus-voice-ai: BytePlus voice conversation AI application - integrates ASR, TTS, and Voice Cloning (WebSocket support, Japanese support ✅) | | Experimental |
| 4605 | neosapience/typecast-js: The official Node.js SDK for the Typecast API. | | Experimental |
| 4606 | KarinBrisker/Video-Subtitler: Automatically Generating Multilingual Subtitles Using OpenAI's Whisper and... | | Experimental |
| 4607 | kongju7/my_project6: Personal project 6: Speech Recognition Deep Learning Chatbot -... | | Experimental |
| 4608 | bitgineer/Speakeasy: Privacy-first local voice-to-text using Whisper AI. Cross-platform desktop... | | Experimental |
| 4609 | Caliope-SpeechProcessingLab/SpeechTester: Speech Tester is a set of Python scripts conceived as an extension to HTK... | | Experimental |
| 4610 | lispking/qwen3-tts-mlx: A simple and easy-to-use wrapper package for Qwen3 TTS based on MLX Audio.... | | Experimental |
| 4611 | my-north-ai/semantic_audio_filtering: Synthetic data augmentation technique via LLM for Automatic Speech... | | Experimental |
| 4612 | loserbcc/openclaw-gateway: Open-source WSS gateway for connecting phones to moltbots. Speaks OpenClaw... | | Experimental |
| 4613 | danijcom/whisper-telegram-bot: Simple Telegram bot for transcribing voice messages into text (STT) in... | | Experimental |
| 4614 | blastheart1/voice-ai-braincx: 🎤 Real-time voice AI conversational agent with LiveKit, FastAPI & React.... | | Experimental |
| 4615 | shashankchandak/AutoSMSReader: An Android application that allows users to have all incoming messages read aloud | | Experimental |
| 4616 | sudonitin/MediumScraper: Scrapes Medium articles and provides audio versions 📑 to 🔊 using Django | | Experimental |
| 4617 | zzpuser/SnapDict: macOS AI translation dictionary built on DeepSeek, with smart translation, root-word mnemonics, spelling correction, and read-aloud speech / AI-powered dictionary app... | | Experimental |
| 4618 | FarzadForuozanfar/Speech-Recognition: I recorded 10 voices with the same words from myself and compared them with... | | Experimental |
| 4619 | smcantab/speak11: Select text, press ⌥⇧/, hear it read aloud. macOS text-to-speech powered by... | | Experimental |
| 4620 | Usman-bin-Khalid/Jarvis-AI-Voice-and-Text-Assistant-Python-: Jarvis AI Voice & Text Assistant – A Python-based desktop AI assistant with... | | Experimental |
| 4621 | labrijisaad/Youtube-video-transcriptor: In this notebook, I implemented a script to transcribe YouTube videos (and... | | Experimental |
| 4622 | boltomli/speech-api: Demo showing how to use the Azure Speech Services API in an app | | Experimental |
| 4623 | Mohamed-Ashik-S/Speech-to-Text: A speech-to-text project that uses OpenAI's Whisper model. | | Experimental |
| 4624 | language-org/voice-activ-detect-deepnet: ASR: Light deep net for real-time voice activity detection | | Experimental |
| 4625 | Mohamedfat7i/local-voice-cloning-app: 🔊 Clone voices easily with this lightweight Python app that synthesizes... | | Experimental |
| 4626 | dusionlike/unplugin-string-to-audio: Automatically converts strings to audio files during bundling and adds them to the final bundle; supports Vite and Webpack | | Experimental |
| 4627 | chrismarquezz/voice-chess: An interactive chess app that lets you play and control the game entirely... | | Experimental |
| 4628 | adelacvg/DPTTS: An AR+AR TTS attempt. | | Experimental |
| 4629 | sandeepmukku12/vocodine: 🎙️ VocoDine: Book your table with your voice! Speak your booking details,... | | Experimental |
| 4630 | TakumiSenaha/Nreal_IoT: This project aims to visualize the sensor information of the surroundings... | | Experimental |
| 4631 | priyanshpsalian/VISION-THE-BLIND: An all-in-one solution for the safety and security of the blind. Features covered in... | | Experimental |
| 4632 | Kimosabey/vox-agent-neural: Neural Voice Agent core constructs for conversational AI. | | Experimental |
| 4633 | nsourlos/voice_cloning_tools: Various tools to clone a voice | | Experimental |
| 4634 | MaurerKrisztian/vrc-tts-osc: Text-to-Speech & AI Bot With OSC Integration | | Experimental |
| 4635 | guptakushal03/Virtual-Voice-Assistant: This Python script creates a voice-controlled desktop assistant capable of... | | Experimental |
| 4636 | dangvansam/nvidia-nemo-jasper-quartznet-asr-vietnamese: Vietnamese speech recognition using the QuartzNet model (NVIDIA) + Flask demo | | Experimental |
| 4637 | axzml/VoxLinkAI_Client: Native macOS voice input assistant. Hold a hotkey, speak, and let AI... | | Experimental |
| 4638 | leo01102/lumen: Lumen – Empathetic, multimodal AI assistant (face and voice) in real time.... | | Experimental |
| 4639 | Uknowme-h/Audiollect: Audiollect is a Notes to AudioBook web app built with the MERN stack, where... | | Experimental |
| 4640 | vaibhav-init/AskCrow: Voice Bot using Gemini Model | | Experimental |
| 4641 | vishishttiwari/Android_Application_for_understanding_ASL_using_gesture_recognition: An Android Application that uses gesture recognition to understand alphabets... | | Experimental |
| 4642 | mohammad-zolghadr/Pro-Todo: A professional todolist that stores information in local storage and uses... | | Experimental |
| 4643 | swarnayuroy/Web-Automation-using-speech-recognition: Generates results in a web browser, i.e. automated after the user speaks out the... | | Experimental |
| 4644 | ArielDelRio/evernote-clone: Notes App is an application to record notes and store them in the cloud in... | | Experimental |
| 4645 | LEMAS-Project/LEMAS-Project: LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with... | | Experimental |
| 4646 | egorsmkv/w2v2-bert-aligner: Aligner for wav2vec2-bert models | | Experimental |
| 4647 | taufiq-ai/Bengali-AI-Recieptionist: An AI Receptionist Flask app with STT, TTS, FaceRecognition,... | | Experimental |
| 4648 | ccj242/Audible-Deaf-Communications: A non-profit app designed to help the deaf communicate in person and... | | Experimental |
| 4649 | kamya-ai/Talk2Text-Live: "Talk2Text Live" is a cutting-edge project that harnesses the power of... | | Experimental |
| 4650 | alwalid54321/AI-Voice-Assistant: A modern voice assistant built with React, TypeScript, and the Hugging Face... | | Experimental |
| 4651 | theimpossibleastronaut/pennyworth: Voice-recognition-based digital home assistant in progress. Quite unusable... | | Experimental |
| 4652 | metacore-stack/vocalcanvas-studio: Craft expressive speech from text using a streamlined pipeline of voices,... | | Experimental |
| 4653 | cagataygedik/TTS: Internship Text-to-Speech research project. | | Experimental |
| 4654 | itsanthonio/Vision-To-Speech: A vision to speech project | | Experimental |
| 4655 | sancliffe/ollama-STT-TTS: A simple, hands-free Python voice assistant that runs 100% locally. This... | | Experimental |
| 4656 | YIZHUANG/InstrumHack: For the Tieto hackathon 2018, to improve Finnish people's financial well-being | | Experimental |
| 4657 | RafaelCenzano/Marvin-v3-client: Marvin Version 3 client version | | Experimental |
| 4658 | Mrzhangxiaoduo/react-native-speech-recognizer: react-native-speech-recognizer | | Experimental |
| 4659 | Jmi2020/HowdyVox: A privacy-focused offline STT/TTS interface for your favorite LLM | | Experimental |
| 4660 | zainibaloch/Quran-App---All-in-one: A fully responsive Next.js 13 Quran web app with audio recitation,... | | Experimental |
| 4661 | lkwbr/structured-prediction: Machine learning algorithms for structured inputs and outputs, such as on... | | Experimental |
| 4662 | Adexandria/TextToSpeechAPI: A REST API that converts a text image to an mp3 file. The text image can... | | Experimental |
| 4663 | geniusrise/audio: Audio components for the geniusrise framework | | Experimental |
| 4664 | sobrunmoksesh/Intellifacts_Android_Project: An application that allows you to read facts. It includes voice interaction... | | Experimental |
| 4665 | thewh1teagle/phonikud-assistant: Local AI assistant in Hebrew with Phonikud ✨ | | Experimental |
| 4666 | daniel-szulc/Speech_Recognition: 🎙 Automatic Keyword Speech Recognition for Polish and English in TensorFlow 🧠 | | Experimental |
| 4667 | ctoth/Qlatt: Explainable WebAudio Klatt formant synthesizer with declarative TTS frontend... | | Experimental |
| 4668 | jqi41/Subrank: ICASSP 2020 | | Experimental |
| 4669 | falniak95/TurkishSpeechRecognition: Fully Turkish speech detection system. Google Cloud Platform API support... | | Experimental |
| 4670 | spacelatte/Basic-Digital-Signage: This is an Android application that serves as simple digital signage... | | Experimental |
| 4671 | oarthurfc/AI-outgoing-call: An intelligent voice agent that automatically calls leads, promoting... | | Experimental |
| 4672 | dantasl/parrot-ai: This is a proof of concept that generates speech based on parameters... | | Experimental |
| 4673 | algorithmio/accent-conversion-ai: Real-time accent conversion during phone calls using Twilio, Deepgram, and... | | Experimental |
| 4674 | thirteenkai/bob-plugin-qwen-tts: Bob TTS plugin - speech synthesis using Alibaba Cloud's Qwen3-TTS-Flash model, with support for 45+ voices | | Experimental |
| 4675 | peterxubuaa/Voice-Assistant: Voice Assistant | | Experimental |
| 4676 | dwain-barnes/DeepSeek-Thinking-TTS: Listen to DeepSeek's thinking process in real-time! This script converts... | | Experimental |
| 4677 | tubexchat/interpreter-zh2en-gemini: An interpreter web app between Chinese and English that is powered by Gemini-2.0-flash | | Experimental |
| 4678 | ahmedoubadi/kokoro-tts: Open-source Kokoro-TTS API server (FastAPI) and web UI (React) for... | | Experimental |
| 4679 | RGonza1529/Nura: A Full-Stack React/Node.js AI-powered web application that provides... | | Experimental |
| 4680 | apluka34/audio-crawler: A tool for crawling and creating audio datasets | | Experimental |
| 4681 | sagar-alias-jacky/F.R.I.D.A.Y: A basic but fun virtual assistant made using Python | | Experimental |
| 4682 | Yuanshi9815/LiteFocus: [Interspeech 2024] LiteFocus is a tool designed to accelerate... | | Experimental |
| 4683 | Amiannn/Simple-HmmGmm: Simple HMM implementation | | Experimental |
| 4684 | nathanyaqueby/roche-dementia-hackathon: AI and AR-based digital memory lane and cognitive stimulation for dementia patients | | Experimental |
| 4685 | OldBonhart/TensorFlow_Speech_Recognition_Challenge: TensorFlow Speech Recognition Challenge -... | | Experimental |
| 4686 | tez3998/audio-output-to-text: Offline transcription of audio output from speakers or headphones, using VOSK | | Experimental |
| 4687 | ilya16/speech-synthesis-course: An introduction course on Speech Synthesis and Voice Cloning (Skoltech ISP'25) | | Experimental |
| 4688 | bobbymay/Dictation-for-macOS: Speech Recognition for macOS that allows you to define words, phrases, or... | | Experimental |
| 4689 | Zhang-Nian/Intelligent_CustomerService: Speech recognition, speech synthesis, intelligent dialogue | | Experimental |
| 4690 | saharmor/EchoScribe: Local AI transcription workspace with cloud APIs (OpenAI Whisper) or local... | | Experimental |
| 4691 | derpeloper/ostinato: Giving a voice to the voiceless. | | Experimental |
| 4692 | LiBinZyu/VAI: Implement highly precise natural language voice control in any Unity... | | Experimental |
| 4693 | jitendrakw09/Voice-Sangam: Voice Sangam is a modern text-to-speech platform built with Next.js 16,... | | Experimental |
| 4694 | YoussefBechara/Enhanced-Custom-ChatBot: Custom-built AI chatbot using Hugging Face's AI, enhanced with features such... | | Experimental |
| 4695 | ashwin2k/LibraAI: Libra.AI is a women's safety-focused voice-activated A.I. assistant Android... | | Experimental |
| 4696 | Kunal-Kumar-Sahoo/iCompanion-AssistanceMadeSimple: This is a Python 3-based virtual assistant developed for Computer Science... | | Experimental |
| 4697 | imsanjoykb/Speech-NLP-Bootcamp: Speech NLP Bootcamp | | Experimental |
| 4698 | Aslm-Fawzy/Speech-Recognition-Using-Raspberry-Pi: Simple Speech Recognition Program Run on Raspberry Pi | | Experimental |
| 4699 | NEURASCOPE/neurascreen: Automate product tour videos with JSON scenarios. Real browser recording, AI... | | Experimental |
| 4700 | ErolOZKAN-/TurkishSpeechRecognition: Turkish Speech Recognition Project | | Experimental |