All Voice AI Tools
8,165 tools ranked by quality score · Page 61 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 6001 |
DynamicDevices/audionews
AI-powered daily audio news digests for accessibility. Generates... |
|
Experimental |
| 6002 |
OnesAndZer0s/node-dectalk
Node.js module that provides bindings for the DecTalk Text-To-Speech library |
|
Experimental |
| 6003 |
francisalexj/Text-to-Speech
Google text to speech translation using php. |
|
Experimental |
| 6004 |
Hand-On-Web-Ltd/voice-cloning-demo
Voice cloning demo page using ElevenLabs text-to-speech API — Node.js + Express |
|
Experimental |
| 6005 |
SaniaBharadwaj/Enjo_AI
A locally-hosted AI Desktop Assistant capable of OS control, neural speech,... |
|
Experimental |
| 6006 |
hasancoded/voice-assistant
Voice Assistant powered by Google Gemini AI, with speech recognition and... |
|
Experimental |
| 6007 |
Ananya-Addisu/ihearmvp
made as a capstone project | powered by safaricom gebeya |
|
Experimental |
| 6008 |
Leonidius20/Lugat
Crimean Tatar-russian dictionary app for Android with offline text-to-speech... |
|
Experimental |
| 6009 |
ebowwa/LocalDiarizationSwiftExample
iOS example app demonstrating on-device speaker diarization using FluidAudio... |
|
Experimental |
| 6010 |
nesquikm/the_speech_to_text_button
A Flutter widget that provides a simple button for speech-to-text... |
|
Experimental |
| 6011 |
parthanand-in/Virtual-Assistant
Virtual Assistant like Alexa and Google Dot |
|
Experimental |
| 6012 |
FunnyValentine69/maidai-v3
Tsundere Japanese maid AI with voice chat, bilingual dialogue, and emotion... |
|
Experimental |
| 6013 |
Lefyd24/LunaEye
A futuristic, Siri-inspired AI voice assistant with real-time fluid visualization |
|
Experimental |
| 6014 |
farhan7727/AI-powered-voice-operated-E-Librarian-system
An AI-powered voice librarian that listens to a user, finds relevant books... |
|
Experimental |
| 6015 |
CodingWithRoshan/VaidyaVerse
AI-powered bilingual doctor assistant using Groq (Whisper + LLM) and... |
|
Experimental |
| 6016 |
ikegami-yukino/csj-eval
For evaluating speech recognition system using the Corpus of Spontaneous... |
|
Experimental |
| 6017 |
btseee/oron-tts
Vits based Mongolian (Khalkha) TTS language model |
|
Experimental |
| 6018 |
yotsuda/Speech
PowerShell modules for text-to-speech (TTS) and speech-to-text (STT) across... |
|
Experimental |
| 6019 |
jmrashed/ai-desktop-assistant
A Python-based AI desktop assistant designed to perform various tasks like... |
|
Experimental |
| 6020 |
mcp-tool-shop-org/soundboard-maui
Cross-platform .NET MAUI desktop client for the Sound Board voice engine. |
|
Experimental |
| 6021 |
StuMason/claude-tts
Text-to-speech for AI coding assistants. Give your AI a voice with emotional... |
|
Experimental |
| 6022 |
mcp-tool-shop-org/avatar-face-mvp
Real-time VRM avatar lipsync MVP — Godot 4 + FFT visemes + OpenSeeFace |
|
Experimental |
| 6023 |
rishikksh20/voxtral-codec-pytoch
Voxtral Codec : Combining Semantic VQ and Acoustic FSQ for Ultra-Low Bitrate... |
|
Experimental |
| 6024 |
RonanDavalan/PiperRead
Privacy-First Neural Text-to-Speech for Linux (Wayland & X11). |
|
Experimental |
| 6025 |
haya256/random-read-in-computer-voice-interval-cli
テキストファイルからランダムに1行を選び、一定間隔でmacOSの音声で読み上げる学習用CLIツール |
|
Experimental |
| 6026 |
hiansit/ankiflow
ブラウザで動く汎用暗記カードアプリ「AnkiFlow」。自動読み上げ(TTS)機能を搭載し、画面を見ない「聞き流し学習」にも対応しています。 |
|
Experimental |
| 6027 |
DevBytAmir/vocaudio
CLI tool to generate spoken vocabulary study audio from a JSON deck.... |
|
Experimental |
| 6028 |
Echoshard/AudiobookStudio
Desktop app for PocketTTS with voice cloning audiobook creation,... |
|
Experimental |
| 6029 |
Aryan-Pardeshi/Speech-To-Text-Selenium
Python tool using Selenium and Chrome’s Web Speech API for speech-to-text in... |
|
Experimental |
| 6030 |
rudil24/pdf-audio-reader
Javascript leveraging browser-native Web Speech API to convert any PDF to... |
|
Experimental |
| 6031 |
Muthu-Mkode/audify
An asynchronous Python desktop application that extracts text from PDFs and... |
|
Experimental |
| 6032 |
narmesh/shorts-video-automation
Automatically generate Shorts Video using AI, stock video, and TTS |
|
Experimental |
| 6033 |
michael-borck/slide-stream
Converts Markdown and PowerPoint files into AI-generated video presentations... |
|
Experimental |
| 6034 |
Ed94/Attic-Greek-TTS
idk if this is accurate. This was done in an afternoon. Used gemini 3 pro... |
|
Experimental |
| 6035 |
BdrGM/nova-multiai
Multiple AI personas inside Foundry VTT with chat + ElevenLabs TTS. Create... |
|
Experimental |
| 6036 |
hari7261/AgentPodcast-AI
PodcastAgent uses advanced text-to-speech technology to create... |
|
Experimental |
| 6037 |
r1cc4rd0m4zz4/qwen3-tts-cli
A command-line interface for generating high-quality speech using the... |
|
Experimental |
| 6038 |
jrtorrez31337/pyvs
Python Voice Synthesis - TTS/STT web app with voice cloning using Qwen3-TTS |
|
Experimental |
| 6039 |
yukihito-jokyu/qwen3-tts-mac-guide
【新人エンジニア向け】Mac環境でQwen3-TTSを使って音声合成環境を構築する手順ガイド。環境構築から日本語音声生成までステップバイステップで解説。 |
|
Experimental |
| 6040 |
KLIEBHAN/jetson-qwen3-tts
GPU-accelerated Qwen3-TTS server for NVIDIA Jetson devices |
|
Experimental |
| 6041 |
nanofatdog/TTS-STT-Web-Application-thai
เว็บแอปพลิเคชันสำหรับแปลงข้อความเป็นเสียง (TTS) และแปลงเสียงเป็นข้อความ... |
|
Experimental |
| 6042 |
hansjm10/ai-vtuber-companion
An intelligent AI companion system for Twitch streaming with VTube Studio integration |
|
Experimental |
| 6043 |
ChrisBrooksbank/Vox
Open-source screen reader for Windows 11 — built in C#/.NET 9 with UI... |
|
Experimental |
| 6044 |
itsdevcoffee/mojo-audio
Mojo audio library: FFI-enabled, pure Mojo DSP. |
|
Experimental |
| 6045 |
Alcidespb24/podcast-workflow
Automated pipeline: Obsidian markdown → AI podcast scripts → TTS audio → RSS... |
|
Experimental |
| 6046 |
navalnica/whisper-finetuning-be
Finetuning Whisper ASR model for Belarusian language |
|
Experimental |
| 6047 |
drwale-dev-labs/ai-auto-assistant
A voice-based AI agent designed for a car dealership. This agent can... |
|
Experimental |
| 6048 |
stephenombuya/Virtual-Personal-Assistant
Production-grade Python virtual assistant with full asynchronous support.... |
|
Experimental |
| 6049 |
Ramendan/BayanSynth-Studio
Vocaloid-style Arabic TTS desktop editor built with Electron + BayanSynthTTS |
|
Experimental |
| 6050 |
OVOSHatchery/ovos-tts-plugin-responsivevoice
responsive voice TTS plugin for mycroft |
|
Experimental |
| 6051 |
kaka-lin/rpi-voice-kit-app
Using app to control Voice Kit(smart speaker) |
|
Experimental |
| 6052 |
lordzuko/speech-editor
A streamlit based UI for editing speech |
|
Experimental |
| 6053 |
samimoftheworld/Voice-Activity-Detection-FInal-Project-work
this repository concedes my project work done in my bachelors |
|
Experimental |
| 6054 |
lormaechea/kaldi-grammar-compiler
A minimal tool that helps transforming fixed grammars into compiled Finite... |
|
Experimental |
| 6055 |
Secret-Ambush/voice_bot
Voice Controlled Turtlebot 2i - ROS 🤖 |
|
Experimental |
| 6056 |
nnnnnzo/AudioFileTranslator
*AFT is an audio translator nano framework* based on python who transcribe... |
|
Experimental |
| 6057 |
TornadoInsight/AI-Video-Transcriber
AI-Video-Transcriber is an intelligent, open-source tool that automatically... |
|
Experimental |
| 6058 |
ttsaigit/tts-js
JavaScript/Node.js SDK for TTS.ai API — text-to-speech, voice cloning, speech-to-text |
|
Experimental |
| 6059 |
ttsaigit/tts-python
Python SDK for the TTS.ai text-to-speech API |
|
Experimental |
| 6060 |
joaogabriel-sg/fale-por-mim
💻 Fale por mim is a web application in which the user can type their text or... |
|
Experimental |
| 6061 |
PratikDavidson/word-power-ai
Learn Communication Language Effectively with Pictorial Story. |
|
Experimental |
| 6062 |
usamireko/StableTTS-Training-Colab
A notebook created for training StableTTS models in Google Colab easily! |
|
Experimental |
| 6063 |
ABD-01/Android-Speech-Controlled-Assistance
Android Application for smart classroom, uses voice-based commands to... |
|
Experimental |
| 6064 |
enrelu/AITranslator
Gemini-powered Chrome extension for smart translations. Features... |
|
Experimental |
| 6065 |
afine907/ttspeech
A Promise tts api, it depend on browser api window.speechSynthesis |
|
Experimental |
| 6066 |
funkyfranky/TextToSpeechListener
UDP Client that listens for text messages and converts it to speech. |
|
Experimental |
| 6067 |
alwaz-shahid/whisper-asr-cli
Automatic Speech Recognition ASR / Speech To Text STT demonstration using... |
|
Experimental |
| 6068 |
Manan-49/SRT-GENERATOR
Offline desktop application for generating accurate subtitles (SRT) from... |
|
Experimental |
| 6069 |
LiZeC123/legado-tts-tencent
Tencent TTS for Legado Reader 基于腾讯语音合成API的Legado(开源阅读)TTS服务. |
|
Experimental |
| 6070 |
K4RT1K3Y4/nima
A simple chatbot written in python which takes in user audio input, displays... |
|
Experimental |
| 6071 |
Fadlay/Bard-Tuber
This code is designed to read chat messages from YouTube and Hearing from... |
|
Experimental |
| 6072 |
jetfontanilla/win-sapi-tts-audio-file-generator
using Win SpVoice Interface (SAPI) with python to generate audio files with... |
|
Experimental |
| 6073 |
naemazam/text-to-speak
Natural Reader is a professional text to speech program that converts any... |
|
Experimental |
| 6074 |
noAbbreviation/approxima
A command line program to loudly tell time (in chunks of 5 minutes). |
|
Experimental |
| 6075 |
diluteoxygen/AI-Chat-with-Speech
A Python Script that uses Cohere Language API and Elevenlabs Speech API to... |
|
Experimental |
| 6076 |
Gamingpro237/Project-Title-MASTER-GIFT-All-in-One-Speech-AI-Chatbot-Nexus
The ~MASTER-GIFT~ All-in-One Speech AI-Chatbot Nexus is an interactive... |
|
Experimental |
| 6077 |
shxfwaan/Multimodal-AI-Assistant-with-Face-Recognition-Emotion-Analysis-GPT-Based-and-Object-Detection-YOLOv8
The project introduces Maya, Multimodal AI Assistant with Face Recognition,... |
|
Experimental |
| 6078 |
10809104/taigi-speech-to-text
台語語音轉文字訓練資料集,資料來源:教育部《臺灣閩南語常用詞辭典》。 |
|
Experimental |
| 6079 |
sudoberlin/NLP_ND
Udacity Natural Language Processing Nanodegree. |
|
Experimental |
| 6080 |
mahshid1378/WhisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
|
Experimental |
| 6081 |
DrSensor/gospeak
TTS that can "speak as you type" using google translate (via simplytranslate.org) |
|
Experimental |
| 6082 |
amitybell/pikatts
Pika TTS is a small, local text to speech voice synthesizer package based on... |
|
Experimental |
| 6083 |
ZxBing0066/speech-recognition
demo for web speech-recognition api |
|
Experimental |
| 6084 |
skyradez/Speech-Recognition-using-Convolutional-Neural-Network
Tutorial on Speech Recognition using Convolutional Neural Network |
|
Experimental |
| 6085 |
Qianqian1220/HeartEcho
A voice-driven project capturing emotional echoes from the heart —... |
|
Experimental |
| 6086 |
cmauget/speech-to-text-benchmark
🗣️ Transcription (speech to text) d’échanges téléphoniques adressés au... |
|
Experimental |
| 6087 |
AnkushRathour/Audio-Visualization-and-Speech-Recognition
Convert audio to text using JavaScript, Speech To Text. |
|
Experimental |
| 6088 |
brailcom/voice-czech-ph
Czech diphone database for Festival: voice "ph" |
|
Experimental |
| 6089 |
AathifZahir/WhisprSplit
A powerful, local speech-to-text transcription system that combines OpenAI's... |
|
Experimental |
| 6090 |
Vatis-Tech/asr-client-js-html-js-example
How to use Vatis Tech with HTML & JavaScript. |
|
Experimental |
| 6091 |
shaikhsaif72/Jarvis-Voice-Assistant
A voice-activated virtual assistant using Python and OpenAI. |
|
Experimental |
| 6092 |
tapiaer22/Praximedes
The Praximedes project was developed to control LED lights (HappyLighting),... |
|
Experimental |
| 6093 |
hitthecodelabs/LLM_PersonalTARS
Asistente personal web con entrada por voz (STT), salida por voz (TTS) y... |
|
Experimental |
| 6094 |
AkishinoShiame/Virtual-Elderly-Chatbot-App
An Virtual Elderly Chatbot App Using Unity 5.6.2f1 |
|
Experimental |
| 6095 |
ahammedrohit/Speech-Recognition-using-wav2vec2-with-minimum-GPU
Python Colab for speech recognition with wav2vec2. Since wav2vec2 requires... |
|
Experimental |
| 6096 |
patarapolw/ttslib
TTS for local usage that works for all OS's, with a simple interface,... |
|
Experimental |
| 6097 |
kosmicteal/VoCatalogue
Standalone application that allows users to keep track of their Vocal... |
|
Experimental |
| 6098 |
Abood-devo/EV3-Automation
Lego ev3 brick automation |
|
Experimental |
| 6099 |
gas/pronunza-tts-galego-onnx-colab
Caderno de Colab para síntese de voz (TTS) en galego usando o modelo ONNX de Celtia |
|
Experimental |
| 6100 |
mathigatti/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model |
|
Experimental |