All Voice AI Tools
8,165 tools ranked by quality score · Page 70 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 6901 |
egorsmkv/whisper-ukrainian
Trainer and Evaluation scripts for fine-tuning Whisper models for the... |
|
Experimental |
| 6902 |
mandar3051982/tiny-tts
Deliver natural English speech with an ultra-lightweight, end-to-end... |
|
Experimental |
| 6903 |
NAJL123/voice-ai-assistant
Local Voice AI Assistant — faster-whisper STT + Ollama LLM + pyttsx3 TTS |
|
Experimental |
| 6904 |
kyugakai/NeuraVoice
🗣️ Elevate your workflow with NeuraVoice, an AI desktop assistant that... |
|
Experimental |
| 6905 |
rutchanon17493/Sakura-Voice
Build real-time, low-latency voice assistants supporting 23 Indian languages... |
|
Experimental |
| 6906 |
H0NEYP0T-466/NeuralMate
NeuralMate 🤖 is your smart AI personal assistant 🧠, built to help you work... |
|
Experimental |
| 6907 |
RighteousW/sign_avatar
Real-time bidirectional translation between speech and Namibian Sign... |
|
Experimental |
| 6908 |
rizwiz104/voicely
Coach structured answers in real time during mock interviews with question... |
|
Experimental |
| 6909 |
mtepenner/vi
Meet Vi, a modular, voice-activated AI assistant built in Python. It... |
|
Experimental |
| 6910 |
ChanikyaSaiL/VoicePay
Voice & face-based secure payment and authentication platform with real-time... |
|
Experimental |
| 6911 |
Subhas6033/Talk2Hire
Talk2Hire is an AI-powered hiring platform for secure online interviews with... |
|
Experimental |
| 6912 |
89891383/Polish-Kick-TTS
🎙️ Darmowy system Text-to-Speech dla polskich streamerów Kick.com. Łatwa... |
|
Experimental |
| 6913 |
maritza310308/audiobook-toolkit
🎧 Manage your audiobooks efficiently with this toolkit that converts Audible... |
|
Experimental |
| 6914 |
naseem1amjad/Python-AI-VoiceChatGPT
Use ChatGpt (openAi) by Voice i.e. using text to speech and speech to text.... |
|
Experimental |
| 6915 |
cameroncruz/dog-voicebot
Voice-enabled dog chatbot for emotional therapy. 🐶 |
|
Experimental |
| 6916 |
adrianwedd/spark
SPARK — a Claude-powered robot companion for a neurodivergent kid. Built on... |
|
Experimental |
| 6917 |
MendoLeo/tts-dataset-pipeline
Democratizing speech technology: the simplest way to create custom TTS and... |
|
Experimental |
| 6918 |
deepgram-devs/dg-sagemaker
Example code to call Deepgram APIs on Amazon SageMaker |
|
Experimental |
| 6919 |
Emmanuel-PaulMaah/liguscribe
Real-time courtroom transcription |
|
Experimental |
| 6920 |
asainov1/voice-ai-agent
Voice cloning pipeline for AI agents — F5-TTS zero-shot inference, Whisper... |
|
Experimental |
| 6921 |
Lishadsza/my-city-speaks
My City Speaks is an innovative web application that combines AI-powered... |
|
Experimental |
| 6922 |
Fencelineanapsid199/music-scribe
Analyze any YouTube track's audio to extract key, BPM, chords, time... |
|
Experimental |
| 6923 |
manchenkoff/python-assistant
Simple GUI application to emulate voice assistant workflow [just for fun] |
|
Experimental |
| 6924 |
Shantika123/Jarvis
Developed a Python-based virtual assistant that performs voice-controlled... |
|
Experimental |
| 6925 |
Daliaalkilani/Sign-Language-Translator
A Python-based system for real-time two-way translation between sign... |
|
Experimental |
| 6926 |
bmwasaru/kiswahili-speech-normalization
Kiswahili text normalization utilities for speech datasets (ASR/TTS) |
|
Experimental |
| 6927 |
alijavid110/SeeSense-AI
👁️🗨️ Empower vision with SeeSense-AI, a browser-based tool that enhances... |
|
Experimental |
| 6928 |
Vasanth2005kk/VoxLibri
VoxLibri: The Ultimate AI-Powered eBook to Audiobook Converter. 🎧📚 Transform... |
|
Experimental |
| 6929 |
voothi/20250902105308-anki-no-tts
A simple Anki add-on to globally disable all Text-to-Speech (TTS) playback |
|
Experimental |
| 6930 |
iLuiz07/DesiYatra
✨ Streamline your travel with DesiYatra, an AI system that negotiates local... |
|
Experimental |
| 6931 |
Jaya30102003/Voice-Assistant-for-Blind
A web-based voice assistant that empowers visually impaired users to perform... |
|
Experimental |
| 6932 |
Verma-Siddharth/empathy-engine
AI-powered TTS that detects emotion and modulates voice — speed, pitch — to... |
|
Experimental |
| 6933 |
funkyfranky/TTS-Radio
Create voice overs with radio effects for DCS |
|
Experimental |
| 6934 |
metacore-stack/Voice-to-Insights
Enterprise AI platform that transforms audio meetings into structured... |
|
Experimental |
| 6935 |
codekraft-studio/react-speech
A simple React component to deal with browser SpeechRecognition |
|
Experimental |
| 6936 |
kingjethro999/silero-test
Made Silero Hostable for api requests |
|
Experimental |
| 6937 |
namphung134/ASR-Vietnamese
Fine-tuning the openai/whisper-small model on the 250h dataset for... |
|
Experimental |
| 6938 |
AnshGaikwad/Personal-Voice-Assistant
Personal Voice Assistant: Easy to change the code and making it suitable for... |
|
Experimental |
| 6939 |
Diluksha-Upeka/Voxis
Voxis is an intelligent voice assistant powered by Groq's AI models,... |
|
Experimental |
| 6940 |
jaychampaneri14/voice-to-video-avatar
Convert voice/text to animated avatar video |
|
Experimental |
| 6941 |
metacore-stack/AuraVoice
Production-grade on-device AI meeting assistant featuring real-time... |
|
Experimental |
| 6942 |
Rumeysakeskin/ASR-Quantization
Post-training quantization on Nvidia Nemo ASR model |
|
Experimental |
| 6943 |
siddbhatt18/30-days-of-voice-agents
Murf AI's 30 Days of AI Voice Agents Challenge |
|
Experimental |
| 6944 |
RedDotz20/speech-to-text-recognition
🎤 Effortlessly integrate speech recognition capabilities into your React... |
|
Experimental |
| 6945 |
harlanx/voice_recorder_recognizer
An audio recorder and speech to text with commands recognition created using... |
|
Experimental |
| 6946 |
allvoicelab/allvoicelab
AI-powered audio creation platform offering TTS, Voice Cloning, Voice... |
|
Experimental |
| 6947 |
joachimhodana/rtTranslator
Simple overlay for Windows, that listens for background sound and translates... |
|
Experimental |
| 6948 |
madebyaris/dsw-voice
Real-time voice noise reduction app for macOS with virtual microphone support |
|
Experimental |
| 6949 |
m-mohsin-ali/closed-captioning-azure-speech-ai
This project demonstrates how to use Azure Cognitive Services with a... |
|
Experimental |
| 6950 |
Shubham8831/Article-to-Audio
An AI-powered web application that converts articles and URLs into... |
|
Experimental |
| 6951 |
Her-mia/Imgspeaker
An Android app written in Kotlin that performs OCR on Simplified Chinese... |
|
Experimental |
| 6952 |
labestia2/Qwen3-Audiobook-Converter
🎧 Convert various document formats into high-quality audiobooks with Qwen3... |
|
Experimental |
| 6953 |
wangjialiang678/speaklow-macvoiceinput
SpeakLow — a lightweight macOS menu bar app for voice-to-text input. Press a... |
|
Experimental |
| 6954 |
quochuy242/VNAVC
Data Pipeline for Text to Speech Project |
|
Experimental |
| 6955 |
RamirJunior/idox-ia-project
Projeto MVP com processamento de áudio com IA local |
|
Experimental |
| 6956 |
nipponjo/tts-german-pytorch
🎙️ German TTS (FastPitch) with Thorsten voice / emotional |
|
Experimental |
| 6957 |
upskaling/voice-keyboard
an interface for nerd-dictation in gtk |
|
Experimental |
| 6958 |
duanxianpi/AI-Voice-Diary
Using voice to keep a journal. |
|
Experimental |
| 6959 |
max-lt/voxtral-cpp
Local implementation for voxtral |
|
Experimental |
| 6960 |
kjanjua26/HearPapers
HearPapers allows you to listen to PDFs (by converting them to audiobooks,... |
|
Experimental |
| 6961 |
sammwyy/chat-tts
Chat TTS for your streams. |
|
Experimental |
| 6962 |
rk-vashista/TTS-Story_Generator
A versatile app that converts images into short stories and lifelike audio... |
|
Experimental |
| 6963 |
mzhang027/Gemini-Live-TTS
🎤 Transform text into natural-sounding speech with Gemini-Live-TTS, offering... |
|
Experimental |
| 6964 |
SelimHorri/txt-to-speech-funny-random-jokes
Consume random jokes APIs and make them as a speech |
|
Experimental |
| 6965 |
appsdothingsiguess/LocalStream-Transcriber
Transcribe local files and browser streams (Canvas, YouTube, and more) using... |
|
Experimental |
| 6966 |
chandankumarm55/Evolve-ai
future - image based answer , UI Improvements , youtube link based summary |
|
Experimental |
| 6967 |
JonPark0/web_audio_splitter
AI-powered audio source separation using Meta Demucs - Split songs into... |
|
Experimental |
| 6968 |
StrawTe/Comfyui-HAIGC-QwenTTS
🎤 Generate and customize voices with ComfyUI HAIGC Qwen3TTS, integrating... |
|
Experimental |
| 6969 |
quangkhai5122/signlanguagetrans
The application is deployed on the web of the ASL_Pytorch project, with... |
|
Experimental |
| 6970 |
SMIL-SPCRAS/DAVIS
Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle... |
|
Experimental |
| 6971 |
DOLMA-NLP/asr
Automatic Speech Recognition for Low-Resourced Middle Eastern Languages -... |
|
Experimental |
| 6972 |
manhph2211/ViTTS
In this repo, I developed a step-by-step pipeline for a standard... |
|
Experimental |
| 6973 |
kiraping1337/ChatTwitchTTS
Twitch TTS бот с клонированием голоса через XTTS v2. Озвучивание сообщений... |
|
Experimental |
| 6974 |
strcoder4007/S2S-Lipsync-UnrealAvatar-Backend
Unreal Metahuman Conversation Speech to Speech backend and frontend. |
|
Experimental |
| 6975 |
Srinath-N-R/IPA-Wav2Vec2-Phoneme-Recognition
End-to-end IPA-based phoneme recognition pipeline using Wav2Vec2, featuring... |
|
Experimental |
| 6976 |
oddvoices/oddvoices
An indie singing synthesizer |
|
Experimental |
| 6977 |
Irham-Azka17/AI-Audio-Transcriber
Transcribe offline audio recordings quickly with AI-powered, privacy-focused... |
|
Experimental |
| 6978 |
Karan36k/text2speech
A Basic But Useful Online Text to Speech Converter with a male voice... |
|
Experimental |
| 6979 |
di37/speech-to-text-fine-tuning-on-unseen-language
This projects aims to show how whisper model can be fine-tuned on language... |
|
Experimental |
| 6980 |
HealSpeak/HealSpeak-App
A free of cost Triage Assistant, this is the HealSpeak app. |
|
Experimental |
| 6981 |
hannabdul/etf4asr
Official repo for the paper "An Effective Training Framework for... |
|
Experimental |
| 6982 |
LauraKokkarinen/AzureAI.TextToSpeech
A console application for converting long-form plain-text files into speech... |
|
Experimental |
| 6983 |
Aryan9inja/Krishi-Setu
Voice-based AI system helping farmers access agricultural guidance via phone... |
|
Experimental |
| 6984 |
jfainberg/sincnet_adapt
Raw waveform adaptation with SincNet |
|
Experimental |
| 6985 |
YossefMohamed/covid-app-api
An Api for testing covid using cough sound |
|
Experimental |
| 6986 |
dom96/texttospeech
A Nim client for the Google Cloud Text to Speech API. |
|
Experimental |
| 6987 |
RutronikSystemSolutions/RDK3_BLE_EnOcean
Project used to illustrate how to use a RDK3 to interact with EnOcean BLE... |
|
Experimental |
| 6988 |
QuantumBeto/chines
🎤 Convert spoken Chinese into pinyin with this simple voice recognition... |
|
Experimental |
| 6989 |
unicodeveloper/voicery
Play with voices. Speak any language. Clone your vibe. |
|
Experimental |
| 6990 |
vshmyhlo/listen-attend-and-speell-pytorch
Implementation of Automatic Speech Recognition inspired by "Listen, Attend... |
|
Experimental |
| 6991 |
Maidana0/My-App
FullStack App - NextJs 14 - Nest JS - Deployment |
|
Experimental |
| 6992 |
dgop92/speech2diet
FitVoice/Speech2Diet is an application that allows people to track their... |
|
Experimental |
| 6993 |
Giuseppe-Della-Corte/IESTAC
A corpus that can be used to train English-to-Italian End-to-End... |
|
Experimental |
| 6994 |
akhilachiju/AI-Audio-Transcriber
Audio transcription app using Whisper AI for accurate speech-to-text... |
|
Experimental |
| 6995 |
nakshatra-garg/rvc-no-gui
Headless RVC voice cloning & training pipeline - Train and run voice... |
|
Experimental |
| 6996 |
Kiran8053/Speech-Emotion-Recognition
This project focuses on real-time Speech Emotion Recognition (SER) using the... |
|
Experimental |
| 6997 |
Himanshi-2519/Speech-To-Text-API
Capturing the Rhythm of your words. Real-time AI transcription with a... |
|
Experimental |
| 6998 |
AnjaneyaBhardwaj/Deafine_Frontend
A real-time audio transcription web application designed to make... |
|
Experimental |
| 6999 |
pukaa900/reagana
Ko taqaku konqamatuqa mo nqaaqaku meqa. |
|
Experimental |
| 7000 |
karim23657/ParsiGoo
ParsiGoo is a Persian multispeaker dataset for text-to-speech purposes. It... |
|
Experimental |