All Voice AI Tools
8,165 tools ranked by quality score · Page 54 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 5301 |
phil1px/voice-message-transcriber
An iOS share-action extension that transcribes voice messages using Google... |
|
Experimental |
| 5302 |
voothi/20251228104300-subtitles
This repository is dedicated to preparing subtitles as part of working with... |
|
Experimental |
| 5303 |
malob/serverless-tts-podcast
WIP rewrite of article-to-audio-cloud-function and... |
|
Experimental |
| 5304 |
cheeweijie/qwen3-tts-lora-finetuning
Qwen3‑TTS LoRA fine‑tuning tools (companion repo) for custom voice adaptation |
|
Experimental |
| 5305 |
itsmemotivist/qwen-tts2api
🗣️ Enable text-to-speech with Qwen TTS, a simple API solution that... |
|
Experimental |
| 5306 |
arrdel/voice-assistant
Python script that utilizes natural language processing (NLP) and machine... |
|
Experimental |
| 5307 |
mfirozahmed/iTranslator
Project using OCR and TTS |
|
Experimental |
| 5308 |
BertanDogancay/Multi-Functional-AI-Assistant
An advanced AI assistant that can make object detections and uses dialogpt... |
|
Experimental |
| 5309 |
safikhanSoofiyani/VoicePrescription
An android application that uses speech to text functionality to produce... |
|
Experimental |
| 5310 |
naver/multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual... |
|
Experimental |
| 5311 |
lyle-mlengineer/timesnap
A web service for extracting timestamps from youtube videos. |
|
Experimental |
| 5312 |
marklubin/kairix
Voice-first AI agent with persistent memory, background reflection, and... |
|
Experimental |
| 5313 |
wyatt-avilla/discord-tiktok-tts-bot
discord bot that can play tiktok tts in voice |
|
Experimental |
| 5314 |
chalotrasahil/AI-Lecture-Studio
AI Lecture Studio is an NLP-driven system that transforms audio and video... |
|
Experimental |
| 5315 |
krishn1122/voice-agent-local
Specially designed for AI Team |
|
Experimental |
| 5316 |
ryanfb/ancientgreekspeak
Transliterate Ancient Greek to Apple phonemes for text-to-speech synthesis |
|
Experimental |
| 5317 |
ADT109119/WhisperX-GUI
一個使用者友善的圖形介面,用於輕鬆調用 WhisperX,這是一個提供精確轉錄、強大語者分離和詞級時間戳對齊的自動語音辨識 (ASR) 工具。此 GUI... |
|
Experimental |
| 5318 |
incubated-geek-cc/whisper-onnx
A Vite-ReactJS setup to run Whisper OpenAI models locally to transcribe... |
|
Experimental |
| 5319 |
samuelebh/CNN-Spoken-Digit-Classifier
Repository containing Python code of a classifier that recognizes spoken... |
|
Experimental |
| 5320 |
PhysisVerse/physis-vad-swift
Modular Swift package for on-device voice activity detection on Apple... |
|
Experimental |
| 5321 |
SuJun-Hub/voiceId
借鉴CapsWriter修改的windows端语音输入工具 |
|
Experimental |
| 5322 |
8G6/rtts
rtts is an open source JavaScript package for text to speech conversion |
|
Experimental |
| 5323 |
fann1993814/whisper.cpy
Python wrapper for Whisper.cpp |
|
Experimental |
| 5324 |
terkelg/utters
Small (257B) promise wrapper for SpeechSynthesisUtterance |
|
Experimental |
| 5325 |
MahtaFetrat/Mana-Forced-Aligner
A robust forced alignment tool for low-resource languages using multiple ASR... |
|
Experimental |
| 5326 |
zhangmei126/TextToSpeech
UE4 集成TTS文字转语音,使用SAPI5.3版本 |
|
Experimental |
| 5327 |
1abhishekpandey/FastScribe
Fast parallel video-to-text transcription powered by OpenAI's Whisper AI. |
|
Experimental |
| 5328 |
aristech-de/tts-clients
Clients to communicate with the Aristech TTS service |
|
Experimental |
| 5329 |
leanhtech/TextToSpeech_EN_VN
Đồ Án Text To Speech (Môn Hệ Điều Hành - PTITHCM) |
|
Experimental |
| 5330 |
mym-br/gnuspeech_sa
Articulatory speech synthesizer |
|
Experimental |
| 5331 |
wenhuahuo/Cross-Device-Acoustic-Communication-Python-Implementation
Digital acoustic communication tools using QFSK and Convolutional Encode. 跨设备声学通信。 |
|
Experimental |
| 5332 |
cowdude/flapi
FLAPI is an offline, containerized speech recognition websocket API |
|
Experimental |
| 5333 |
1ytic/edit-distance-papers
A curated list of papers dedicated to edit-distance as objective function |
|
Experimental |
| 5334 |
Wonbin-Jung/e3-vits
Official GitHub page of E3-VITS |
|
Experimental |
| 5335 |
iamarunbrahma/smart-voice-assistant
A simple voice assistant to get your queries in speech format and generate... |
|
Experimental |
| 5336 |
marttirandma/tipi
Tipi Web v2 |
|
Experimental |
| 5337 |
cjbayron/audiate
Ear training game using machine learning models in the browser |
|
Experimental |
| 5338 |
ChrisRobinT/realtime-translation
Real-time WebRTC voice translation using Whisper STT, Azure Translate, and... |
|
Experimental |
| 5339 |
asrajeh/kaldi-arabic
HHM-based Arabic ASR using Kaldi engine |
|
Experimental |
| 5340 |
kowaalczyk/reformer-tts
An adaptation of Reformer: The Efficient Transformer for text-to-speech task. |
|
Experimental |
| 5341 |
IRSPlays/ProjectCortexV2
A $300 wearable that gives visually impaired users real-time scene... |
|
Experimental |
| 5342 |
kevinjalbert/spellspoon
Spellspoon is a macOS tool built using Hammerspoon that enables... |
|
Experimental |
| 5343 |
WaelShaikh/OmniVerse-Desktop
OmniVerse-Desktop is your local LLM based AI assistant that integrates... |
|
Experimental |
| 5344 |
anubhav-n-mishra/xtts-api
Production-ready Text-to-Speech API with XTTS-v2, voice cloning,... |
|
Experimental |
| 5345 |
jp1924/HF_builders
🤗 Datasets의 builder script를 모와둔 repo |
|
Experimental |
| 5346 |
marcogenna/epub2audiobook
Convert EPUB books to M4B audiobooks with AI-powered TTS (Edge TTS, Kokoro, Piper) |
|
Experimental |
| 5347 |
fulviodenza/go-gladia-client
Client Go for Gladia APIs |
|
Experimental |
| 5348 |
AryanVBW/AiVoiceClone
Transform Your Voice: Replicate Your Unique Sound in a Pristine Pre-Trained... |
|
Experimental |
| 5349 |
SyedHuzaifa007/Robbie-12.20-Personal-Virtual-Assistant
It is a Speech Recognition Personal Virtual Assistant made with Python that... |
|
Experimental |
| 5350 |
sandeepswain54/Yukti-Care
Yukti Care is a mobile app that enables pharmacies, medical distributors,... |
|
Experimental |
| 5351 |
cydanix/voice-agent
Real-time voice AI assistant |
|
Experimental |
| 5352 |
Aketirani/audio-mnist
Gender Recognition By Voice Analysis |
|
Experimental |
| 5353 |
theablemo/Voice-Captcha-Verification
This repository contains the code for the Captcha Verification by voice... |
|
Experimental |
| 5354 |
Nexdata-AI/100-Hours-Thai-Children-Spontaneous-Speech-Data
Thai Child's Spontaneous Speech Data |
|
Experimental |
| 5355 |
Fdr3iZzz/YoutubeVideoTranslate
Get a translated YouTube video with AI voiceover |
|
Experimental |
| 5356 |
RumitPatel/android-continues-speech-recognition
This project is a demonstration to continues recognition of speech using... |
|
Experimental |
| 5357 |
traderpedroso/xphoneBR
XphoneBR is a Brazilian portuguese transformer base grapheme-to-phoneme and... |
|
Experimental |
| 5358 |
CrispStrobe/CrispTTS
(wip) python command-line Text-to-Speech (TTS) tool esp. for German,... |
|
Experimental |
| 5359 |
chihakuro/attendance-check
Face recognition for attendance checking system |
|
Experimental |
| 5360 |
NhanPhamThanh-IT/Vietnamese-Voice-Search-Engine
🔎 Vietnamese Voice Search Engine - Vietnamese news search app with voice... |
|
Experimental |
| 5361 |
Davi20044/Chat-de-Voz-GPT-3.5
Este projeto consiste em um assistente de conversação que utiliza a... |
|
Experimental |
| 5362 |
kundan-6646/Musica
Musica is an online audio splitter. It works with the power of AI which... |
|
Experimental |
| 5363 |
WinsDominoes/sanskrit-tts
Sanskrit Text-To-Speech Web-App - Made this for my Sanskrit Learning Journey |
|
Experimental |
| 5364 |
HKAB/vietnamese-rnnt-tutorial
A tutorial on how to train RNN-T from scratch with Whisper encoder |
|
Experimental |
| 5365 |
shesuyo/isi
alibaba 智能语音交互(Intelligent Speech Interaction) GO SDK |
|
Experimental |
| 5366 |
uigiporc/icon-sr
Progetto di Ingegneria della conoscenza, autori: Porcelli Luigi, Nicolo Cucinotta. |
|
Experimental |
| 5367 |
rgychiu/docbot
Personal doctor bot for all your common medical needs. |
|
Experimental |
| 5368 |
IHKYoung/AhaTTS
TTS Fast Web,一个简单优雅的本地文字转语音的前端与API接口。A localized, cross-platform,... |
|
Experimental |
| 5369 |
khakhasshi/myOwnTTS
A lightweight, high-performance voice cloning TTS system based on Coqui TTS... |
|
Experimental |
| 5370 |
ayutaz/uZipVoice
Unity implementation of ZipVoice - lightweight zero-shot text-to-speech... |
|
Experimental |
| 5371 |
andreehrlich/Daily-Briefing-Voice-Assistant
Conversational voice agent to brief you on your schedule for the day.... |
|
Experimental |
| 5372 |
corbinr40/RTCC
A piece of software that converts voice to text in a visual output, as an... |
|
Experimental |
| 5373 |
vislupus/Bulgarian-TTS-dataset
LibriVox dataset for Bulgarian language TTS |
|
Experimental |
| 5374 |
AppleHolic/2020AIChallengeSpeechRecognition
2020 AI Challenge 음성 인식 코드 |
|
Experimental |
| 5375 |
pika-online/Foreign_Pronunciation_Generator_for_Code-Switch_ASR
a socket script to obtain chinese phones-sequence for any english word |
|
Experimental |
| 5376 |
atharva9167j/Sign-Language-Translator
Sign Language Recognition Platform - A real-time American Sign Language... |
|
Experimental |
| 5377 |
Kavindu-Rankothge/tiktok-bot
TikTok video generation from scraping Reddit community posts |
|
Experimental |
| 5378 |
shahad-mahmud/incremental_learning_for_asr
Incremental learning for automatic speech recognition (ASR) |
|
Experimental |
| 5379 |
voidful/whisper-live-asr-demo
run whisper on CPU/GPU server |
|
Experimental |
| 5380 |
4over7/SpeakOut
Offline-first AI voice input for macOS. Hold-to-speak or tap-to-toggle,... |
|
Experimental |
| 5381 |
timothypesi/Speech-to-Text-Converter
This GitHub repository contains a Python Streamlit app that utilizes machine... |
|
Experimental |
| 5382 |
bfackland/replica_dialog_generator
🗣 Auto-generate dialog audio files using the Replica Studios 'AI Voices' API... |
|
Experimental |
| 5383 |
oscurprof/Realtime-Subtitles-Generator-using-Python
LiveScript: Real-time Live Captioning Software, generates subtitles in... |
|
Experimental |
| 5384 |
maziac/currah_uspeech_tests
Tests for the ZX Spectrums speech synthesizer peripheral: Currah uSpeech... |
|
Experimental |
| 5385 |
gerlaxrex/parrot
PARRoT: Precise Audio Recognition and Recap over Transcription |
|
Experimental |
| 5386 |
SSobol77/Say-Salomon-AI
Asynchronous text-to-speech conversion, asynchronous speech-to-text... |
|
Experimental |
| 5387 |
xingchensong/ASR-Wavnet
some ASR-system implementations (via tensorflow 1.x) |
|
Experimental |
| 5388 |
morikeli/Xcalibur
A speech recognition and translation website built with Django in addition... |
|
Experimental |
| 5389 |
MorrisXu-Driving/Improving_DeepSpeech_2_by_RNN_Transducer_Pytorch_Implementation
In this repository, based on Deep Speech 2, two losses, CTC and RNN-T are compared. |
|
Experimental |
| 5390 |
Androz2091/Cicero
Great speaker, Cicero is a text-to-speech Discord Bot! |
|
Experimental |
| 5391 |
rossriserose/Real-time-Voice-cloning
Clone a voice to generate arbitrary speech in real-time |
|
Experimental |
| 5392 |
marcosfelt/latex2speech
Convert Latex to speech |
|
Experimental |
| 5393 |
shreyashghag/OfflineSpeechRecognition
Offline Speech Recognition For Android Library |
|
Experimental |
| 5394 |
eray-yuztyurk/python-ai-voice-chatbot
AI-powered voice chatbot with Gradio web interface. Talk or type your... |
|
Experimental |
| 5395 |
Sec-ant/etts
edge-tts in Bun. |
|
Experimental |
| 5396 |
HarunoriKawano/Conformer
Implementation of the paper "Conformer: Convolution-augmented Transformer... |
|
Experimental |
| 5397 |
dibbed/TTSKit-multi-engine-tts
Python Text-to-Speech toolkit (multi-engine) with FastAPI, CLI and Telegram... |
|
Experimental |
| 5398 |
technout/tts_gtk
Graphical interface for Coqui TTS (Text to Speech) command line. Made in... |
|
Experimental |
| 5399 |
Tombarr/TranscriberApp
Local-first macOS Tahoe Transcription App & CLI Tool |
|
Experimental |
| 5400 |
Dalia-Sher/Speech-Emotion-Recognition-using-BLSTM-with-Attention
We present a study of a neural network based method for speech emotion... |
|
Experimental |