All Voice AI Tools
8,165 tools ranked by quality score · Page 21 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2001 | brailcom/speechd-el - Emacs speech and Braille output interface | | Emerging |
| 2002 | Julia-Roman/pepega-tts - Discord bot for Google and Polly Text-to-Speech | | Emerging |
| 2003 | 01-vyom/End_2_End_Automatic_Speech_Recognition_For_Gujarati - [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition... | | Emerging |
| 2004 | Abhishek-op/SR - 💡 Kivy-android speech recognition | | Emerging |
| 2005 | IndieCoderMM/smart-one-ai - 🤖 AI assistant that can listen to user input and provide responses. It... | | Emerging |
| 2006 | soniqo/speech-android - On-device speech SDK for Android: ASR, TTS, VAD, and noise cancellation... | | Emerging |
| 2007 | artcore-c/AI-Voice-Clone-with-Qwen3-TTS - Free voice cloning and TTS for creators using Qwen3-TTS on Google Colab.... | | Emerging |
| 2008 | jonelo/jAdapterForNativeTTS - A simple pure Java library that allows you to use the native Text To Speech... | | Emerging |
| 2009 | ScottishFold007/Cosyvoice_DPO_NOTES - CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO... | | Emerging |
| 2010 | calinalexandru/pericles - A browser extension offering intuitive text-to-speech functionality, making... | | Emerging |
| 2011 | nchudleigh/sc2-ultra - Voice-controlled StarCraft II - command Zerg, Protoss, or Terran using... | | Emerging |
| 2012 | aks-devs/mod_openai_tts - Freeswitch Speech-To-Text module | | Emerging |
| 2013 | shafaypro/PYSHA - A Simple Virtual Assistant Built in Python 3.5 | | Emerging |
| 2014 | scripty-bot/scripty - Speech to text bot for Discord | | Emerging |
| 2015 | iron-mukakin/Emoji-TTS - A fork of Irodori-TTS; a web UI for echo-TTS. | | Emerging |
| 2016 | Martouta/speech_processor - Speech-to-text from videos and audios (including youtube and tiktok links) | | Emerging |
| 2017 | rishikksh20/iSTFT-Avocodo-pytorch - Ultrafast GAN based Vocoder for Text to Speech | | Emerging |
| 2018 | parthgupta1208/VoiceCraft - Voice Craft is a desktop AI assistance tool designed to help people with... | | Emerging |
| 2019 | deepily/genie-in-the-box - Genie in the Box: Distill Whisper STT => Mistral-7B =>... | | Emerging |
| 2020 | mozi1924/Qwen3-TTS-EasyFinetuning - Easy fine-tuning for Qwen3-TTS: Fast voice cloning and high-quality... | | Emerging |
| 2021 | kurianbenoy/malayalam_asr_benchmarking - A study to benchmark whisper based ASRs in Malayalam | | Emerging |
| 2022 | audioku/cross-accent-maml-asr - Meta-learning model agnostic (MAML) implementation for cross-accented ASR | | Emerging |
| 2023 | williamxhero/ttsmaker - TTSMaker: A Python library for interacting with the TTSMaker API to easily... | | Emerging |
| 2024 | loushou/flutter_tts_improved - A fork of the Flutter_TTS (https://github.com/dlutton/flutter_tts) plugin,... | | Emerging |
| 2025 | skit-ai/speech-recognition - SDKs and docs for Skit's speech to text service | | Emerging |
| 2026 | superU-ai/voice-agent-QA - A unified benchmarking framework for evaluating Voice AI agents across... | | Emerging |
| 2027 | jfainberg/lattice_combination - Lattice combination algorithm to combine inaccurate transcripts with... | | Emerging |
| 2028 | phineas-pta/speech-synthesis-ngngngan - Python script to download & process data to train a speech-synthesis model... | | Emerging |
| 2029 | chameleon-ai/vevo - Simple GUI for Amphion Vevo | | Emerging |
| 2030 | acyclics/speech-to-speech-translator - Enables a device to input speech from a microphone, translate speech to a... | | Emerging |
| 2031 | mirfan899/CTTS - Cantonese TTS frontend | | Emerging |
| 2032 | frrobledo/AutoDub - An advanced AI-powered tool that automatically translates and dubs YouTube... | | Emerging |
| 2033 | hcoles/voices - Fast, in-process text to speech for Java | | Emerging |
| 2034 | ferosai/feros - Open-source voice agent OS. Rust runtime, AI-driven builder, sub second... | | Emerging |
| 2035 | qiujiali/lattice_rnn - Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation | | Emerging |
| 2036 | liou666/audiread - 📻 A simple and user-friendly online TTS tool. | | Emerging |
| 2037 | stevenhillis/awesome-asr-contextualization - A curated list of awesome papers on contextualizing E2E ASR outputs | | Emerging |
| 2038 | mishrababhishek/chatbot - AI Chatbot answers students' queries about their college program using... | | Emerging |
| 2039 | botbahlul/js-live-audio-video-translate - HTML Web template that can RECOGNIZE any live audio/video streaming (using... | | Emerging |
| 2040 | ameerbadri/twilio-asr-realtime-dashboard - Twilio ASR and Intent Realtime Dashboard | | Emerging |
| 2041 | ndenicolais/SpeechAndText - Android application built with Kotlin and Jetpack Compose that shows how to... | | Emerging |
| 2042 | OpenASR/idiolect - 🎙️ Handsfree Audio Development Interface | | Emerging |
| 2043 | SaptakBhoumik/easySpeech - easySpeech is an open-source Python wrapper for google speech to text API... | | Emerging |
| 2044 | weespin/RequestifyTF2 - Client side commands for mic spamming and more! | | Emerging |
| 2045 | clloret/speaking-practice - An Android application to practice English pronunciation | | Emerging |
| 2046 | theaifutureguy/Vocal-Agent - A sophisticated real-time voice assistant that seamlessly integrates speech... | | Emerging |
| 2047 | Helow19274/aiogTTS - Async Python library to interface with Google Translate's text-to-speech API | | Emerging |
| 2048 | SkyDocs/speaker-identification - Speaker Identification using Neural Net. | | Emerging |
| 2049 | haiodo/oaitt - An OpenAI compatible transcriber using transformers and whisperx. | | Emerging |
| 2050 | LibraryOfCongress/speech-to-text-viewer - AWS Transcribe evaluation pipeline: bulk-process audio files and view the results | | Emerging |
| 2051 | DrAchernar/location-based-AR-app - This Flutter project is an example for a location based AR app with... | | Emerging |
| 2052 | abinashmeher999/voice-data-extract - A command line interface to combine text information from subtitles with... | | Emerging |
| 2053 | LuluW8071/Conformer - End-to-End Speech Recognition Training with Conformer CTC using PyTorch Lightning⚡ | | Emerging |
| 2054 | cmsflash/deep-learning-sota - State-of-the-art results for deep learning tasks in various fields. | | Emerging |
| 2055 | linto-ai/linto-diarization - Speaker diarization service | | Emerging |
| 2056 | ORI-Muchim/One-Click-MB-iSTFT-VITS2 - MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making... | | Emerging |
| 2057 | niteshsharmacodes/neutts-ultimate - NeuTTS-Ultimate - Advanced Text-to-Speech generation with unlimited... | | Emerging |
| 2058 | Mohamed-samy2/Video-Interview-Analysis - PRVIA is an AI-powered system that automates the evaluation of pre-recorded... | | Emerging |
| 2059 | csyan5/AttnGAN-Audio-to-image-geneation - CMPT726 Machine Learning Final Project | | Emerging |
| 2060 | nate-russell/Scholar2Go - Make MP3 albums out of Academic PDFs. Works by gluing together Grobid and... | | Emerging |
| 2061 | arora-r/chatapp-with-voice-and-openai - This project uses OpenAI's GPT-3 model to create a simple assistant that can... | | Emerging |
| 2062 | javichur/fitness-voice - AI voice-controlled trainer in your web browser, using NLP (wit.ai), body... | | Emerging |
| 2063 | speechly/browser-client-example - A demo app showcasing Speechly browser-client and detailed API responses. | | Emerging |
| 2064 | Fraunhofer-AISEC/towards-resistant-audio-adversarial-examples - Generation tool for offset-resistant audio adversarial examples against Deepspeech | | Emerging |
| 2065 | nixonyh/UnityASR - Automatic Speech Recognition in Unity. | | Emerging |
| 2066 | KoalaV2/K.A.I - Home automation program controlled by your voice. | | Emerging |
| 2067 | nheidloff/unity-watson-vr-sample - Virtual Reality Sample using IBM Watson, Unity and Google Cardboard | | Emerging |
| 2068 | piotrkawa/deepfake-whisper-features - Implementation of the paper "Improved DeepFake Detection Using Whisper Features" | | Emerging |
| 2069 | mike-nott/smart-announcements - Intelligent context-aware voice announcements for Home Assistant.... | | Emerging |
| 2070 | Vishnu-tppr/NEXORA-AI - Made with Python, crafted by Vishnu 💻✨ Nexora AI – A smart Python voice... | | Emerging |
| 2071 | Franck-Dernoncourt/ASR_benchmark - Program to benchmark various speech recognition APIs | | Emerging |
| 2072 | chirag127/WebSpeak-TextToSpeech-Browser-Extension - High-fidelity browser extension leveraging the Web Speech API for precise,... | | Emerging |
| 2073 | Hagsten/Talkify - JavaScript Text to speech library | | Emerging |
| 2074 | arham-kk/openai-tts - This repository features a Gradio interface designed to leverage the OpenAI... | | Emerging |
| 2075 | manab-kb/Voice-Based-Translator - A Voice Based Translator - Speak in English or any of the available selected... | | Emerging |
| 2076 | chattylabs/conversational-flow - The Conversational Flow combines both native built-in resources and cloud... | | Emerging |
| 2077 | gaborvecsei/whisper-live-transcription - Live-Transcription (STT) with Whisper PoC | | Emerging |
| 2078 | thc1006/whisper-colab-tpu-transcriber - High-performance Google Colab Notebook for fast & accurate audio... | | Emerging |
| 2079 | richardassar/SampleRNN_torch - Torch implementation of SampleRNN: An Unconditional End-to-End Neural Audio... | | Emerging |
| 2080 | neurlang/gospeak - A Golang Text to Speech System | | Emerging |
| 2081 | b4rtaz/voice-assistant - Voice assistant for Visual Studio Code. | | Emerging |
| 2082 | yh1008/speech-to-text - mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras | | Emerging |
| 2083 | resemble-ai/resemble-unity-text-to-speech - Resemble's voice cloning engine within Unity | | Emerging |
| 2084 | jvandenaardweg/ssml-split - Splits SSML strings into batches AWS Polly and Google's Text to Speech API... | | Emerging |
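| 2085 | bdim404/Qwen3-TTS-WebUI - A full-stack text-to-speech web application based on Alibaba's Qwen3-TTS model (1.7 billion parameters), supporting custom voices, voice design, voice cloning, and audiobook generation. A... | | Emerging |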
| 2086 | ArchitParnami/Few-Shot-KWS - Few-Shot Keyword Spotting | | Emerging |
| 2087 | ohmstone/pocket-tts-deno - WASM ONNX build of Pocket TTS with voice cloning adapted from... | | Emerging |
| 2088 | aperepel/claude-mlx-tts - Voice-cloned smart attention TTS notifications for Claude Code. AI... | | Emerging |
| 2089 | azu/vscode-read-aloud-text - VSCode extension that reads aloud text like Markdown and text etc... | | Emerging |
| 2090 | AceCentre/TextAloud - iOS app. Built in Swift. Reads out text - sentence by sentence, paragraph by... | | Emerging |
| 2091 | alecokas/BiLatticeRNN-Confidence - Confidence Estimation for Black Box Automatic Speech Recognition Systems... | | Emerging |
| 2092 | manish-4007/YT-video-Transcription - An AI tool that helps to analyze any YouTube video, give the sentiment of... | | Emerging |
| 2093 | ga642381/FastSpeech2 - Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to... | | Emerging |
| 2094 | bhashini-ai/g2p - Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a... | | Emerging |
| 2095 | prateekralhan/Speech2Text-for-Long-Audio-Files - Perform SOTA Speech2Text on Long Audio Files with/without diarization Using... | | Emerging |
| 2096 | vijethph/Insight - A Flutter app to help blind people. | | Emerging |
| 2097 | anwar-gazi/ivrworks - Build IVR, run voice campaign, with machine detection, speech recognition... | | Emerging |
| 2098 | asus4/unity-speech-recognizer - iOS Speech Recognizer for Unity | | Emerging |
| 2099 | marcominerva/TranslatorService - A lightweight library that uses Cognitive Translator Service for text... | | Emerging |
| 2100 | kwebby/Qwen3-TTS-Voice-Studio - A Text to Speech App for Qwen3-TTS Family Models to create custom voices,... | | Emerging |