All Voice AI Tools
8,165 tools ranked by quality score · Page 33 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 3201 | Echoshard/DiscordBotOpenAI_TTS: A simple Discord bot that can produce MP3s using OpenAI's TTS API. | | Emerging |
| 3202 | CT83/Hellin-Worki: A video conferencing platform which seamlessly dials your coworkers when you... | | Emerging |
| 3203 | stitchng/adonis-infobip: An addon/plugin package to provide InfoBip single/bulk SMS/Voice services in... | | Emerging |
| 3204 | devfinwiz/Python-Voice-Assistant-Virtual-Slave: This voice assistant is built in VS Code. It has an ability to understand... | | Emerging |
| 3205 | lohriialo/texttospeech: Google's Speech Synthesis, text-to-speech conversion powered by machine learning | | Emerging |
| 3206 | appatalks/Bark_text-to-speech: Playground with Bark | | Emerging |
| 3207 | rabiaedayilmaz/speech2text-pipelines: Speech to text pipelines using both APIs and finetuned models on custom and... | | Emerging |
| 3208 | SilkReyn/MAS-xttsClient: Submod for Monika-After-Story that generates voice for Monika's dialogue by... | | Emerging |
| 3209 | Taijul007/VieNeu-TTS: 🎤 Generate realistic Vietnamese speech with VieNeu-TTS, an advanced... | | Emerging |
| 3210 | epfluegel/TalkMaths: A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX)... | | Emerging |
| 3211 | benrucker/JermaBot: A wacky, sound-oriented Discord bot | | Emerging |
| 3212 | YugwonWon/KOINA: KOINA (Korean Intonation Annotator) is a tool that automatically annotates... | | Emerging |
| 3213 | fwcd/okpi: Virtual assistant with offline voice recognition for Raspberry Pi | | Emerging |
| 3214 | siddhantmishra1305/Anuvaad: An iOS translator that supports more than 40 languages. User can add notes... | | Emerging |
| 3215 | ascender1729/AudioDictate: An efficient desktop application for transcribing audio files into text... | | Emerging |
| 3216 | brailcom/tts-api-provider: Common interface to speech synthesis | | Emerging |
| 3217 | dalmoon15/styletts2-dataset-toolkit: 🎤 Streamline voice cloning with the StyleTTS2 Dataset Toolkit, a... | | Emerging |
| 3218 | sanjifr3/Narrator: An image and video description generator using a CNN-RNN-based architecture. | | Emerging |
| 3219 | tazz4843/scripty: Speech to text bot for Discord using Mozilla's DeepSpeech | | Emerging |
| 3220 | jailuthra/asr: Kaldi ASR wrapper scripts | | Emerging |
| 3221 | bibinkunjumon2020/Azure-Avatar-AI: The text to speech avatar system is a text to speech feature with vision... | | Emerging |
| 3222 | PezCoder/ai-chatbot: A bot that can listen and talk. | | Emerging |
| 3223 | marvin1099/AndroidFossSTTandKeyboard: This is my FOSS setup to replace Gboard, Google Voice input, Gboard IME (STT... | | Emerging |
| 3224 | Kini218/transcriber_bot: Converts text to speech and vice versa | | Emerging |
| 3225 | th33k/Luigi: LUIGI is an interactive pet robot designed for fun, companionship, and... | | Emerging |
| 3226 | anshulgupta0803/ASSR: ASSR: Automatic Stuttered Speech Recognition | | Emerging |
| 3227 | mkiol/papago: Papago repeats what you say but in a different language | | Emerging |
| 3228 | ashsystems/coqui-rs: Rust bindings to the https://github.com/coqui-ai TTS library | | Emerging |
| 3229 | oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation: This repository contains scripts to prune Wav2vec2 using a... | | Emerging |
| 3230 | jark006/SummerTTS_VS: SummerTTS... | | Emerging |
| 3231 | Token-project/token.tts: TOKEN TTS (Trusted digital TimeStamping Service) provides anonymous,... | | Emerging |
| 3232 | diharaw/emo-lib: Bi-model Convolutional Neural Network based Emotion Classification library... | | Emerging |
| 3233 | SeanPLeary/dc_tts-transfer-learning: Transfer learning exploration of dc_tts text-to-speech model | | Emerging |
| 3234 | TeaPoly/CE-OptimizedLoss: Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss... | | Emerging |
| 3235 | akshatg-721/JanSamvaad-ResolveOS: JanSamvaad ResolveOS - A voice-first AI governance system that converts... | | Emerging |
| 3236 | I5UCC/VRCTextboxSTT: A SpeechToText application that uses OpenAI's whisper via faster-whisper to... | | Emerging |
| 3237 | gtsopus/SoftEng-SoftDev2-UoI-Projects: University project for the "Software Engineering" course made in... | | Emerging |
| 3238 | maxiee/HeartEcho: Explore and express your inner voice through personalized conversations with... | | Emerging |
| 3239 | CodingWithEnjoy/Speech-To-Text-HTML-CSS-JS: Text To Speech 😊🤩 | | Emerging |
| 3240 | nezhar/speech-condenser: A tool for summarizing dialogues from videos or audio | | Emerging |
| 3241 | ashfaaqrifath/Speechtron: This Python text to speech program converts text from user-provided files or... | | Emerging |
| 3242 | ServerSideHannes/las: TensorFlow 2.0 implementation of Listen, Attend and Spell | | Emerging |
| 3243 | ambegossi/dislexiapp-backend: 💫 Node.js backend for DislexiApp. | | Emerging |
| 3244 | sdsb8432/TextToSpeech-Android: Text to Speech for Android Application with Google API | | Emerging |
| 3245 | licavalentin/reddit-video-creator: ✨📼Create Reddit Videos with JavaScript📼✨ | | Emerging |
| 3246 | huaxiaozhong1/Tensorflow-SparkFunEdge-FullLifeCycel-for-SequenceModel: An "AI on-device" project for sequence models. Based on TensorFlow Lite for... | | Emerging |
| 3247 | sarumaj/bing-wallpaper-changer: Fetch the newest Bing wallpaper and set it as background. Use NLP and... | | Emerging |
| 3248 | zhongyuchen/DSPSpeech-20: A speech dataset of 20 isolated words each with 680 recordings from 34 individuals | | Emerging |
| 3249 | aaivu/KuralNet: A deep learning-based Speech Emotion Recognition (SER) model trained... | | Emerging |
| 3250 | TheMindhouse/memospeak: Memorize any text with voice recognition | | Emerging |
| 3251 | alihassanml/Voice-Controlled-Agentic-AI-Bot: A real-time voice assistant powered by Ollama, Piper TTS, and... | | Emerging |
| 3252 | crispinprojects/klatt-synthesizer: Klatt speech synthesizer | | Emerging |
| 3253 | nuaazs/VAF_2: Aims to create a comprehensive voice toolkit for training, testing, and... | | Emerging |
| 3254 | tushar-prabhu/Multilingual-Voice-Transcriber-and-Translator: A Python-based application that records voice, transcribes spoken text,... | | Emerging |
| 3255 | rodrigosuelli/ditey-web: 🎙 Online text reader built with React and the Web Speech API. Final course project (ETEC) | | Emerging |
| 3256 | gokulkarthik/text2speech: Towards Building Text-To-Speech Systems for the Next Billion Users -... | | Emerging |
| 3257 | DevStranger/NoteWriter: NoteWriter - an application for taking notes from remote meetings | | Emerging |
| 3258 | miaubonito/subsync: 🎥 Transcribe and translate YouTube subtitles quickly with SubSync, a Python... | | Emerging |
| 3259 | t13m/kaldi-readers-for-tensorflow: Readers that enable reading Kaldi ark files in TensorFlow | | Emerging |
| 3260 | legekka/GanyuTTS: A small VITS+SOVITS/RVC TTS API | | Emerging |
| 3261 | haliphax/tts: Twitch text to speech overlay for OBS (using lobe-tts) | | Emerging |
| 3262 | NICEElevateAI/ElevateAIDotNetSDK: .NET Core 6 SDK for ElevateAI | | Emerging |
| 3263 | hollygrimm/voice-dataset-creation: Tools to create your own voice dataset for TTS training | | Emerging |
| 3264 | utsavpshah/SpeakingHands: This is an extension to LeapTrainer.js repository. With this project, we... | | Emerging |
| 3265 | saztorralba/CNNWordReco: Code and scripts for training and testing isolated spoken word recognition... | | Emerging |
| 3266 | bartbilliet/LiveTranslate.App: Generate translated subtitles for any audio source (Xamarin mobile app) | | Emerging |
| 3267 | georgezoto/RNN-LSTM-NLP-Sequence-Models: Sequence Models repository for all projects and programming assignments of... | | Emerging |
| 3268 | nodef/extra-tts: Generate speech audio from super long text through machine. | | Emerging |
| 3269 | MiniXC/phones: A collection of utilities for handling IPA phones. | | Emerging |
| 3270 | scottgl9/openclaw-matrix-voice: Matrix voice call bot with LiveKit, Whisper STT, and Chatterbox TTS,... | | Emerging |
| 3271 | biyoml/PyTorch-End-to-End-ASR-on-TIMIT: Attention-based end-to-end ASR on TIMIT in PyTorch | | Emerging |
| 3272 | alaminsheikh01/speech-recognition: Speech recognition, also known as automatic speech recognition (ASR),... | | Emerging |
| 3273 | 2017fandrei/ForcedAlignment: Graphical utility for forced alignment using aeneas, an interactive audio player | | Emerging |
| 3274 | akabe/obs-transcript: Real-time subtitle generation by speech recognition for OBS Studio | | Emerging |
| 3275 | RW128k/VCIDE: A simple text editor for writing Python using your voice. | | Emerging |
| 3276 | seanghay/wav2vec2-khmer-openslr: Wav2Vec2 with OpenSLR 42 (Khmer language) | | Emerging |
| 3277 | Nikya/voicify: Generates spoken notifications | | Emerging |
| 3278 | gillesdegottex/percivaltts: ATTENTION! This is a mirror of the following GitLab project: | | Emerging |
| 3279 | SUNGBEOMCHOI/Korean-Streaming-ASR: Korean Streaming ASR (with Denoiser and Conformer CTC) | | Emerging |
| 3280 | doubleZ0108/Human-Computer-Interaction: Human-Computer Interaction \| Tongji Univ. SSE Course Projects | | Emerging |
| 3281 | rafaelvalle/asrgen: Attacking Speaker Recognition with Deep Generative Models | | Emerging |
| 3282 | roojay/bobplug-google-tts: A Google TTS plugin for Bob | | Emerging |
| 3283 | QuyAnh2005/vits-japanese: Text to Speech for Japanese | | Emerging |
| 3284 | 97jamie/public-police-footage: Code for Constructing Datasets From Public Police Body Camera Footage (ICASSP 2025) | | Emerging |
| 3285 | Nicolas-Prevot/TTS_playground: Unified toolkit for testing and comparing multiple state-of-the-art... | | Emerging |
| 3286 | 7rajatgupta/react-text-to-speech: React library using the speech synthesizer API to convert text to speech in real time | | Emerging |
| 3287 | FlyingPolarBear/CityKBQA: Xiaode: a Knowledge Based Question Answering System with Speech IO | | Emerging |
| 3288 | derek-byte/multilingual-voice-assistant-llm: Cohere Labs - Aya Expedition 2025: integrating speech & audio into Aya... | | Emerging |
| 3289 | codycollier/wer: A word error rate util for golang | | Emerging |
| 3290 | yxngrbree/text-to-speech: Nano weight TTS | | Emerging |
| 3291 | khalooei/Voxtral-AI-Demo-Local-Interface: Voxtral is a state-of-the-art model developed to handle both speech... | | Emerging |
| 3292 | cobaltos/dictit: Speech Recognition Tool Based On Speech Recognition API | | Emerging |
| 3293 | ZackAkil/global-video-dubbing: Using Google Cloud Video Intelligence API with Cloud Translation API and... | | Emerging |
| 3294 | BlankOnTheHub/Audiopub: 🎧 Transform EPUBs into high-fidelity audiobooks locally with Audiopub, using... | | Emerging |
| 3295 | shessam/DSR: Throughout history, although there has been significant research in the field... | | Emerging |
| 3296 | EricNeves/speechRecognition: Speech Recognition with JS 🎙️ | | Emerging |
| 3297 | botbahlul/android-autosrt-v2: ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE... | | Emerging |
| 3298 | common-voice/our-voices-model-competition: Our Voices Competition | | Emerging |
| 3299 | JTylerH/unifi-aihorn-dynamic-tts: This project hosts a lightweight Node.js web app that connects to your UniFi... | | Emerging |
| 3300 | yikZero/Rotts: Full-stack web service with React frontend and Python backend. Features Edge... | | Emerging |