All Voice AI Tools

8,165 tools ranked by quality score · Page 33 of 82

Showing 3201–3300 of 8,165
# Tool Score Tier
3201 Echoshard/DiscordBotOpenAI_TTS

A simple discord bot that can produces mp3's using Open AI's TTS API.

30
Emerging
3202 CT83/Hellin-Worki

A video conferencing platform which seamlessly dials your coworkers when you...

30
Emerging
3203 stitchng/adonis-infobip

An addon/plugin package to provide InfoBip single/bulk SMS/Voice services in...

30
Emerging
3204 devfinwiz/Python-Voice-Assistant-Virtual-Slave

This voice assistant is buit in VS Code. It has an ability to understand...

30
Emerging
3205 lohriialo/texttospeech

Google's Speech Synthesis, Text to speech conversion powered by machine learning

30
Emerging
3206 appatalks/Bark_text-to-speech

Playground with Bark

30
Emerging
3207 rabiaedayilmaz/speech2text-pipelines

Speech to text pipelines using both APIs and finetuned models on custom and...

30
Emerging
3208 SilkReyn/MAS-xttsClient

Submod for Monika-After-Story that generates voice for Monika's dialogue by...

30
Emerging
3209 Taijul007/VieNeu-TTS

🎤 Generate realistic Vietnamese speech with VieNeu-TTS, an advanced...

30
Emerging
3210 epfluegel/TalkMaths

A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX)...

30
Emerging
3211 benrucker/JermaBot

A wacky, sound-oriented Discord bot

30
Emerging
3212 YugwonWon/KOINA

KOINA (Korean Intonation Annotator) is a tool that automatically annotates...

30
Emerging
3213 fwcd/okpi

Virtual assistant with offline voice recognition for Raspberry Pi

30
Emerging
3214 siddhantmishra1305/Anuvaad

An iOS translator that supports more that 40 languages. User can add notes...

30
Emerging
3215 ascender1729/AudioDictate

An efficient desktop application for transcribing audio files into text...

30
Emerging
3216 brailcom/tts-api-provider

Common interface to speech synthesis

30
Emerging
3217 dalmoon15/styletts2-dataset-toolkit

🎤 Streamline voice cloning with the StyleTTS2 Dataset Toolkit, a...

30
Emerging
3218 sanjifr3/Narrator

An image and video description generator using an CNN-RNN based architecture.

30
Emerging
3219 tazz4843/scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

30
Emerging
3220 jailuthra/asr

Kaldi ASR wrapper scripts

30
Emerging
3221 bibinkunjumon2020/Azure-Avatar-AI

The text to speech avatar system is a text to speech feature with vision...

30
Emerging
3222 PezCoder/ai-chatbot

Bot who can listen & talk.

30
Emerging
3223 marvin1099/AndroidFossSTTandKeyboard

This is my Foss setup to replace Gboard, Google Voice input, Gboard IME (STT...

30
Emerging
3224 Kini218/transcriber_bot

convert text to speech and conversely

30
Emerging
3225 th33k/Luigi

LUIGI is an interactive pet robot designed for fun, companionship, and...

30
Emerging
3226 anshulgupta0803/ASSR

ASSR: Automatic Stuttered Speech Recognition

30
Emerging
3227 mkiol/papago

Papago repeats what you say but in different language

30
Emerging
3228 ashsystems/coqui-rs

Rust bindings to the https://github.com/coqui-ai TTS library

30
Emerging
3229 oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation

This repository contains scripts to prune Wav2vec2 using a...

30
Emerging
3230 jark006/SummerTTS_VS

SummerTTS...

30
Emerging
3231 Token-project/token.tts

TOKEN TTS (Trusted digital TimeStamping Service) provides anonymous,...

30
Emerging
3232 diharaw/emo-lib

Bi-model Convolutional Neural Network based Emotion Classification library...

30
Emerging
3233 SeanPLeary/dc_tts-transfer-learning

Transfer learning exploration of dc_tts text-to-speech model

30
Emerging
3234 TeaPoly/CE-OptimizedLoss

Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss...

30
Emerging
3235 akshatg-721/JanSamvaad-ResolveOS

JanSamvaad ResolveOS — A voice-first AI governance system that converts...

30
Emerging
3236 I5UCC/VRCTextboxSTT

A SpeechToText application that uses OpenAI's whisper via faster-whisper to...

30
Emerging
3237 gtsopus/SoftEng-SoftDev2-UoI-Projects

University project for the "Software Engineering" course made in...

30
Emerging
3238 maxiee/HeartEcho

Explore and express your inner voice through personalized conversations with...

30
Emerging
3239 CodingWithEnjoy/Speech-To-Text-HTML-CSS-JS

متن به صدا | Text To Speech 😊🤩

30
Emerging
3240 nezhar/speech-condenser

A tool for summarizing dialogues from videos or audio

30
Emerging
3241 ashfaaqrifath/Speechtron

This Python text to speech program converts text from user-provided files or...

30
Emerging
3242 ServerSideHannes/las

tf 2.0 implementation of Listen, attend and spell

30
Emerging
3243 ambegossi/dislexiapp-backend

💫 Node.js backend for DislexiApp.

30
Emerging
3244 sdsb8432/TextToSpeech-Android

Text to Speech for Android Application with Google API

30
Emerging
3245 licavalentin/reddit-video-creator

✨📼Create Reddit Videos with JavaScript📼✨

30
Emerging
3246 huaxiaozhong1/Tensorflow-SparkFunEdge-FullLifeCycel-for-SequenceModel

An "AI on-device" project for sequence model. Based at Tensorflow Lite for...

30
Emerging
3247 sarumaj/bing-wallpaper-changer

Fetch newest bing wallpaper and set it as background. Use NLP and...

30
Emerging
3248 zhongyuchen/DSPSpeech-20

A speech dataset of 20 isolated words each with 680 recordings from 34 individuals

30
Emerging
3249 aaivu/KuralNet

A deep learning-based Speech Emotion Recognition (SER) model trained...

30
Emerging
3250 TheMindhouse/memospeak

Memorize any text with voice recognition

30
Emerging
3251 alihassanml/Voice-Controlled-Agentic-AI-Bot

A real-time voice assistant powered by Ollama, Piper TTS, and...

30
Emerging
3252 crispinprojects/klatt-synthesizer

Klatt speech synthesizer

30
Emerging
3253 nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and...

30
Emerging
3254 tushar-prabhu/Multilingual-Voice-Transcriber-and-Translator

A Python-based application that records voice, transcribes spoken text,...

30
Emerging
3255 rodrigosuelli/ditey-web

🎙 Leitor de textos online desenvolvido com React e Web Speech API. Tcc (ETEC)

30
Emerging
3256 gokulkarthik/text2speech

Towards Building Text-To-Speech Systems for the Next Billion Users -...

30
Emerging
3257 DevStranger/NoteWriter

NoteWriter - aplikacja do sporządzania notatek ze zdalnych spotkań

30
Emerging
3258 miaubonito/subsync

🎥 Transcribe and translate YouTube subtitles quickly with SubSync, a Python...

30
Emerging
3259 t13m/kaldi-readers-for-tensorflow

readers that enable reading kaldi ark in tensorflow

30
Emerging
3260 legekka/GanyuTTS

A small VITS+SOVITS/RVC TTS API

30
Emerging
3261 haliphax/tts

Twitch text to speech overlay for OBS (using lobe-tts)

30
Emerging
3262 NICEElevateAI/ElevateAIDotNetSDK

.Net core 6 SDK for ElevateAI

30
Emerging
3263 hollygrimm/voice-dataset-creation

Tools to create your own voice dataset for TTS training

30
Emerging
3264 utsavpshah/SpeakingHands

This is an extension to LeapTrainer.js repository. With this project, we...

30
Emerging
3265 saztorralba/CNNWordReco

Code and scripts for training and testing isolated spoken word recognition...

30
Emerging
3266 bartbilliet/LiveTranslate.App

Generate translated subtitles for any audio source (Xamarin mobile app)

30
Emerging
3267 georgezoto/RNN-LSTM-NLP-Sequence-Models

Sequence Models repository for all projects and programming assignments of...

30
Emerging
3268 nodef/extra-tts

Generate speech audio from super long text through machine.

30
Emerging
3269 MiniXC/phones

A collection of utilities for handling IPA phones.

30
Emerging
3270 scottgl9/openclaw-matrix-voice

Matrix voice call bot with LiveKit, Whisper STT, and Chatterbox TTS,...

30
Emerging
3271 biyoml/PyTorch-End-to-End-ASR-on-TIMIT

Attention-based end-to-end ASR on TIMIT in PyTorch

30
Emerging
3272 alaminsheikh01/speech-recognition

Speech recognition, also known as automatic speech recognition (ASR),...

30
Emerging
3273 2017fandrei/ForcedAlignment

Graphical utility for forced alignment using aeneas, an interactive audio player

30
Emerging
3274 akabe/obs-transcript

Real-time subtitle generation by speech recognition for OBS Studio

30
Emerging
3275 RW128k/VCIDE

A simple text editor for writing Python using your voice.

30
Emerging
3276 seanghay/wav2vec2-khmer-openslr

Wav2Vec2 with OpenSLR 42 (Khmer language)

30
Emerging
3277 Nikya/voicify

To generate spoken notification

30
Emerging
3278 gillesdegottex/percivaltts

ATTENTION! This is a mirror of the following GitLab project:

30
Emerging
3279 SUNGBEOMCHOI/Korean-Streaming-ASR

Korean Streaming ASR(with Denoiser and Conformer CTC)

30
Emerging
3280 doubleZ0108/Human-Computer-Interaction

Human-Computer Interaction | Tongji Univ. SSE Course Projects

30
Emerging
3281 rafaelvalle/asrgen

Attacking Speaker Recognition with Deep Generative Models

30
Emerging
3282 roojay/bobplug-google-tts

Bob 的一个 Google tts 插件

30
Emerging
3283 QuyAnh2005/vits-japanese

Text to Speech for Japanese

30
Emerging
3284 97jamie/public-police-footage

Code for Constructing Datasets From Public Police Body Camera Footage (ICASSP 2025)

30
Emerging
3285 Nicolas-Prevot/TTS_playground

Unified toolkit for testing and comparing multiple state-of-the-art...

30
Emerging
3286 7rajatgupta/react-text-to-speech

react library using the speech syntesizer API to convert text to speech in real time

30
Emerging
3287 FlyingPolarBear/CityKBQA

Xiaode: a Knowledge Based Question Answering System with Speech IO

30
Emerging
3288 derek-byte/multilingual-voice-assistant-llm

cohere labs - aya expedition 2025: integrating speech & audio into aya...

30
Emerging
3289 codycollier/wer

A word error rate util for golang

30
Emerging
3290 yxngrbree/text-to-speech

Nano weight TTS

30
Emerging
3291 khalooei/Voxtral-AI-Demo-Local-Interface

Voxtral is a state-of-the-art model developed to handle both speech...

30
Emerging
3292 cobaltos/dictit

Speech Recognition Tool Based On Speech Recognition API

30
Emerging
3293 ZackAkil/global-video-dubbing

Using Googel Cloud Video Intelligence API with Cloud Translation API and...

30
Emerging
3294 BlankOnTheHub/Audiopub

🎧 Transform EPUBs into high-fidelity audiobooks locally with Audiopub, using...

30
Emerging
3295 shessam/DSR

Throughout history, Altough there has been significant research in the field...

30
Emerging
3296 EricNeves/speechRecognition

Speech Recognition with JS 🎙️

30
Emerging
3297 botbahlul/android-autosrt-v2

ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE...

30
Emerging
3298 common-voice/our-voices-model-competition

Our Voices Competition

30
Emerging
3299 JTylerH/unifi-aihorn-dynamic-tts

This project hosts a lightweight Node.js web app that connects to your UniFi...

30
Emerging
3300 yikZero/Rotts

Full-stack web service with React frontend and Python backend. Features Edge...

30
Emerging
« Prev 1 2 3 31 32 33 34 35 80 81 82 Next »