All Voice AI Tools

8,165 tools ranked by quality score · Page 50 of 82

Showing 4901–5000 of 8,165
# Tool Score Tier
4901 isayahc/Semi-Automated-Youtube-Channel

Semi automated youtube channel that has a lot of cool features for someone...

22
Experimental
4902 rafalposwiata/text-normalization

Repository for text normalization research.

22
Experimental
4903 SPACESODA/read-txt

Read TXT is a lightweight text-to-speech reader with auto language detection...

22
Experimental
4904 SAMKhadka/ace-step-ui

🎵 Generate AI music effortlessly with ACE-Step UI, the open source...

22
Experimental
4905 allisonandreyev/WhisperQuantization

WhisperCPP (FP32) INT8, INT4, INT5, quantization effect on model latency and...

22
Experimental
4906 CaesiumY/dding-dong

Claude Code notification plugin — Sound alerts & OS notifications on task...

22
Experimental
4907 vpakarinen/kokorotts-webui

WebUI for Kokoro text-to-speech.

22
Experimental
4908 Mayank17M/vocalize

A speech recognition app that helps you keep track of your mental health and...

22
Experimental
4909 m-cheicki/VoiceOver_front

🎙️🎤 VoiceOver is a web application that allows you to transcribe English...

22
Experimental
4910 Bushramjad/XLSR-Wav2Vec2-Speech-Recognition-Urdu

Speech Recognition in Urdu language by fine-tuning the pretrained...

22
Experimental
4911 Gyvastis/google-speech-tts

A wrapper for Google Translate to generate an audio from a text.

22
Experimental
4912 fpaupier/tts-distil-whisper

Distil whisper on web

22
Experimental
4913 toavina2018/task-pilot

📋 Manage projects efficiently with TaskPilot, a full-stack application...

22
Experimental
4914 nilkanthshirodkar/Speech-Recognition-Using-HMM

Automatic Speech Recognition (ASR) system was implemented using the HMM...

22
Experimental
4915 ladykot/Butler

Прототип виртуального дворецкого на базе Yandex SpeechKit

22
Experimental
4916 skykongkong8/AI_device_with_RaspberryPi

Python/GPIO code for Tangible Artificial Intelligence device with RaspberryPi

22
Experimental
4917 NickSwardh/StreamSpeechToText

Stream Mp3 & Opus to Azure's Speech to Text without GStreamer

22
Experimental
4918 FelixWaweru/Copresenter

A virtual co-host that makes presentations a breeze by using AI to read out...

22
Experimental
4919 sergix44/oddcast-tts-php

A PHP interface to the online Oddcast demo API.

22
Experimental
4920 hahaanisha/digipal

Bridging the digital divide with interactive learning, voice guidance, and...

22
Experimental
4921 thewh1teagle/zipvoice-onnx

TTS with ZipVoice and onnxruntime

22
Experimental
4922 Shuichi346/qwen-voice-clone-webui

A Gradio WebUI for voice cloning powered by Qwen3-TTS. Provide reference...

22
Experimental
4923 f76tbntbww-crypto/VoiceForge

One-click local AI voice assistant powered by ASR+LLM+TTS, 100% coded by...

22
Experimental
4924 waltervanheuven/speech2text

Speech2Text

22
Experimental
4925 elemarmar/joke-teller

🤖💬 Joke Teller gets random jokes from third party API and converts them to...

22
Experimental
4926 NJUxlj/hotel-voice-agent-manual

一个RAG语音对话助手,用于上海的旅游信息查询。用户语音输入用ASR转文本,再用智谱api搜知识库+RAG生成回复,最后用TTS转语音输出。

22
Experimental
4927 x07x08/waveboard

A simple cross-platform soundboard

22
Experimental
4928 milosgajdos/playht_rs

PlayHT TTS Rust crate

22
Experimental
4929 tristan-mcinnis/Realtime-Whisper-Console-Transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for...

22
Experimental
4930 edgarmedrano/javier-js-code

JAvascript Voicexml InterpretER. This is the JavaScript implementation, if...

22
Experimental
4931 burritosoftware/mira

A modular text-to-speech Discord bot for Bay Area public transit systems.

22
Experimental
4932 fruxc/Voice-Assistant-Based-News-App

Artificial-Intelligence based news application - A web application which...

22
Experimental
4933 ab-smith/kokoro-tts-webui

Gradio-based web ui for Kokoro to simplify its usage with multiple voices,...

22
Experimental
4934 Jayden-X-L/lobster-radio-skill

个性化qwen3本地模型驱动的资讯电台生成服务 - OpenClaw Skill

22
Experimental
4935 ali-ibnouf/SmartTalker

Digital Human AI Agent Platform — Real-time talking avatar with Arabic-first support

22
Experimental
4936 JhonatanAiT14/dictate.sh

🎤 Transcribe speech with low-latency on Apple Silicon using dictate.sh;...

22
Experimental
4937 dragonchen0131/Ai_Lee_translator

An ancient/modern chinese translator with a unique voice

22
Experimental
4938 nerdpudding/nerdpudding

The proof is in the pudding. Real-time AI video commentary with...

22
Experimental
4939 famda/semantics

Semantics CLI - Unified interface for media intelligence

22
Experimental
4940 faizalichsan1337/ai-podcast-clipper-saas

🎥 Create engaging short clips from podcasts using AI to boost visibility on...

22
Experimental
4941 HQQHQ/FinetuneSpeechT5-Spanish

This repository hosts the code and resources for fine-tuning a SpeechT5...

22
Experimental
4942 speak-rs/speakly

High-performance, extensible speech recognition toolkit for Rust — OpenAI...

22
Experimental
4943 neshani/Kitten-Offline-TTS

Kitten Offline Mobile TTS Webapp

22
Experimental
4944 venusdev85/Speech-Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

22
Experimental
4945 Tanaka-zi/VoiceR

VoiceR is a Linux voice control app that lets you control games using speech...

22
Experimental
4946 type-a/speechnet

Automatic Speech Recognition

22
Experimental
4947 sajjadabbasi1383/Voice-Translation

Online translation of text and voice and scanning of images

22
Experimental
4948 gheyret/thuyg20_scripts

Script files of THUYG-20(A free Uyghur speech database Released by...

22
Experimental
4949 ashisbehera/Smart_Alarm

This project is based on text to speech alarm application.

22
Experimental
4950 sayak119/Express

Express Yourself.

22
Experimental
4951 manasmodak/SpeechRecognition

WPF App to show text-speech and speech recognition

22
Experimental
4952 Bugsbunnydev2000/Analysis-of-body-language-and-speech-in-video

Analysis of body language and speech in video with LLMs

22
Experimental
4953 boboyiyi/multi_speaker_tacotron

A TensorFlow implementation of multi speaker Tacotron speech synthesis

22
Experimental
4954 nullbyte91/amazon-polly-TTS

A Simple Text To Speech application using Amazon Polly - Excel to MP3

22
Experimental
4955 rupac4530-creator/ai-desktop-assistant

Voice-controlled AI desktop assistant | 100% local & private | Whisper +...

22
Experimental
4956 loganngarcia/chaplin-ui

Web interface for a real-time silent speech recognition tool.

22
Experimental
4957 RiccardoGrin/TerminalWhisper

Voice-to-text for Windows using OpenAI Whisper. Hold a hotkey, speak, text appears.

22
Experimental
4958 natelindev/voice-agent

Low-latency real-time terminal voice assistant with VAD, ASR, LLM, and TTS

22
Experimental
4959 artryazanov/gemini-speech-to-speech-translator

Transform your audio content into any language with high accuracy and...

22
Experimental
4960 LINSUISHENG034/Qwen3-ASR-Desktop

Modern PyQt6 desktop GUI for Qwen3-ASR with batch transcription support

22
Experimental
4961 jerrykuo7727/ASR-common-voice-zh-tw

HMM-based ASR systems trained on CommonVoice(zh-TW) using Kaldi.

22
Experimental
4962 Huzaifa-code/SpeakFlow

SpeakFlow: A React-based web app for real-time speech transcription and...

22
Experimental
4963 ZeiraxGaming/captainslog-whisper

Convert your voice to text locally using Whisper without sending data to the...

22
Experimental
4964 VirtualZer0/StreamTalkerServer

AI text-to-speech server powered by Qwen3-TTS with voice cloning, batch...

22
Experimental
4965 A5hG0/Lyrics-To-Song-Generator

Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +...

22
Experimental
4966 svn05/vietnamese-whisper-asr

Fine-tuned Whisper for Vietnamese ASR with Librosa preprocessing and Gradio demo.

22
Experimental
4967 siva-sub/pocket-tts-openapi-gpu

GPU-enhanced Pocket TTS with Remotion + TikTok captions

22
Experimental
4968 opensource-spraakherkenning-nl/ASR_NL_results

Results of Dutch ASR models, collected by the community

22
Experimental
4969 Jobijoba2000/add_dub

Automated video voice-over tool for Windows. Converts subtitles to speech...

22
Experimental
4970 burrmill/burrmill

BurrMill core

22
Experimental
4971 hritools/speech-to-text

A speech recognition library with a primary use for Russian language

22
Experimental
4972 GirlsInICT2023-Winner/smart-outdoor-activity-alerts

[Ericsson-LG] Girls in ICT 2023 Hackathon

22
Experimental
4973 alx741/kaldi_spanish_dimex100

Kaldi ASR Spanish example using the DIMEx100 corpus

22
Experimental
4974 sanbabyfrancis/sruthi

A malayalam voice assistant built using python

22
Experimental
4975 matin91/Kasko

Kasko is a Talking To-do List app, which allows the user to set up Reminders...

22
Experimental
4976 rahelmartim/IBM-STT-TTS

Project exploring IBM-watson speech-to-text and text-to-speech services in python.

22
Experimental
4977 BrotatoBoiV2/Live-Translate

Local, real-time AI translator for language immersion. Filters English,...

22
Experimental
4978 Synapsr/Selaou

Validez et corrigez vos transcriptions audio pour créer des datasets...

22
Experimental
4979 europanite/client_side_audio_transcription

A Browser-Based AI Audio Transcription Playground Powered by Whisper.

22
Experimental
4980 YuriyGuts/gdg-speech-classifier

A machine learning system that recognizes the word 'Google' in human speech...

22
Experimental
4981 Salut1231/wyoming-voice-match

🗣 Verify speaker identity and clean voice audio for accurate speech-to-text...

22
Experimental
4982 LyounJAP/TTSRadioLib

基于百度合成语音的语音合成工具类

22
Experimental
4983 xanderstevenson/community-content-pipeline

A Source of Truth for the Cisco Community Engagement, with creation and...

22
Experimental
4984 idsudd/tricahue

🦜 Tricahue: modelo de transcripción de voz especializado en español chileno

22
Experimental
4985 speechly/react-ui

A collection of React components for Speechly-powered applications

22
Experimental
4986 monish6666/avro-phonetic-go

📜 Convert Banglish to Bangla script seamlessly with this Go library,...

22
Experimental
4987 diogosapessoa/speech-to-text

Speech recognizer using xamarin monoandroid

22
Experimental
4988 ZacDair/SER_Platform_AICS

This repository contains the code to create and conduct emotion recognition...

22
Experimental
4989 mk-knight23/37-tool-text-to-speech

Production-grade Text-to-Speech utility built with Vue 3 and Web Speech API....

22
Experimental
4990 huss2342/x_news_station

turn x/twitter feed into audio

22
Experimental
4991 vinsis/speech-commands-recognition

Single word speech recognition using PyTorch

22
Experimental
4992 shr1324/orpheus-tts-docker

🔊 Deploy Orpheus TTS with ease using Docker, featuring GPU management,...

22
Experimental
4993 sglkc/live-translate

🎙️ Translate as you speak using Google Chrome's Web Speech API for speech...

22
Experimental
4994 Pierillo/hallucination-check

Pipeline automatizado que cura, redacta y envía un newsletter diario de IA...

22
Experimental
4995 bacharyehya/outloud

Beautiful TUI for text-to-speech. Gemini, OpenAI, or local. One command.

22
Experimental
4996 kilogramme/nerdpudding

Provide live AI video commentary with text-to-speech for any video source,...

22
Experimental
4997 davidsuragan/tulga-cli

TulgaCLI is a tool that allows you to chat and voice chat with virtual...

22
Experimental
4998 gkcomputers040/santa-claus-is-calling

🎅 Create magical moments with real AI phone calls from Santa, delivering...

22
Experimental
4999 ihsacm/ComfyUI-KittenTTS

Integrate KittenTTS into ComfyUI to enable fast, lightweight text-to-speech...

22
Experimental
5000 kayrugold/andyai

A self-evolving, tri-brain autonomous AI agent featuring local subconscious...

22
Experimental
« Prev 1 2 3 48 49 50 51 52 80 81 82 Next »