All Voice AI Tools

8,165 tools ranked by quality score · Page 51 of 82

Showing 5001–5100 of 8,165
# Tool Score Tier
5001 11dome11/Lucy---Virtual-Assistant

Lucy - a simple virtual assistant with speech recognition

22
Experimental
5002 laithisgood/kokoclone

Deliver fast, real-time multilingual voice cloning with an efficient neural...

22
Experimental
5003 leejgdh/GPT-SoVITS-ko

한국어 전용 GPT-SoVITS TTS 서비스

22
Experimental
5004 voothi/20250421115831-anki-gtts-player

A powerful Anki audio add-on with a 3-tier playback system: prioritizes your...

22
Experimental
5005 zoebchhatriwala/ICS-I-can-speak-

This Application Converts Your Input Text Into Speech. Developed For Windows...

22
Experimental
5006 Thatcherismkiwi946/rustfs

🌐 Build high-performance distributed object storage easily with RustFS,...

22
Experimental
5007 EDWINANGO/Synchronizer

Manage server-authoritative data channels for Roblox with automatic client...

22
Experimental
5008 SharunDeva/deep-delta-learning

🔍 Discover Deep Delta Learning, a new framework that transforms residual...

22
Experimental
5009 RJoshi141/utter

Voice capture app for Apple Watch and iPhone. Speak a thought on your wrist,...

22
Experimental
5010 LakshmiSravyaVedantham/cutto

AI Video Director for Kids' Education — describe a lesson, get a finished...

22
Experimental
5011 Bubblefox9473/AI-Waifu-Vtuber

🤖 Create a multilingual AI waifu VTuber with advanced TTS, real-time lip...

22
Experimental
5012 Ammar-create/Pollination-tools

Free AI tools hub powered by Pollinations.ai — translator, voice studio,...

22
Experimental
5013 nitrogoat74/aacs

🤖 Establish a clear standard for AI governance and accountability with the...

22
Experimental
5014 rjtsuri1000/Audio-Gain-Module-FPGA

🔊 Implement and scale audio gain in real-time using a fixed-point DSP module...

22
Experimental
5015 Twerionex/soprano-factory

🎤 Train or fine-tune your own Soprano text-to-speech models with ease using...

22
Experimental
5016 nisakson2000/Gizmo-AI

A fully local AI assistant — 9B LLM + vision on GPU, Voice Studio with voice...

22
Experimental
5017 Seda-Gtech/ai-voice-architecture

Flutter Web demo showcasing AI voice architecture — ElevenLabs TTS, Voice...

22
Experimental
5018 notvibhu8/VoiceLICT

📢 Empower LICT students to voice concerns using AI to identify common issues...

22
Experimental
5019 fabiolimace/espeak-playground

Espaço para experimentação do software espeak-ng. 🔬 🥼

22
Experimental
5020 1urelius/atlas.cam

Display live webcam video as ASCII art in the terminal with real-time edge...

22
Experimental
5021 deuxksy/today-vn-news

베트남 뉴스 자동 생성 파이프라인 (TTS, FFmpeg, Hardware Acceleration)

22
Experimental
5022 KernicDE/nova-ed-monitor

NOVA — Navigation, Operations, and Vessel Assistance for Elite Dangerous

22
Experimental
5023 zsoltfrks/multimodal-story-generator

A rather simple story generator from images with text-to-speech integration...

22
Experimental
5024 DarkSide7839/PytDm

🌐 Streamline your downloads with PytDm, a modern Python download manager...

22
Experimental
5025 kvnpetit/BetterFrenchTTS

Intelligent Android TTS wrapper optimized for French — Kotlin DSL, SSML...

22
Experimental
5026 mizunashi-mana/cc-voice-reporter

Real-time voice reporting for Claude Code — hear what Claude is doing...

22
Experimental
5027 Narasimha1997/wavenet-stt

An end-to-end speech recognition system with Wavenet. Built using C++ and python.

22
Experimental
5028 ankitiscracked/usevoiceai

the Typescript toolkit for ambitious voice AI apps

22
Experimental
5029 ccoreilly/deepspeech-catala

Deepspeech ASR Model for the Catalan Language

22
Experimental
5030 jianchang512/speech2text-df

基于Dolphin模型的东方语言音视频转字幕api及webui

22
Experimental
5031 pselvana/VoiceCrafter

Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

22
Experimental
5032 pragyak412/Improving-Voice-Separation-by-Incorporating-End-To-End-Speech-Recognition

Implementing the paper -

22
Experimental
5033 ivedants/Magic-Media-native-iOS-iPadOS-AR-App

Magic Media is an award-winning experimental native iOS/iPadOS application...

22
Experimental
5034 PineapplePie/SpeechHelper

SpeechHelper is an Android text-to-speech (TTS) library that simplifies the...

22
Experimental
5035 KISETU-ggwp/JpSignSpell

"Yubimoji-kun" is a web application that recognizes fingerspelling in...

22
Experimental
5036 MeDeity/LibBaiduTextToSpeech

一句话拥有 百度语音合成 能力

22
Experimental
5037 LexMainye/Kasuku-Transcriber

A speech to text web app for people with speech impairments that has support...

22
Experimental
5038 IseduardoRezende/IAParty

Profile/Persona Call using LLM

22
Experimental
5039 hekmon/kyutai-rs

Golang bindings to Kyutai Delayed Streams Modeling Rust productions servers

22
Experimental
5040 exyezed/audiotts-pro

Text-to-Speech generator and audio downloader supporting Azure Speech, IBM...

22
Experimental
5041 dwain-barnes/vibevoice-0.5-realtime-fastrtc-plugin

A FastRTC-compatible wrapper for Microsoft's...

22
Experimental
5042 mkpoli/wenyan-book-video

Narration video rendering pipeline for 《文言陰符》 (wenyan-book)

22
Experimental
5043 SenalDolage/object-detection-TFJS-ReactNative

A mobile application that identifies nearby objects and gives a voice output...

22
Experimental
5044 ferrinweb/voicedictation-webapi-demo

A iflytek voice dictation web api demo. 讯飞语音听写接口纯前端demo.

22
Experimental
5045 adrenak/UniSpeech

A simple to use Speech Recognition library for Unity based on the Microsoft...

22
Experimental
5046 Astralchemist/Voice-Clone-TTS

This is a text to speech model that has many various uses

22
Experimental
5047 sellorm/rsay

Make R and your Mac speak

22
Experimental
5048 smch/tts

Text to speech with web speech synthesis api and amazon polly, reads and...

22
Experimental
5049 Hrithik1122/quizilla.github.io

Quizilla is a web application, use a (Text-to-Speech) API for listening...

22
Experimental
5050 maggieezzat/speech-to-speech-translation

A flask web-page hosting a speech to speech translation demo

22
Experimental
5051 RFebrians/AI-Assistant

I/O Voice Recognition using Conditional Rendering

22
Experimental
5052 DoubleCouponDay/TextToSpeechMod

Designed for the game space engineers

22
Experimental
5053 nikkoxgonzales/streaming-tts

A streamlined, Kokoro-based text-to-speech library with streaming support.

22
Experimental
5054 SunPCSolutions/DiarASR

Enterprise-Grade Secure ASR Diarization Pipeline - HIPAA-compliant speech...

22
Experimental
5055 Ali1gamer7798/StreamXBot

Stream music in your browser with a self-hosted Telegram bot that works...

22
Experimental
5056 dlacheal/AI-VoiceAssistant

AIVA es un ecosistema de asistencia de voz de baja latencia diseñado para...

22
Experimental
5057 NguyenPhamMC/whisperer

🎤 Record and transcribe voice dictation on Linux with push-to-talk...

22
Experimental
5058 gregormcw/notable

Voice-first note capture and semantic retrieval.

22
Experimental
5059 sankalp20436/E-ceptionist

Eceptionist-A smart receptionist is a facial recognition-based monitoring...

22
Experimental
5060 harmlessman/CoquiTTSGui

Gui for users who use the coqui-TTS vits model.

22
Experimental
5061 hadihaider055/vocal-dub

Dub audio into 50+ languages using AI. Whisper transcription, Google...

22
Experimental
5062 rockywuest/kawaii-bath-assistant

🛁 Cute AI-powered bathroom assistant for M5Stack Core 2 — kawaii face,...

22
Experimental
5063 husseinnsourr/NeuralChatter

A Next-Generation Neural TTS Engine. High-quality, human-like voice...

22
Experimental
5064 NormVg/AutoCaptionGenAI

A Python project that extracts audio from video files, transcribes the...

22
Experimental
5065 sherurox/Motion-Flow

Real-time, bidirectional sign language translation — powered entirely in the...

22
Experimental
5066 PatrickFanella/soundhash

A sophisticated system for matching audio clips from videos across social...

22
Experimental
5067 DarkKnightSgh/Dotslash5.0HackAttack

Team HackAttack:Our solution combines state-of-the-art technologies to...

22
Experimental
5068 alozowski/textplease

Upload an audio/video file, configure settings, and receive a text transcript

22
Experimental
5069 gouhaha/Whisper-App

Windows Whisper transcription app (PyInstaller + ffmpeg)

22
Experimental
5070 ggegoge/PyTDM

Pytońska treść do mowy – Polish Text to Speech library for Python

22
Experimental
5071 deepgram-starters/fastapi-text-to-speech

Get started using Deepgram's Text-to-Speech with this FastAPI demo app

22
Experimental
5072 talhabinjaved/voice-ai-agents-openai-telnyx

A FastAPI starter that turns a Telnyx phone number into a realtime,...

22
Experimental
5073 priya-kumari-04/-MindfulMate

Nurturing Mental Wellness Together

22
Experimental
5074 jina-ai/executor-coquiTTS

Executor that leverages CoquiTTS engine for text2speech

22
Experimental
5075 igorovh/tts

📢 !tts command for twitch.tv/kick.com

22
Experimental
5076 yxwyoyoyo/xf-tts

讯飞在线语音合成

22
Experimental
5077 Erenyegar2/modular-auto-specch-recog-toolkit

🎤 Build and deploy advanced automatic speech recognition systems with this...

22
Experimental
5078 Superx11179/DC-Speech-VAE

🎤 Compress speech to 5 Hz with DC-Speech-VAE, ensuring high perceptual...

22
Experimental
5079 soanseng/voxpen-android

AI voice keyboard for Android — speak naturally, get polished text. Whisper...

22
Experimental
5080 Orca0917/TransformerTTS

Unofficial PyTorch implementation of Transformer-TTS, a Transformer-based...

22
Experimental
5081 analyticsinmotion/micstream

Cross-platform microphone audio capture for Node.js with pre-built...

22
Experimental
5082 loglux/SpeakItAI

Convert text to speech using Microsoft Azure Neural Text-to-Speech (TTS) and...

22
Experimental
5083 Jahangirbd23/WenetSpeech-Yue

📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich...

22
Experimental
5084 phith0n/v2srt

v2srt 是一个基于人工智能的视频字幕生成工具,为任意视频生成高质量的字幕文件。

22
Experimental
5085 biraj21/open-voice

Open Source Voice AI Infrastructure with WebRTC backend, and web and mobile...

22
Experimental
5086 bseceenn/Fun-CosyVoice3-0.5B-2512-Deploy

🎤 Deploy a simplified voice synthesis service with Fun-CosyVoice3-0.5B-2512,...

22
Experimental
5087 BlackRoad-OS/whisper.cpp

Fork of whisper.cpp — speech-to-text inference for BlackRoad edge devices

22
Experimental
5088 bdcorps/VideoNews

An app experiment to develop a dynamic world news channel app

22
Experimental
5089 Zer0pa/ZPE-Prosody

ZPE-Prosody V0.0: DETERMINISTIC SPEECH PROSODY CODEC: Intonation | Rhythm |...

22
Experimental
5090 Salama1429/Text-to-speech_TTS_Model_Training

Training Text to speech model for German Language

22
Experimental
5091 Alex2135/ASR-proto

Implemintetion of linear attention conformer - LAC

22
Experimental
5092 hongkongkiwi/elevenlabs-cli

Community-built CLI for the ElevenLabs AI audio platform with TTS, STT,...

22
Experimental
5093 Ushaflow/merge-ssml

Combine multiple SSML documents in JS

22
Experimental
5094 lugia19/Echo-XI

Speech to text to speech using Elevenlabs

22
Experimental
5095 carmen-martin/Deep-Keyword-Spotting

A Small Footprint implementation of Keyword Spotting with different architectures.

22
Experimental
5096 ringabout/scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

22
Experimental
5097 praneethpj/Unity-Android-Utilities

Open Source Unity-Android Platform Voice Text API and Text To Voice API.

22
Experimental
5098 antouanbg/Bulgarian_Linguistic

Collection and resources for Bulgarian Corpus, Datasets and Models used in...

22
Experimental
5099 katejay/Text-To-Speech

An android app for text to speech.

22
Experimental
5100 Mwamwaaaa/opentypeless

Provide seamless AI voice input for desktop to convert speech into clear,...

22
Experimental
« Prev 1 2 3 49 50 51 52 53 80 81 82 Next »