All Voice AI Tools

8,165 tools ranked by quality score · Page 58 of 82

Showing 5701–5800 of 8,165
# Tool Score Tier
5701 Vidyut/vidyut-tts

Streamlit frontend for Coqui-tts

19
Experimental
5702 Sid-V5/EchoSynth

Voice synthesis platform with TTS and STT. FastAPI backend, voice cloning,...

19
Experimental
5703 abpai/tts-gateway

A local text-to-speech gateway with a pluggable engine architecture

19
Experimental
5704 d1pankarmedhi/CascadeS2S

A low-latency (<5s) cascade-style speech-to-speech conversational system

19
Experimental
5705 Xalab/recognizer

Desktop app for recognizing speech offline using Vosk.

19
Experimental
5706 iuliiakr/TTS-Project-Framework

Architecture framework for building production-grade text-to-speech systems,...

19
Experimental
5707 jorelius/Speak

Speak is a command-line utility for reading text aloud or writing the audio...

19
Experimental
5708 furkankarakuz/TranslateAI

TranslateAI is a powerful real-time speech translation desktop application...

19
Experimental
5709 sriramsme/VidCaptio

video captioning software

19
Experimental
5710 dehyabi/textor-ai

A powerful Speech-to-Text API built with Django REST Framework and...

19
Experimental
5711 graphcore/whisper-ai

Speech Recognition (ASR) on Graphcore IPUs using OpenAI's Whisper

19
Experimental
5712 vpakarinen2/omnilocal

Local voice-enabled assistant.

19
Experimental
5713 powerpig99/readaloud

Local-first text-to-speech reader powered by Qwen3-TTS. 9 voices, 10...

19
Experimental
5714 princesingh-ai-dev/JARVIS-Voice-Assistant

🤖 AI-powered voice assistant with Whisper STT, Groq LLM, real-time TTS,...

19
Experimental
5715 Thijsn04/MediClear-AI

An intelligent medical translator powered by Google Gemini 2.5. Simplifies...

19
Experimental
5716 Inc44/TheTTS

Synthesize speech using state-of-the-art open and closed-source tools

19
Experimental
5717 lostvikx/reddisyte

A program to extract content from Reddit 🐛 The name is derived from reddit + parasite

19
Experimental
5718 saxil/mareen

Mareen - A privacy-focused voice assistant with 3D orb UI, powered by Ollama...

19
Experimental
5719 marcusau2/VOX-1-Audiobook-Maker

VOX-1 Audiobook Maker is a local, GPU-accelerated studio for creating...

19
Experimental
5720 ssharanyab/persona-tts

PersonaTTS is a personalized neural text-to-speech system that learns a...

19
Experimental
5721 egorsmkv/asr-datasets-cleaner

A pipeline to make ASR datasets better

19
Experimental
5722 shahruk10/go-sctk

Go CLI wrapper around SCTK binaries for word error rate evaluation and error...

19
Experimental
5723 weimeng23/audio-speech-datasets

📜 A list of various Audio/Speech datasets about Speech Recognition,...

19
Experimental
5724 brailcom/festival-czech

Czech support for Festival

19
Experimental
5725 adamelkholyy/whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper...

19
Experimental
5726 arjunbazinga/speak

Select any text and have it read out loud

19
Experimental
5727 innerNULL/simpler-distil-whisper

Simpler Distil-Whisper

19
Experimental
5728 msalhab96/AraSpot

The official implementation of the AraSpot research paper

19
Experimental
5729 JoeBiellik/speechlauncher

Very simple, yet functional voice activated launcher

19
Experimental
5730 caitunai/wake_demo

An Android project showing how to use Snowboy to wake up an app by voice

19
Experimental
5731 SprtnDio/Complete-Local-Discord-AI-Voice-Chat-Bot

AI Discord bot that acts as an insulting oracle. Ask questions by voice or...

19
Experimental
5732 Pasqual3/Stories-Teaching-Autism-Reality-storieAmiche

Innovative web platform for supporting autism and...

19
Experimental
5733 5j9/cliptalk

Clipboard monitor that converts copied text to speech (TTS) using...

19
Experimental
5734 nicremo/qwen3-tts-chunked-webui

Qwen3-TTS Voice Cloning WebUI with automatic text chunking - Optimized for...

19
Experimental
5735 muhammedsaban/coqui-xtts-v2-turkish-local

A locally running Turkish text-to-speech application developed with Coqui...

19
Experimental
5736 al-develop/SmartVocabulary

Dictionary, filled with your own words and phrases, for many languages. Uses...

19
Experimental
5737 jefrydco/text2speech-js

Wrapper around browser Text to Speech API

19
Experimental
5738 kuanyshbakytuly/camera-text-speech

Blind Text-Assistance

19
Experimental
5739 Voinic/microtts

Simple TTS library for MicroPython that works offline

19
Experimental
5740 robauto/bibli3.0

BiBli 3.0 for Raspberry Pi - Swarm Robotics and IoT Operating System - AI -...

19
Experimental
5741 khaykingleb/research-playground

Efficient ML/DL implementations across multiple domains with K3s multi-node...

19
Experimental
5742 Jyotibrat/Speech-To-Text

Speech to Text model

19
Experimental
5743 Adisol07/SharpSpeech

SharpSpeech is a free, local, open-source way to do speech and wake word recognition.

19
Experimental
5744 SSusantAchary/AI_Resources

A collection of a few interesting papers and projects I have read

19
Experimental
5745 ponchotitlan/google_text-to-speech_prompt_maker

Utility for Google Text-To-Speech batch audio files generator. Ideal for...

19
Experimental
5746 SouthernMethodistUniversity/whisper-transcription

Helm chart repo for application developed by OIT STARs students for audio...

19
Experimental
5747 tb0hdan/voiceplay

Client-side-first, music-centered, voice-controlled player

19
Experimental
5748 tzneal/gopicotts

Go wrapper around the Pico text-to-speech engine

19
Experimental
5749 shun126/VoicevoxPlayer

Voicevox plugin for Unreal Engine 4.27.2 and later / Unreal Engine 5

19
Experimental
5750 JacketsMask/Toland-Destiny-2-Bounty-Optimizer

Speech recognition to help optimize clearing bounties in Destiny 2

19
Experimental
5751 slemonide/lost

A maze exploring game with TTS messages

19
Experimental
5752 Luigi-Pizzolito/YukkuriTalk

A command-line program which uses AquesTalk10's Yukkuri TTS. Offline, single-binary.

19
Experimental
5753 jackaduma/speaker_recognition_models.pytorch

speaker recognition / speaker verification models in pytorch implementation

19
Experimental
5754 iamnortey/ninolex-gh

Open Ghanaian pronunciation dictionary for TTS and AI systems — IPA, CSV,...

19
Experimental
5755 ubisoft/ubisoft-laforge-french-homograph-dataset

Dataset for La Forge Speech Synthesis System Submission to the Blizzard...

19
Experimental
5756 tuanio/conformer-rnnt

Conformer RNN-Transducer

19
Experimental
5757 moego0/custom_KWS

End-to-end pipeline for training a custom keyword detection model with...

19
Experimental
5758 Vlad1343/Gesture-Translator

British Sign Language Translator is a real-time AI-powered system that...

19
Experimental
5759 neeraj-nagiri/Assistant-Bro-

Assistant "Bro" is a voice-controlled personal assistant that opens...

19
Experimental
5760 pl146/manga-voice-reader

AI-powered Chrome extension that reads manga speech bubbles aloud. Bubble...

19
Experimental
5761 masonintokyo/voicevox-srt-to-speak

Uses the VOICEVOX Engine API to synthesize speech from SubRip files so that each line fits within its subtitle time window.

19
Experimental
5762 UG-SEP/Text-to-speech-convertor

Blind people are not able to see, so they cannot read text with their eyes, so...

19
Experimental
5763 Thukyd/OpenAI-Spechify-Your-Docs

OpenAI-Spechify-Your-Docs is a Python project that converts text from...

19
Experimental
5764 Hexer10/HexTTS

Makes clients late-download text-to-speech sounds

19
Experimental
5765 FairyDevicesRD/droid.josee.tts

Lightweight local TTS service app compatible with the Android API

19
Experimental
5766 nmanikiran/ionic-allinone

A demo of each feature available in ionic and ionic-native

19
Experimental
5767 syedzubeen/podcasts

Podcasts.AI: Transcribe podcasts in a click and unlock a world of searchable...

19
Experimental
5768 myrmlbst/transcribe.AI

Webapp hosting machine learning models to generate downloadable audio...

19
Experimental
5769 ookgezellig/videotools

A collection of tools to cut, compress, extract, amplify and transcribe...

19
Experimental
5770 DuyguA/Interspeech2025-Smooth-Operating-LLMs-for-Disfluency

Innovative approach for modelling speech disfluencies with LLaMa and Conformer.

19
Experimental
5771 nick1udwig/ursr

UrSR: Urbit Speech Recognition

19
Experimental
5772 taeefnajib/Aximos

Aximos is an innovative AI-powered tool that transforms your content into...

19
Experimental
5773 isbendiyarovanezrin/SpeechDetection

Speech Detection 💬

19
Experimental
5774 parula-app/assistant

Parula - Digital assistant - Running entirely on your own device

19
Experimental
5775 passion-27/openai-whisper-api

A sample speech transcription app implementing OpenAI Text to Speech API...

19
Experimental
5776 ReadieFur/Stream-Tools

A stream chat tool that features AWS text to speech, voice commands, chat...

19
Experimental
5777 zguesmi/image2speech

Ethereum ready Dapp to speak your images.

19
Experimental
5778 LiamBrandt/tts_decode

A decoder for TTS files from 7 Days to Die

19
Experimental
5779 khaykingleb/automatic-speech-recognition

QuartzNet and DeepSpeech implementation for ASR

19
Experimental
5780 markus-m-u-e-l-l-e-r/CTC.ISL

ISL Speech Recognition Toolkit for training neural networks with the CTC...

19
Experimental
5781 Omitg24/IIS-ASR

Repository for Administración de Sistemas y Redes (ASR, Systems and Network Administration), a course of the...

19
Experimental
5782 CSFelix/audio-to-text

🔊 Extract Text from Audios 🔊

19
Experimental
5783 Zuellni/XTTS-Server

XTTS Server for SillyTavern.

19
Experimental
5784 kiritoInd/YouTube_Audio_Transcripter

YouTube audio transcription with Whisper AI. The script downloads audio from...

19
Experimental
5785 emirkaanozdemr/MultiLingualVoice

MultiLingualVoice is an innovative application designed to bridge language...

19
Experimental
5786 carolinezhao/speech-to-text

A Google extension for converting voice to text in real time.

19
Experimental
5787 Ahmed5attab/Grades-Assistants-

Assistant iOS application that helps teachers review their students' data and...

19
Experimental
5788 vladevelops/trainer

Your personal trainer, no yapping

19
Experimental
5789 adityajn105/google_speech_diarization_demo

A demo to show Speech Diarization (separating audio of different speakers)...

19
Experimental
5790 attwad/cdf

Worker and elasticsearch for automated College de France audio transcripts

19
Experimental
5791 trypsynth/battery-mon

macOS application that lives in your menu bar and periodically reports your...

19
Experimental
5792 dannis999/trained_SpeechRecognition

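This project backs up a complete Chinese speech recognition environment, including environment configuration and pretrained models, so it can be used directly.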

19
Experimental
5793 armados/automaticschoolbell

Automatic School Bell

19
Experimental
5794 romestylez/pocketChat

Your stream in your pocket: read, write, and moderate chat, events from...

19
Experimental
5795 tanvi355/Video-to-PDF

⚡ Convert any video of your choice to a PDF file using this Python script.

19
Experimental
5796 purarue/tts

CLI tool to convert text to speech using the StreamLabs API

19
Experimental
5797 agungmahardikka/ConnectWave

🌐 Enable seamless communication for deaf and mute individuals with...

19
Experimental
5798 daftmaple/soundboard-channel-points-v2

Second version of Twitch soundboard/TTS application, with slightly improved...

19
Experimental
5799 CingZeoi/OneCore-SAPI5

Allow calling OneCore voice engine with SAPI5

19
Experimental
5800 alextsao1999/assistant

hypermind assistant, a speech recognition assistant

19
Experimental