All Voice AI Tools

8,165 tools ranked by quality score · Page 49 of 82

Showing 4801–4900 of 8,165
# Tool Score Tier
4801 yanorei32/libktts-server

A modern HTTP wrapper for the legacy KTTS Project Korean text-to-speech...

22
Experimental
4802 seanghay/khmertagger

KhmerTagger: Inverse Text Normalization for Khmer Automatic Speech Recognition

22
Experimental
4803 cr2007/cambai-python

Python SDK for the CambAI API

22
Experimental
4804 michsethowusu/kasanoma

Offline-first TTS models for African languages

22
Experimental
4805 mrkhachaturov/OpenAITTSKit

Minimal streaming TTS client for OpenAI audio/speech API. Swift 6.2, zero...

22
Experimental
4806 aminshamim/textream-ios

Textream iOS — Teleprompter app for iPhone/iPad with Director Mode, smooth...

22
Experimental
4807 hakancangunerli/hanyupinyin-helper

pinyin-helper for simple translation

22
Experimental
4808 kevin0818-lxd/ielts-speaking-coach

AI-powered IELTS Speaking practice app with Whisper ASR, IELTS band scoring,...

22
Experimental
4809 DanielPartida610/Realtime-Voice-Chat-System

🎤 Build a real-time voice chat system with text messaging, voice messages,...

22
Experimental
4810 rashedyasen/voice-assistant-v2

🗣️ Enhance your productivity with a local, event-driven voice assistant that...

22
Experimental
4811 vrajpatel30/homeassistant-voice-recipes

Enable local, GPU-accelerated voice control for Home Assistant with no cloud...

22
Experimental
4812 JhoneCasali/llm-batch

Process batches of large language model tasks efficiently using...

22
Experimental
4813 Nik-Kras/Live_ASR_Whisper_Gradio

Real Time Speech To Text with corrections powered by Gradio

22
Experimental
4814 AlbertSebastain/RobustConformer

Robust speech recognition using teacher-student learning

22
Experimental
4815 ahamfrank835-coder/vocabulary

📚 Discover essential tools and resources to enhance your German vocabulary...

22
Experimental
4816 hebbihebb/Audiopub

Audiopub transforms any EPUB into a clean, high-quality audiobook

22
Experimental
4817 Itachikoko/voice-task-manager

🗣️ Manage tasks effortlessly using voice commands with our AI-driven app,...

22
Experimental
4818 Nur-syafira/ai-agent-tts

🔊 Build a low-latency voice AI agent with streaming ASR and TTS using...

22
Experimental
4819 liu-dongfang/clinical-interview-voice-agent

Voice agent prototype for structured clinical interviewing, with VAD-based...

22
Experimental
4820 ruslantau/media-annotator

Web-based annotation tool for media data. The easiest way to create you own...

22
Experimental
4821 RhinoDevel/mt_tts

Pure C wrapper library to use Piper TTS with Linux and Windows as simple as possible.

22
Experimental
4822 WelkinYang/Tacotron2-pytorch

Tacotron2 implemented by pytorch

22
Experimental
4823 whitehatboy005/Virtual-Voice-Assistant

It's Python-based virtual assistant capable of performing various tasks...

22
Experimental
4824 Prashant-Surya/quintal

A quiz generator application that currently uses Wikipedia content to...

22
Experimental
4825 isothermal-capitalgainstax520/Whisper-Transcriber

🎤 Transcribe audio and video files into text or subtitles effortlessly on...

22
Experimental
4826 GodzCursed/whisper-vtt2srt

🎥 Convert WebVTT to SRT easily, refining messy AI transcripts into clear...

22
Experimental
4827 DmitryCherneckiy/text-to-speech

Telegram bot. Turns yours text message into a voice message.

22
Experimental
4828 blamarche/VoiceGoban

A voice command tool to play go (board game) on any windows go server.

22
Experimental
4829 marvitek0/Talk-to-Typer

Experience voice typing with Talk-to-Typer, a kid-friendly app that helps...

22
Experimental
4830 jasonwhwang/tensorflow_micro_speech_mbed

Tensorflow Micro Speech Example using Mbed (STM32F49ZI, NUCLEO-F429ZI)

22
Experimental
4831 fizoxt/openwhisper-app

Transcribe speech to text on macOS locally and offline with OpenWhisper, a...

22
Experimental
4832 stanlsv/sayboard

Privacy-first AI voice keyboard for iOS that turns speech into ready-to-send...

22
Experimental
4833 facejungle/fj_chat_to_speech

FJ Chat to Speech. Text To Speech: YouTube, Twitch

22
Experimental
4834 JJsilvera1/STT-Windows

An easy STT option to dictate text with your voice to your cursor using...

22
Experimental
4835 XiaoYi2018/OfflineRealtimeTranslator

Fully offline Android real-time Russian-to-Chinese simultaneous interpreter...

22
Experimental
4836 Hayder-IRAQ/srt-to-podcast

🎙️ Convert multilingual SRT subtitles (Arabic/Russian/English) into podcast...

22
Experimental
4837 Shuichi346/ja-dubbing

英語動画を話者の声質を保ったまま日本語吹替動画に変換。MioTTSによるボイスクローニング、PLaMo-2翻訳、2種のASRエンジン(Whisper /...

22
Experimental
4838 alexbhas/accessible-tts

Allows the user to input text or pdf/docx files and get a text to speech...

22
Experimental
4839 ywatanabe1989/scitex-audio

Text-to-Speech with Multiple Backend Fallback (elevenlabs → luxtts → gtts → pyttsx3)

22
Experimental
4840 xuan139/ai-publisher-local-studio

Local audiobook production studio MVP with FastAPI, SQLite, review workflow,...

22
Experimental
4841 mahathir444/yc-ui

Modern UI component library for Vue.js. `yc-ui` offers reusable,...

22
Experimental
4842 wkdrns202/TTSDataSetCleanser

TTSDataSetCleanser. This program can do the labeling work for the Raw Speech...

22
Experimental
4843 jgawrylkowicz/heyPi

A CLI voice assistent written in Python based on the SpeechRecognition package

22
Experimental
4844 Amir79Naziri/TextNormalization_Project

Implementing text normalization for Farsi(Persian) language.

22
Experimental
4845 glloydie/flowtts-byok

🔊 Streamline voice synthesis with FlowTTS BYOK, leveraging Tencent's FlowTTS...

22
Experimental
4846 ShunsukeHayashi/voicebox-tts

VOICEVOX音声生成キューイングシステム (Celery + Redis)

22
Experimental
4847 thibault-roux/metric-evaluator

Metric evaluator for Automatic Speech Recognition using the HATS dataset

22
Experimental
4848 vitomarcorubino/Parkinsons-detection

CNN and Attention Mechanisms for Parkinson's Diagnosis and Speech Deficit Detection

22
Experimental
4849 Cyrostar/ITTS-TR

An end-to-end, highly optimized Text-to-Speech (TTS) framework based on...

22
Experimental
4850 Zaid440/cosyvoice-docker

🎙️ Deploy a production-ready Text-to-Speech service with voice cloning and a...

22
Experimental
4851 gistrec/ClearTranscriptBot

Get the text from your video/audio with a simple Telegram bot — fast and easy

22
Experimental
4852 Ghalwash123/MiMo-Audio-Training

🔊 Train audio models efficiently with MiMo-Audio-Training, a toolkit...

22
Experimental
4853 jaketae/conformer

PyTorch implementation of Conformer: Convolution-augmented Transformer for...

22
Experimental
4854 winccoa/winccoa-ae-ts-text2speech

WinCC OA Text-To-Speech Library

22
Experimental
4855 navalnica/wav2vec2-belarusian

Speech to Text model for Belarusian language

22
Experimental
4856 Konstantinos123456789/JARVIS_AI

A modular Python AI Assistant (Jarvis) featuring Knowledge Graphs...

22
Experimental
4857 ankurs18/vspeak

A VS Code extension that offers voice based commands for a mouse-free coding...

22
Experimental
4858 AssemblyAI-Community/intro-to-espnet

Getting Started with ESPnet | AssemblyAI

22
Experimental
4859 frankcholula/sapr

Speech & Audio Processing & Recognition 🗣️

22
Experimental
4860 berangerthomas/ASR.lab

Benchmarking platform for automatic speech recognition models

22
Experimental
4861 sidgupta234/Indian_English_ASR

An Indian English ASR system based on Hidden Markov Models (HMM) has been...

22
Experimental
4862 masantoro/NETCore-Telemetry-LUIS-Speech-Recognition

Telemetry, IA and Speech Recognition | .NET Core | LUIS Microsoft |...

22
Experimental
4863 fr0stb1rd/Edge-TTS-Subtitle-Dubbing

High-performance SRT to Audio Dubbing tool using Microsoft Edge TTS with...

22
Experimental
4864 Anwarvic/mTEDx_auxiliary

These are different files I created to do different tasks when I was working...

22
Experimental
4865 AliceAuto/obsidian-auto-word-audio

一个为 Obsidian 单词笔记自动添加音频发音的插件

22
Experimental
4866 TheoTech/spf.io

spf.io is a platform providing real time captions and translations of live events

22
Experimental
4867 yashasviyadav30/Omnibox

📦 AI-powered CLI utility with voice support - One Tool, Infinite Possibilities

22
Experimental
4868 DarkOracle10/Video-to-Persian-Translator---Professional-AI-Translation-Pipeline

Professional-AI-Translation-Pipeline

22
Experimental
4869 chirag127/SpeechFlow-AI-Powered-Text-to-Speech-Browser-Extension

AI-powered text-to-speech browser extension. Transforms web content into...

22
Experimental
4870 TheMadMartina/Nexa

Nexa is a Python AI voice assistant leveraging speech recognition and...

22
Experimental
4871 neosapience/typecast-python

The official Python SDK for the Typecast API.

22
Experimental
4872 raghavkumar06/jarvis-ai-assistant

Python-based voice assistant that performs tasks using speech recognition...

22
Experimental
4873 Kit4Some/Voice_opencode

The open source vibe_voice coding agent.

22
Experimental
4874 eGroupAI/speech-integration-starter

Public-safe starter kit for Whisper integration

22
Experimental
4875 RykerWilder/jarvis

Just A Rather Very Intelligent System

22
Experimental
4876 wanghao15536870732/ChatWithEveryone

🚧The Internet + project YiLuYuBan.The project is too messy, has moved to...

22
Experimental
4877 hecx333/edge-tts-go

一个用于 Microsoft Edge 在线文本转语音服务的 Go 语言库。 本项目允许您免费使用 Microsoft Edge 的高质量神经 TTS 语音。

22
Experimental
4878 oueslati1990/Audiobook-Generator

AI-powered PDF to audiobook converter with LangGraph workflow orchestration....

22
Experimental
4879 DasariJayanth/Sign-to-Speech

sign-to-speech

22
Experimental
4880 dnyanshwalwadkar/SIMHA-Personal-Assistant-using-Artificial-intelligence

The rise of automation, along with increased computational power, novel...

22
Experimental
4881 tanzita/tf_asr

Improving Deep Neural Networks Based Speech Recognition System For Far-field Speech

22
Experimental
4882 uqqu/sync_book

audiobook generator with smart personalized translation

22
Experimental
4883 juanjosehr14/YingMusic-SVC

🎤 Transform singing voices effortlessly with YingMusic-SVC, a robust...

22
Experimental
4884 vinaymhubli/Flexa

Flutter-Alexa Application

22
Experimental
4885 JuliusFx131/Mozilla-Common-Voice-STT-Challenge

This is a web service that allows people with medical Issues describe them...

22
Experimental
4886 alexniemiz1/listnr

🎵 Enjoy a modern terminal-based music player that supports multiple audio...

22
Experimental
4887 motazsaad/jsc-news-broadcast

JSC news broadcast (speech corpus)

22
Experimental
4888 deepgram-starters/deno-text-to-speech

Get started using Deepgram's Text-to-Speech with this Deno demo app

22
Experimental
4889 kanugoyal/Virtual-Assistant

Virtual Assistant built using python libraries. It does almost anything...

22
Experimental
4890 iamabeljoshua/Cali

Cali: A simple virtual assistant that demonstrates how to use Google Speech...

22
Experimental
4891 techiaith/docker-coqui-tts-cy

Lleisiau synthetig testun i leferydd dwyieithog Cymraeg a Saesneg // //...

22
Experimental
4892 mariomastrandrea-poli/payments-vocal-assistant

Official repository of my Master's Thesis project: "Developing an AI-Powered...

22
Experimental
4893 NatGr/dc-tts-pytorch

pytorch implementation of dc-tts enabling mixed precision training and...

22
Experimental
4894 WildCraftsmanFilter/AI-Voice-Changer-Real-Time-Desktop

⭐️ AI Voice-Changer Real-Time 2026 is advanced AI voice changer software...

22
Experimental
4895 lif3time-secr3t-c0de/Meeting-Memory

Record meetings, transcribe with Whisper, extract action items, and send...

22
Experimental
4896 PMS61/Eloquence

An AI driven public speaking tutor guiding users towards improving their...

22
Experimental
4897 iam-smjamilsagar/Speech-Assistant

Today we will learn how to make speech assistant in Python.

22
Experimental
4898 lucianosimoni/ai-interviewer-client

React.js ▪️ Vite ▪️ TailwindCSS ▪️ Speech Recognition 🎙️

22
Experimental
4899 mayurkadampro/ChatterBoy-Chatbot

ChatterBoy - Basic Conversation Chatbot

22
Experimental
4900 sasharun/awesome-faceless

A curated list of 50+ AI tools for faceless YouTube content creators. Voice,...

22
Experimental
« Prev 1 2 3 47 48 49 50 51 80 81 82 Next »