All Voice AI Tools

8,165 tools ranked by quality score · Page 19 of 82

Showing 1801–1900 of 8,165
# Tool Score Tier
1801 evilC/HotVoice

Adds Speech Recognition support to AutoHotkey, via a C# DLL

38
Emerging
1802 ElmTran/praises

Praises is a text-to-speech tool that can help you read text easily.

38
Emerging
1803 falabrasil/kaldi-br

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

38
Emerging
1804 Proteusiq/saa

Making Time Speak! 🎙️

38
Emerging
1805 mxvsh/wave

Native macOS dictation app focused on fast voice-to-text workflows.

38
Emerging
1806 eminemahjoub/pdf-voice-reader

"PDF Reader: A Python application for seamless PDF viewing with enhanced...

38
Emerging
1807 noco-ai/spellbook-docker

AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many...

38
Emerging
1808 lars76/fastspeech2-clean

Clean and modernized implementation of FastSpeech2/LightSpeech using IPA

38
Emerging
1809 CMsmartvoice/One-Shot-Voice-Cloning

:relaxed: One Shot Voice Cloning base on Unet-TTS

38
Emerging
1810 ckaytev/tgisper

Telegram bot with ASR

38
Emerging
1811 1038lab/ComfyUI-MegaTTS

A ComfyUI custom node based on ByteDance MegaTTS3, enabling high-quality...

38
Emerging
1812 soldier444xd/KittenTTS

KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with...

38
Emerging
1813 mdingena/att-voodoo

A community-made magic mod for A Township Tale, a VR MMORPG game.

38
Emerging
1814 Citadawn/VoiceDAO

语道 (VoiceDAO) - 专注于文本转语音功能的 Android 应用

38
Emerging
1815 telecombcn-dl/2018-dlsl

UPC Deep Learning for Speech and Language 2018

38
Emerging
1816 CarrotYuan/openclaw-voice-control

A macOS local voice-control companion for OpenClaw with Siri-like wakeword...

38
Emerging
1817 paladini/voice-separator-demucs

A simple and efficient self-hosted application to separate vocals from music...

38
Emerging
1818 deepgram-devs/dg-translation-chrome-ext

A TypeScript chrome extension that uses Deepgram to provide live...

38
Emerging
1819 andi611/CS-Tacotron-Pytorch

Pytorch implementation of CS-Tacotron, a code-switching speech synthesis...

38
Emerging
1820 AndroidCodility/SpeechToText

Android application to text through which you can provide speech input to...

38
Emerging
1821 HelloChatterbox/py_responsivevoice

unoficial python api for responsive voice

38
Emerging
1822 GloomyGrave/Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis...

38
Emerging
1823 OpenVoiceOS/ovos-tts-plugin-beepspeak

experiment adding new r2d2 tts engine for mycroft

38
Emerging
1824 leduckhai/wav2graph

wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

38
Emerging
1825 QuantiusBenignus/BlahST

Input text from speech in any Linux window, the lean, fast and accurate way,...

38
Emerging
1826 SpeechColab/Leaderboard

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform...

38
Emerging
1827 alam025/ai-voice-assistant-appointment-booking

Enterprise-grade AI voice assistant for automated appointment scheduling...

38
Emerging
1828 Kyubyong/specAugment

Tensor2tensor experiment with SpecAugment

38
Emerging
1829 AA-Factory/aafactory-prototype

⚡ AI Avatar Factory is an interface for creating and managing AI avatars. ⚡

38
Emerging
1830 xingchensong/Speech-Transformer-tf2.0

transformer for ASR-systerm (via tensorflow2.0)

38
Emerging
1831 asiff00/Training-TTS

Train and finutune text-to-speech models for Bengali and many other languages!

38
Emerging
1832 AI-TOOLKIT/VoiceBridge

VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit

38
Emerging
1833 funway/audible-epub3-maker

Generate audiobooks from plain EPUB files in EPUB 3 Media Overlays format...

38
Emerging
1834 iceychris/LibreASR

:speech_balloon: An On-Premises, Streaming Speech Recognition System

38
Emerging
1835 instavar/qwen3-tts-lora-finetuning

Qwen3‑TTS LoRA fine‑tuning tools (companion repo) for custom voice adaptation

38
Emerging
1836 ondrejklejch/learning_to_adapt

Coordinate-wise meta-learner for speaker adaptation of ASR models.

38
Emerging
1837 fcjr/ltts

Quick CLI for local text-to-speech using Qwen3-TTS or Kokoro TTS.

38
Emerging
1838 Harsh-0-7/PDF-Reader

PDF reader with read aloud feature

38
Emerging
1839 siddhant-vij/Health-Fitness-Tracker

Health & fitness app with natural language processing, custom...

38
Emerging
1840 gkrsv/split_audio

A rough and ready Python utility which splits audio files based on silence...

38
Emerging
1841 scarletcho/prep4kaldi

Data preparation code for building Kaldi ASR system

38
Emerging
1842 ayshrv/memento-app

Android App which serves as an AI assistant for human memory

38
Emerging
1843 krestaino/prankstr

📞 Prank your friends with text-to-speech phone calls powered by Twilio and...

38
Emerging
1844 sskorol/vosk-api-gpu

Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC

38
Emerging
1845 bedriyan/speaky

Voice-to-text for macOS, powered by on-device AI. Press a hotkey, speak, and...

38
Emerging
1846 jbmiller10/transcribrr

Transcribrr is a python desktop gui application that uses transcribes ...

38
Emerging
1847 tochilkinva/tg_bot_stt_tts

Telegram bot with voice message recognition and generation. Speech to Text...

38
Emerging
1848 naeruru/mimiuchi

a free, customizable, osc capable speech-to-text interface for relaying text...

38
Emerging
1849 JSON2Video/json2video-php-sdk

Video automation with PHP: add watermarks, resize videos, create slideshows,...

38
Emerging
1850 kroko-ai/kroko-onnx

Kroko ASR - Speech-to-text

38
Emerging
1851 aiola-lab/drax

Drax: Speech Recognition with Discrete Flow Matching

38
Emerging
1852 taresh18/orpheus-streaming

Orpheus TTS Server with streaming support (TTFB ~160ms)

38
Emerging
1853 HawkAaron/RNN-Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction...

38
Emerging
1854 amadeomano/persian-tts

🔊 A simple human-based text-to-speach synthesiser and ReactNative app for...

38
Emerging
1855 kaiaai/kaia.js

Kaia.ai platform's JS client library

38
Emerging
1856 rxlabz/sytody

a Flutter "speech to todo" app example

38
Emerging
1857 ericc-ch/edge-tts

Use Microsoft Edge's online text-to-speech service from JS code directly!

38
Emerging
1858 hutchresearch/latex2speech

TeX2Speech is an application that turns LaTeX documents into spoken audio.

38
Emerging
1859 BraceYourselfGames/UE-BYGTextToSpeech

A plugin that uses the Windows Speech API to speak text in Unreal Engine 4.

38
Emerging
1860 sexfrance/RecaptchaV2-Solver

A Python-based solution for solving Google's reCAPTCHA v2 challenges...

38
Emerging
1861 UFOAlastor/AI-Waifu-Project-LaIN

一个拥有长期记忆, 表情动作, 语音对话/打断/声纹识别, FunctionCall, 多模型支持的AI Waifu客户端.

38
Emerging
1862 AsaoluElijah/say-it

A mobile web application that helps you convert spoken words to...

38
Emerging
1863 Ronik22/Voice-Controlled-Email

A python-based voice-controlled email application for visually impaired persons.

38
Emerging
1864 ng-web-apis/speech

A library for using Web Speech API with Angular

38
Emerging
1865 zalo/OpenAI-Voice

A simple proof of concept for voice-to-voice interaction.

38
Emerging
1866 dokterbob/macos-speech-server

Local, fast and efficient Speech to Text (STT) and Text to Speech (TTS) on...

38
Emerging
1867 lcraver/ProxiTalk

This is the repo for ProxiTalk OS. ProxiTalk is a custom operating system...

38
Emerging
1868 aidayang/LatentSync-OneClick

免费视频对口型软件LatentSync一键启动整合包

38
Emerging
1869 bhashini-ai/bhashini-api-examples

Sample programs for calling Bhashini.ai REST/WebSocket APIs - TTS, STT/ASR,...

38
Emerging
1870 mozilla/deepspeech-playbook

A crash course for training speech recognition models using DeepSpeech.

38
Emerging
1871 Fooftilly/kokoro-extension

Send text from browser to Kokoro-FastAPI for TTS generation

38
Emerging
1872 Better-Player/espeakng-sys

Rust bindings to eSpeak NG

38
Emerging
1873 cristofima/AI-Tech-Interview-Preparation

An AI-powered technical interview preparation platform that generates...

38
Emerging
1874 karrarkazuya/ArabicTTS

ArabicTTS (TextToSpeech) Android library with a sample

38
Emerging
1875 HCI-LAB-UGSPEECHDATA/speech_data_ghana_ug

The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani,...

38
Emerging
1876 hcy71o/SC-CNN

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker...

38
Emerging
1877 Frida7771/PyVoice

A Python-based speech processing tool that supports both speech-to-text...

38
Emerging
1878 speechsuper/SpeechSuper-API-Samples

Deep learning based speech and pronunciation assessment API for 8 languages.

38
Emerging
1879 botbahlul/whisper_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using...

38
Emerging
1880 IBM/text-to-speech-code-pattern

WARNING: This repository is no longer maintained

38
Emerging
1881 wannaphong/KhanomTan-TTS-v1.0

KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that...

38
Emerging
1882 sciforce/phones-las

Articulatory features estimation using Listen Attend and Spell architecture.

38
Emerging
1883 sayak-brm/espeakng-python

An eSpeak NG TTS binding for Python3.

38
Emerging
1884 henry-richard7/Natural-Text-to-Speech

This python program uses https://naturaltts.com API to convert given text to...

38
Emerging
1885 manhph2211/ViSR

This repo builds an end-to-end deep learning application that supports...

38
Emerging
1886 AkishinoShiame/Chinese-Speech-Emotion-Datasets

Datasets of A Deep Convolutional Neural Network Based Virtual Elderly...

38
Emerging
1887 jenswittmann/CurlyFramework

Tiny Framework for accessibility and sustainability, not only for MODX or Kirby CMS.

38
Emerging
1888 tmanderson/ivona-node

Ivona Cloud (via Amazon services) client library for Node

38
Emerging
1889 HnDK0/NoveLA

Free Android reader for web novels, light novels, ranobe & EPUB. 25+...

38
Emerging
1890 npuichigo/ttsflow

tensorflow speech synthesis c++ inference for voicenet

38
Emerging
1891 andi611/ZeroSpeech-TTS-without-T

A Pytorch implementation for the ZeroSpeech 2019 challenge.

38
Emerging
1892 askrella/speech-rest-api

Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)

38
Emerging
1893 alan-ai/alan-sdk-reactnative

The Self-Coding System for Your App — Alan AI SDK for React Native

38
Emerging
1894 nexmo-community/voice-azure-speechtotext-py

Sample Code for Realtime Transcription using Nexmo, Microsoft Azure Speech...

38
Emerging
1895 i4Ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.

38
Emerging
1896 Deepak5j/PyTranscriber

Speech to Text

38
Emerging
1897 persiandataset/PersianSpeech

Persian ASR dataset

37
Emerging
1898 asmith26/speech2caret

Use your speech to write to the current caret position!

37
Emerging
1899 masonthemaker/saidwell

Open Source Voice AI Dashboard

37
Emerging
1900 Kalebu/image-to-sound-python-

A python project for converting an Image into audible sound using OCR and...

37
Emerging
« Prev 1 2 3 17 18 19 20 21 80 81 82 Next »