All Voice AI Tools

8,165 tools ranked by quality score · Page 4 of 82

Showing 301–400 of 8,165
# Tool Score Tier
301 taigrr/elevenlabs

ElevenLabs Artificial Voice Synthesis Client

53
Established
302 kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

53
Established
303 deepgram-starters/node-transcription

Get started using Deepgram's Transcription with this Node demo app

53
Established
304 Agents365-ai/video-podcast-maker

AI-powered video podcast creation skill for coding agents. Supports Bilibili...

53
Established
305 EveryVoiceTTS/EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

53
Established
306 aedocw/epub2tts

Turn an epub or text file into an audiobook

53
Established
307 BolajiAyodeji/chat-with-siri

🤖 A text-to-speech chatbot built using Nextjs, OpenAI, and ElevenLabs.

53
Established
308 BoltzmannEntropy/MimikaStudio

MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic...

53
Established
309 deepgram-starters/node-voice-agent

Get started using Deepgram's Voice Agent with this Node demo app

53
Established
310 yanorei32/discord-tts

TTS Discord Bot [VOICEROID, VOICEVOX, AivisSpeech, kttsproject, WinRT, and...

53
Established
311 nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

53
Established
312 PaciStardust/HOSCY

Companion for OSC and Communication

53
Established
313 unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

53
Established
314 Macoron/whisper.unity

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

53
Established
315 echogarden-project/echogarden

Cross-platform speech toolset, used from the command-line or as a Node.js...

53
Established
316 ciffelia/koe

Discord read-aloud (text-to-speech) bot

53
Established
317 primepake/wav2lip_288x288

Wav2Lip version 288 and pipeline to train

53
Established
318 Weilbyte/tiktok-tts

Generate TikTok Text-to-Speech voices in your browser

52
Established
319 abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...

52
Established
320 TuananhCR/Dia-Finetuning-Vietnamese

TTS Dia finetuning for Vietnamese

52
Established
321 adrianlyjak/obsidian-aloud-tts

Obsidian TTS Plugin

52
Established
322 deepgram-devs/nextjs-text-to-speech

Get started using Deepgram's Text-to-Speech with this Next.js demo app

52
Established
323 PrzemyslawSwiderski/python-gradle-plugin

Gradle plugin to run Python projects.

52
Established
324 jonatasgrosman/huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

52
Established
325 FENRlR/MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

52
Established
326 mathigatti/midi2voice

Singing synthesis from MIDI file

52
Established
327 HeyWillow/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive...

52
Established
328 robdmac/talkito

TalkiTo lets developers interact with AI systems through speech across...

52
Established
329 scarletcho/KoLM

Korean text normalization and language preparation package for LM in...

52
Established
330 misyaguziya/VRCT

VRCT(VRChat Chatbox Translator & Transcription)

52
Established
331 reazon-research/ReazonSpeech

Massive open Japanese speech corpus

52
Established
332 yeyupiaoling/YeAudio

Audio tools for Python

52
Established
333 mlalma/KokoroTestApp

Test application for Kokoro TTS model

52
Established
334 OpenVoiceOS/ovos-tts-plugin-cotovia

Galician TTS plugin for OVOS

52
Established
335 soniqo/speech-swift

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and...

52
Established
336 Thiagohgl/ai-pronunciation-trainer

This tool uses AI to evaluate your pronunciation.

52
Established
337 zaigie/FunSpeech

Out-of-the-box speech service for local, private deployment; quickly sets up FunASR and CosyVoice2/3 backends

52
Established
338 saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper...

52
Established
339 ArdaGnsrn/elevenlabs-laravel

This is an Open Source PHP Laravel package for ElevenLabs Text to Speech API.

52
Established
340 asiff00/On-Device-Speech-to-Speech-Conversational-AI

This is an on-CPU real-time conversational system for two-way speech...

52
Established
341 alphacep/awesome-russian-speech

Russian speech technology links

52
Established
342 h5p/h5p-speak-the-words

Create questions answered through speech

52
Established
343 lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

52
Established
344 thorstenMueller/Thorsten-Voice

Thorsten-Voice: A free to use, offline working, high quality German TTS...

52
Established
345 pszemraj/vid2cleantxt

Python API & command-line tool to easily transcribe speech-based video files...

52
Established
346 stefantaubert/pinyin-to-ipa

Command-line interface and Python library to transcribe pinyin to IPA. The...

52
Established
347 JSchmie/ScrAIbe-WebUI

WebUI for ScrAIbe

52
Established
348 manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...

52
Established
349 voicegain/platform

Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)

52
Established
350 mgonzs13/audio_common

A PortAudio based audio_common with text to speech for ROS 2

52
Established
351 FunAudioLLM/SenseVoice

Multilingual Voice Understanding Model

52
Established
352 react-native-voice/voice

🎤 React Native Voice Recognition library for iOS and Android...

51
Established
353 shhossain/BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla...

51
Established
354 readium/speech

💬 A TypeScript library for implementing read aloud on the Web

51
Established
355 Sharrnah/whispering-ui

Native UI for the Whispering Tiger project -...

51
Established
356 canopyai/Orpheus-TTS

Towards Human-Sounding Speech

51
Established
357 pannous/tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework,...

51
Established
358 dangvansam/viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

51
Established
359 RageAgainstThePixel/com.rest.elevenlabs

A non-official Eleven Labs voice synthesis client for Unity (UPM)

51
Established
360 MasuRii/opencode-smart-voice-notify

🔊 Smart voice notification plugin for OpenCode with multiple TTS engines...

51
Established
361 athena-team/athena

An open-source implementation of a sequence-to-sequence based speech processing engine

51
Established
362 Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

51
Established
363 pnnbao97/Kani-TTS-Vie

Fast Vietnamese TTS. 370M params, 3-second inference.

51
Established
364 bambocher/pocketsphinx-python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

51
Established
365 HA6Bots/Automatic-Youtube-Reddit-Text-To-Speech-Video-Generator-and-Uploader

A series of 3 programs that will automatically receive scripts from Reddit,...

51
Established
366 google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural...

51
Established
367 alexa-pi/AlexaPi

Alexa client for all your devices! # No active development. PRs welcome #...

51
Established
368 vannu07/jarvis

🤖 Jarvis - AI Voice Assistant with Face Recognition | Hacktoberfest 2025...

51
Established
369 spring-media/TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...

51
Established
370 TheStageAI/TheWhisper

Optimized Whisper models for streaming and on-device use

51
Established
371 WhiteMagic2014/tts-edge-java

Java SDK for Edge Read Aloud

51
Established
372 whitphx/streamlit-stt-app

Real time web based Speech-to-Text app with Streamlit

51
Established
373 transcriptionstream/transcriptionstream

Turnkey self-hosted offline transcription and diarization service with LLM summary

51
Established
374 yuvraj108c/ComfyUI-Whisper

Transcribe audio and add subtitles to videos using Whisper in ComfyUI

51
Established
375 mallorbc/whisper_mic

Project that allows one to use a microphone with OpenAI whisper.

51
Established
376 codeforequity-at/botium-speech-processing

Botium Speech Processing

51
Established
377 keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with...

51
Established
378 zai-org/GLM-ASR

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

51
Established
379 xiangyuecn/Recorder

HTML5 JS audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats; supports PC and some Android and iOS browsers, Hybrid...

51
Established
380 ekwek1/soprano

Soprano: Instant, Ultra-Realistic Text-to-Speech

51
Established
381 BolisettySujith/J.A.R.V.I.S

A voice assistant 🗣️ which can be used to interact with your computer 💻 and...

51
Established
382 ArkanDash/Multi-Model-RVC-Inference

RVC Inference with multiple model and huggingface support

51
Established
383 XDcobra/react-native-sherpa-onnx

React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing...

51
Established
384 MycroftAI/adapt

Adapt Intent Parser

51
Established
385 at16k/at16k

Trained models for automatic speech recognition (ASR). A library to quickly...

51
Established
386 kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN &...

51
Established
387 ftyers/commonvoice-utils

Linguistic processing for Common Voice

51
Established
388 soobinseo/Transformer-TTS

A PyTorch Implementation of "Neural Speech Synthesis with Transformer Network"

51
Established
389 drethage/speech-denoising-wavenet

A neural network for end-to-end speech denoising

51
Established
390 gooofy/py-nltools

A collection of basic python modules for spoken natural language processing

51
Established
391 marytts/marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system...

51
Established
392 NVIDIA/OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

51
Established
393 srvk/eesen

The official repository of the Eesen project

51
Established
394 doctoroyy/edge-tts-as-a-service

This is a simple HTTP service that uses the Edge-TTS library to generate...

51
Established
395 pierreaubert/spinorama

A library to display and compare spinorama (speaker measurement) graphs.

51
Established
396 jaywalnut310/glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

51
Established
397 totalvoice/totalvoice-node

Node.js client for the Totalvoice API

51
Established
398 AdolfVonKleist/Phonetisaurus

Phonetisaurus G2P

51
Established
399 AI4Bharat/Chitralekha

Chitralekha - A video transcreation platform for Indic languages, supporting...

51
Established
400 julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

51
Established