All Voice AI Tools
8,165 tools ranked by quality score · Page 17 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1601 |
adelacvg/ttts
Train the next generation of TTS systems. |
|
Emerging |
| 1602 |
rryam/SakuraKit
Swift SDK for Prototyping AI Speech Generation |
|
Emerging |
| 1603 |
Ijwi-ry-Ikirundi-AI/Kirundi_Dataset
🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi... |
|
Emerging |
| 1604 |
DrewThomasson/ebook2audiobookpiper-tts
Converts ebooks into audiobooks with piper-tts |
|
Emerging |
| 1605 |
ninjahuttjr/hal-answering-service
I'm sorry, Dave. I'm afraid I can't let that spam call through. — Local AI... |
|
Emerging |
| 1606 |
1ytic/open_stt_e2e
PyTorch end-to-end speech recognition |
|
Emerging |
| 1607 |
MuGuiLin/VoiceDictation
迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。 |
|
Emerging |
| 1608 |
taikun114/VOICEVOX-TTS-for-Home-Assistant
Custom integration for Japanese TTS using VOICEVOX in Home Assistant. |
|
Emerging |
| 1609 |
collectivat/cmusphinx-models
Acoustic and language models for minorised languages. |
|
Emerging |
| 1610 |
rhasspy/piper-samples
Samples for Piper text to speech system |
|
Emerging |
| 1611 |
M0Rf30/shisper
A quick & dirty script to generate and view subtitles and transcriptions for... |
|
Emerging |
| 1612 |
Anwarvic/RasaChatbot-with-ASR-and-TTS
This repository contains an attempt to incorporate Rasa Chatbot with... |
|
Emerging |
| 1613 |
pkozul/ha-tts-bluetooth-speaker
TTS Bluetooth Speaker for Home Assistant |
|
Emerging |
| 1614 |
rcspam/dictee
Push-to-talk voice dictation for Linux — 100% local, multilingual (25+... |
|
Emerging |
| 1615 |
spokestack/spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS.... |
|
Emerging |
| 1616 |
JusperLee/Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech... |
|
Emerging |
| 1617 |
oleges1/quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261] |
|
Emerging |
| 1618 |
Supremesujay/murf-voice-agent-starter
🎤 Build a low-latency voice agent with real-time TTS and STT, powered by... |
|
Emerging |
| 1619 |
just-ai/aimybox-ios-sdk
Voice assistant SDK for iOS devices written in Swift |
|
Emerging |
| 1620 |
takahi-ro/ConvivialChat
This system provides the web space where text and speech coexist, and you... |
|
Emerging |
| 1621 |
hariketsheth/Article_Repository_Management_System
In this Tech Savvy era, with lot of advancements in the field of AI, ML, IoT... |
|
Emerging |
| 1622 |
moutaouakkil/tts-text-to-speech
Text-to-Speech (TTS) enables developers to synthesize natural-sounding... |
|
Emerging |
| 1623 |
nuance-communications/mix-demo-client-azstaticwebapps
Nuance Mix Demo Client for use with Azure Static Web Apps |
|
Emerging |
| 1624 |
WismutHansen/READ2ME
Turn text from websites into spoken audio with edge-tts, F5, etc. and save... |
|
Emerging |
| 1625 |
TrevorS/qwen3-tts-rs
Rust implementation of Qwen3-TTS speech synthesis |
|
Emerging |
| 1626 |
uetuluk/xcodec2-infer-lib
CPU support for xcodec2 |
|
Emerging |
| 1627 |
ProperCode/Work-by-Speech
Windows app which allows efficient work on a computer by speech alone. |
|
Emerging |
| 1628 |
ShawnHymel/tflite-speech-recognition
Demo for training a convolutional neural network to classify words and... |
|
Emerging |
| 1629 |
asticode/go-astibob
Golang framework to build an AI that can understand and speak back to you,... |
|
Emerging |
| 1630 |
smartherd/SpeechToText
Speech To Text in Android |
|
Emerging |
| 1631 |
sljavi/handsfree-for-web-control-speech-recognition-module
Handsfree for Web module useful to ask for start or stop listening for voice commands |
|
Emerging |
| 1632 |
daisy/obi
Obi is an open source audio book production tool that produces digital... |
|
Emerging |
| 1633 |
poretsky/ru_tts
Compact and portable Russian speech synthesizer |
|
Emerging |
| 1634 |
uiuc-sst/asr24
24-hour Automatic Speech Recognition |
|
Emerging |
| 1635 |
npuichigo/voicenet
Speech synthesis platform based on tensorflow and sonnet |
|
Emerging |
| 1636 |
megaease/easevoice-trainer
EaseVoice Trainer is a simple and user-friendly voice cloning and speech... |
|
Emerging |
| 1637 |
kaieberl/paper2speech
Convert any english paper or scientific book to audio |
|
Emerging |
| 1638 |
gauthelo/kallaama-speech-dataset
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support... |
|
Emerging |
| 1639 |
SiddhantSadangi/st_deepgram_playground
API playground for Deepgram built with Streamlit |
|
Emerging |
| 1640 |
SungFeng-Huang/Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More... |
|
Emerging |
| 1641 |
jorge-menjivar/super-stt
Super STT enables effortless voice-to-text in any application, using the... |
|
Emerging |
| 1642 |
loretoparisi/htk
HTK Toolkit with Linux 64 bit and Docker support |
|
Emerging |
| 1643 |
allseeteam/ai-secretary
Smart assistant in Telegram bot format for transcribing online meetings |
|
Emerging |
| 1644 |
akku2005/VocalInk
Next-gen open-source voice-to-blog platform with AI, TTS, gamification, and... |
|
Emerging |
| 1645 |
xifan2333/fcitx5-vinput
Local offline voice input plugin for Fcitx5 |
|
Emerging |
| 1646 |
brewusinc/Edge-TTS
Edge-TTS is a Swift implementation of Microsoft Edge's Text-to-Speech (TTS)... |
|
Emerging |
| 1647 |
kauazin394/vibevoice.swift
🎤 Create low-latency text-to-speech on macOS with VibeVoice.swift,... |
|
Emerging |
| 1648 |
art1415926535/yandex_speech
Generation of speech using Yandex SpeechKit. |
|
Emerging |
| 1649 |
felipefacundes/brasiltts
Brasil TTS é um conjunto de sintetizadores de voz, em português do Brasil,... |
|
Emerging |
| 1650 |
mostafaelaraby/Tensorflow-Keyword-Spotting
Keyword spotting using various architecture like convolutional vggnet , 1D... |
|
Emerging |
| 1651 |
manishdhakal/ASR-Nepali-using-CNN-BiLSTM-ResNet
Automatic speech recognition for the Nepali language using CNN,... |
|
Emerging |
| 1652 |
royshil/cloudvocal
Cloud AI live transcription and translation service plugin |
|
Emerging |
| 1653 |
yuanshanhua/video-dubbing
AI 驱动的视频译配工具. An AI powered tool to execute end-to-end video dubbing. |
|
Emerging |
| 1654 |
fewieden/MMM-TTS
Text-To-Speech Module for MagicMirror² |
|
Emerging |
| 1655 |
sooftware/speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch. |
|
Emerging |
| 1656 |
tomchang25/whisper-auto-transcribe
Auto transcribe tool based on whisper |
|
Emerging |
| 1657 |
atrzaska/VoiceStressAnalysis
VoiceStressAnalysis - Detects stress in your voice |
|
Emerging |
| 1658 |
JstnMcBrd/dectalk-tts
API wrapper for the Dectalk TTS system |
|
Emerging |
| 1659 |
OpenVoiceOS/ovos-tts-plugin-pico
pico-tts-plugin |
|
Emerging |
| 1660 |
ReneTode/My-AppDaemon
My apps, my helpfiles, all about AppDaemon for Home Assistant |
|
Emerging |
| 1661 |
seanhweb/Twitch-Text-to-Speech
Text to speech tool for twitch |
|
Emerging |
| 1662 |
privapps/TTS-Mandarin
text to speech in mandarin |
|
Emerging |
| 1663 |
asrajeh/arabic-tts
Arabic TTS ( الناطق العربي ) |
|
Emerging |
| 1664 |
6drf21e/ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。 |
|
Emerging |
| 1665 |
harisbinzia/PronouncUR
PronouncUR: An Urdu Pronunciation Lexicon Generator |
|
Emerging |
| 1666 |
warisqr007/vocos
Causal version of Vocos (neural vocoders for high-quality audio synthesis)... |
|
Emerging |
| 1667 |
wangz-code/legado-tts
Book Reader阅读Legado 应用内置EdgeTTS大声朗读, 听书无需额外部署 即装即听, 语音引擎采用rany2/edge-tts... |
|
Emerging |
| 1668 |
hathibelagal-dev/str2speech
An easy-to-use library and command-line tool for TTS |
|
Emerging |
| 1669 |
hmartelb/speech-denoising
Speech Denoising project for the Deep Learning course at Tsinghua... |
|
Emerging |
| 1670 |
saurabhshri/CCAligner
🔮 Word by word audio subtitle synchronisation tool and API. Developed under... |
|
Emerging |
| 1671 |
awexandrr/audioWhisper
Listen to any audio stream on your machine and print out the transcribed or... |
|
Emerging |
| 1672 |
liuhaozhe6788/voice-cloning-collab
an improved version of Real-time-voice-cloning |
|
Emerging |
| 1673 |
gmltmd789/UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis... |
|
Emerging |
| 1674 |
smtiitm/Fastspeech2_MFA
Indic TTS for Indian Languages: This is a project on developing... |
|
Emerging |
| 1675 |
mrtrizer/UnityPiper
Offline text to speech inside Unity |
|
Emerging |
| 1676 |
ivanvovk/compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + Waveglow. |
|
Emerging |
| 1677 |
Yazdi9/TTS-MultiLingual
Text To Speech Multilingual Support (+20 Language) |
|
Emerging |
| 1678 |
unza-speech-lab/zambezi-voice
Repository for multilingual speech data resources for native languages of Zambia. |
|
Emerging |
| 1679 |
rishikksh20/SoundStorm-pytorch
Google's SoundStorm: Efficient Parallel Audio Generation |
|
Emerging |
| 1680 |
Executedone/Chinese-FastSpeech2
基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏 |
|
Emerging |
| 1681 |
twn39/EdgeTTS.DotNet
EdgeTTS.DotNet is a C# (.NET) library that allows you to use Microsoft... |
|
Emerging |
| 1682 |
souvikg544/TTS_Data_Maker
Text to speech is an emerging zone of AI. This repository helps to create a... |
|
Emerging |
| 1683 |
AIFSH/ComfyUI-GPT_SoVITS
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now |
|
Emerging |
| 1684 |
hiteshsahu/Android-TTS-STT
One line solution for Android Text to speech(TTS) & Speech to Text(STT)... |
|
Emerging |
| 1685 |
second-state/gsv_tts
Streaming TTS API server written in Rust |
|
Emerging |
| 1686 |
harvard-edge/multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus |
|
Emerging |
| 1687 |
llm-believer/slide-to-video
A tool that converts a slide deck into a video, complete with your voice... |
|
Emerging |
| 1688 |
tnicola/vue-voice
Speech to text and text to speech Vue library |
|
Emerging |
| 1689 |
umair13adil/background_stt
A flutter plugin to run always-on speech to text service in the background. |
|
Emerging |
| 1690 |
SergeyShk/Speech-to-Text-Russian
Проект для распознавания речи на русском языке на основе pykaldi. |
|
Emerging |
| 1691 |
LedoKun/028-simple-queue-system
A real-time, responsive queue calling system designed for TV displays,... |
|
Emerging |
| 1692 |
syhw/wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on... |
|
Emerging |
| 1693 |
espnet/interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials |
|
Emerging |
| 1694 |
usabarashi/voicevox-cli
Japanese text-to-speech using VOICEVOX Core |
|
Emerging |
| 1695 |
DataXujing/ASR-paper
:fire: ASR教程: https://dataxujing.github.io/ASR-paper/ |
|
Emerging |
| 1696 |
westonruter/spoken-word
Spoken Word |
|
Emerging |
| 1697 |
tabahi/contexless-phonemes-CUPE
pytorch model for contexless-phoneme prediction from speech audio |
|
Emerging |
| 1698 |
18F/tts-buy-bug-bounty
Solicitation and acquisition documents created for the TTS Bug Bounty... |
|
Emerging |
| 1699 |
VITA-Group/Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight,... |
|
Emerging |
| 1700 |
chrisvdev/obs-chat
Also known as CVTalk is a Twitch chat viewer made with React for use in OBS... |
|
Emerging |