All Voice AI Tools

8,165 tools ranked by quality score · Page 17 of 82

Showing 1601–1700 of 8,165
# Tool Score Tier
1601 adelacvg/ttts

Train the next generation of TTS systems.

40
Emerging
1602 rryam/SakuraKit

Swift SDK for Prototyping AI Speech Generation

40
Emerging
1603 Ijwi-ry-Ikirundi-AI/Kirundi_Dataset

🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi...

40
Emerging
1604 DrewThomasson/ebook2audiobookpiper-tts

Converts ebooks into audiobooks with piper-tts

40
Emerging
1605 ninjahuttjr/hal-answering-service

I'm sorry, Dave. I'm afraid I can't let that spam call through. — Local AI...

39
Emerging
1606 1ytic/open_stt_e2e

PyTorch end-to-end speech recognition

39
Emerging
1607 MuGuiLin/VoiceDictation

迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。

39
Emerging
1608 taikun114/VOICEVOX-TTS-for-Home-Assistant

Custom integration for Japanese TTS using VOICEVOX in Home Assistant.

39
Emerging
1609 collectivat/cmusphinx-models

Acoustic and language models for minorised languages.

39
Emerging
1610 rhasspy/piper-samples

Samples for Piper text to speech system

39
Emerging
1611 M0Rf30/shisper

A quick & dirty script to generate and view subtitles and transcriptions for...

39
Emerging
1612 Anwarvic/RasaChatbot-with-ASR-and-TTS

This repository contains an attempt to incorporate Rasa Chatbot with...

39
Emerging
1613 pkozul/ha-tts-bluetooth-speaker

TTS Bluetooth Speaker for Home Assistant

39
Emerging
1614 rcspam/dictee

Push-to-talk voice dictation for Linux — 100% local, multilingual (25+...

39
Emerging
1615 spokestack/spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS....

39
Emerging
1616 JusperLee/Conv-TasNet

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech...

39
Emerging
1617 oleges1/quartznet-pytorch

Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]

39
Emerging
1618 Supremesujay/murf-voice-agent-starter

🎤 Build a low-latency voice agent with real-time TTS and STT, powered by...

39
Emerging
1619 just-ai/aimybox-ios-sdk

Voice assistant SDK for iOS devices written in Swift

39
Emerging
1620 takahi-ro/ConvivialChat

This system provides the web space where text and speech coexist, and you...

39
Emerging
1621 hariketsheth/Article_Repository_Management_System

In this Tech Savvy era, with lot of advancements in the field of AI, ML, IoT...

39
Emerging
1622 moutaouakkil/tts-text-to-speech

Text-to-Speech (TTS) enables developers to synthesize natural-sounding...

39
Emerging
1623 nuance-communications/mix-demo-client-azstaticwebapps

Nuance Mix Demo Client for use with Azure Static Web Apps

39
Emerging
1624 WismutHansen/READ2ME

Turn text from websites into spoken audio with edge-tts, F5, etc. and save...

39
Emerging
1625 TrevorS/qwen3-tts-rs

Rust implementation of Qwen3-TTS speech synthesis

39
Emerging
1626 uetuluk/xcodec2-infer-lib

CPU support for xcodec2

39
Emerging
1627 ProperCode/Work-by-Speech

Windows app which allows efficient work on a computer by speech alone.

39
Emerging
1628 ShawnHymel/tflite-speech-recognition

Demo for training a convolutional neural network to classify words and...

39
Emerging
1629 asticode/go-astibob

Golang framework to build an AI that can understand and speak back to you,...

39
Emerging
1630 smartherd/SpeechToText

Speech To Text in Android

39
Emerging
1631 sljavi/handsfree-for-web-control-speech-recognition-module

Handsfree for Web module useful to ask for start or stop listening for voice commands

39
Emerging
1632 daisy/obi

Obi is an open source audio book production tool that produces digital...

39
Emerging
1633 poretsky/ru_tts

Compact and portable Russian speech synthesizer

39
Emerging
1634 uiuc-sst/asr24

24-hour Automatic Speech Recognition

39
Emerging
1635 npuichigo/voicenet

Speech synthesis platform based on tensorflow and sonnet

39
Emerging
1636 megaease/easevoice-trainer

EaseVoice Trainer is a simple and user-friendly voice cloning and speech...

39
Emerging
1637 kaieberl/paper2speech

Convert any english paper or scientific book to audio

39
Emerging
1638 gauthelo/kallaama-speech-dataset

A transcribed speech dataset in Wolof, Pulaar and Sereer, to support...

39
Emerging
1639 SiddhantSadangi/st_deepgram_playground

API playground for Deepgram built with Streamlit

39
Emerging
1640 SungFeng-Huang/Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More...

39
Emerging
1641 jorge-menjivar/super-stt

Super STT enables effortless voice-to-text in any application, using the...

39
Emerging
1642 loretoparisi/htk

HTK Toolkit with Linux 64 bit and Docker support

39
Emerging
1643 allseeteam/ai-secretary

Smart assistant in Telegram bot format for transcribing online meetings

39
Emerging
1644 akku2005/VocalInk

Next-gen open-source voice-to-blog platform with AI, TTS, gamification, and...

39
Emerging
1645 xifan2333/fcitx5-vinput

Local offline voice input plugin for Fcitx5

39
Emerging
1646 brewusinc/Edge-TTS

Edge-TTS is a Swift implementation of Microsoft Edge's Text-to-Speech (TTS)...

39
Emerging
1647 kauazin394/vibevoice.swift

🎤 Create low-latency text-to-speech on macOS with VibeVoice.swift,...

39
Emerging
1648 art1415926535/yandex_speech

Generation of speech using Yandex SpeechKit.

39
Emerging
1649 felipefacundes/brasiltts

Brasil TTS é um conjunto de sintetizadores de voz, em português do Brasil,...

39
Emerging
1650 mostafaelaraby/Tensorflow-Keyword-Spotting

Keyword spotting using various architecture like convolutional vggnet , 1D...

39
Emerging
1651 manishdhakal/ASR-Nepali-using-CNN-BiLSTM-ResNet

Automatic speech recognition for the Nepali language using CNN,...

39
Emerging
1652 royshil/cloudvocal

Cloud AI live transcription and translation service plugin

39
Emerging
1653 yuanshanhua/video-dubbing

AI 驱动的视频译配工具. An AI powered tool to execute end-to-end video dubbing.

39
Emerging
1654 fewieden/MMM-TTS

Text-To-Speech Module for MagicMirror²

39
Emerging
1655 sooftware/speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

39
Emerging
1656 tomchang25/whisper-auto-transcribe

Auto transcribe tool based on whisper

39
Emerging
1657 atrzaska/VoiceStressAnalysis

VoiceStressAnalysis - Detects stress in your voice

39
Emerging
1658 JstnMcBrd/dectalk-tts

API wrapper for the Dectalk TTS system

39
Emerging
1659 OpenVoiceOS/ovos-tts-plugin-pico

pico-tts-plugin

39
Emerging
1660 ReneTode/My-AppDaemon

My apps, my helpfiles, all about AppDaemon for Home Assistant

39
Emerging
1661 seanhweb/Twitch-Text-to-Speech

Text to speech tool for twitch

39
Emerging
1662 privapps/TTS-Mandarin

text to speech in mandarin

39
Emerging
1663 asrajeh/arabic-tts

Arabic TTS ( الناطق العربي )

39
Emerging
1664 6drf21e/ChatTTS_colab

🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。

39
Emerging
1665 harisbinzia/PronouncUR

PronouncUR: An Urdu Pronunciation Lexicon Generator

39
Emerging
1666 warisqr007/vocos

Causal version of Vocos (neural vocoders for high-quality audio synthesis)...

39
Emerging
1667 wangz-code/legado-tts

Book Reader阅读Legado 应用内置EdgeTTS大声朗读, 听书无需额外部署 即装即听, 语音引擎采用rany2/edge-tts...

39
Emerging
1668 hathibelagal-dev/str2speech

An easy-to-use library and command-line tool for TTS

39
Emerging
1669 hmartelb/speech-denoising

Speech Denoising project for the Deep Learning course at Tsinghua...

39
Emerging
1670 saurabhshri/CCAligner

🔮 Word by word audio subtitle synchronisation tool and API. Developed under...

39
Emerging
1671 awexandrr/audioWhisper

Listen to any audio stream on your machine and print out the transcribed or...

39
Emerging
1672 liuhaozhe6788/voice-cloning-collab

an improved version of Real-time-voice-cloning

39
Emerging
1673 gmltmd789/UnitSpeech

An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis...

39
Emerging
1674 smtiitm/Fastspeech2_MFA

Indic TTS for Indian Languages: This is a project on developing...

39
Emerging
1675 mrtrizer/UnityPiper

Offline text to speech inside Unity

39
Emerging
1676 ivanvovk/compressed-tacotron2-pytorch

Compressed version of Tacotron 2 using Tensor Train + Waveglow.

39
Emerging
1677 Yazdi9/TTS-MultiLingual

Text To Speech Multilingual Support (+20 Language)

39
Emerging
1678 unza-speech-lab/zambezi-voice

Repository for multilingual speech data resources for native languages of Zambia.

39
Emerging
1679 rishikksh20/SoundStorm-pytorch

Google's SoundStorm: Efficient Parallel Audio Generation

39
Emerging
1680 Executedone/Chinese-FastSpeech2

基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏

39
Emerging
1681 twn39/EdgeTTS.DotNet

EdgeTTS.DotNet is a C# (.NET) library that allows you to use Microsoft...

39
Emerging
1682 souvikg544/TTS_Data_Maker

Text to speech is an emerging zone of AI. This repository helps to create a...

39
Emerging
1683 AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

39
Emerging
1684 hiteshsahu/Android-TTS-STT

One line solution for Android Text to speech(TTS) & Speech to Text(STT)...

39
Emerging
1685 second-state/gsv_tts

Streaming TTS API server written in Rust

39
Emerging
1686 harvard-edge/multilingual_kws

Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus

39
Emerging
1687 llm-believer/slide-to-video

A tool that converts a slide deck into a video, complete with your voice...

39
Emerging
1688 tnicola/vue-voice

Speech to text and text to speech Vue library

39
Emerging
1689 umair13adil/background_stt

A flutter plugin to run always-on speech to text service in the background.

39
Emerging
1690 SergeyShk/Speech-to-Text-Russian

Проект для распознавания речи на русском языке на основе pykaldi.

39
Emerging
1691 LedoKun/028-simple-queue-system

A real-time, responsive queue calling system designed for TV displays,...

39
Emerging
1692 syhw/wer_are_we

Attempt at tracking states of the arts and recent results (bibliography) on...

39
Emerging
1693 espnet/interspeech2019-tutorial

INTERSPEECH 2019 Tutorial Materials

39
Emerging
1694 usabarashi/voicevox-cli

Japanese text-to-speech using VOICEVOX Core

39
Emerging
1695 DataXujing/ASR-paper

:fire: ASR教程: https://dataxujing.github.io/ASR-paper/

39
Emerging
1696 westonruter/spoken-word

Spoken Word

39
Emerging
1697 tabahi/contexless-phonemes-CUPE

pytorch model for contexless-phoneme prediction from speech audio

39
Emerging
1698 18F/tts-buy-bug-bounty

Solicitation and acquisition documents created for the TTS Bug Bounty...

39
Emerging
1699 VITA-Group/Audio-Lottery

[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight,...

39
Emerging
1700 chrisvdev/obs-chat

Also known as CVTalk is a Twitch chat viewer made with React for use in OBS...

39
Emerging
« Prev 1 2 3 15 16 17 18 19 80 81 82 Next »