All Voice AI Tools

8,165 tools ranked by quality score · Page 16 of 82

Showing 1501–1600 of 8,165
# Tool Score Tier
1501 HurroWorld/text-to-audio2face

Web interface to convert text to speech and route it to an Audio2Face...

40
Emerging
1502 hwRG/End-to-End-TTS-Fine-Tune

Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

40
Emerging
1503 qforge-dev/qspeak

qSpeak is a powerful voice transcription and AI assistant tool that helps...

40
Emerging
1504 definitio/ha-rhvoice

Home Assistant integration for RHVoice - a local text-to-speech engine.

40
Emerging
1505 jimbobbennett/SpeechToTextSamples

Sample code showing how to use the Azure Speech to Text service from Python 🗣

40
Emerging
1506 henryhale/ttspeech

🔊 A fully basic voice synthesizer in vanillaJS

40
Emerging
1507 tianbot/rosecho

Tianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用

40
Emerging
1508 inboxpraveen/Speech-Annotation-Tool

Review, correct, and export ASR transcripts at scale. Web-based ASR accuracy...

40
Emerging
1509 oscie57/tiktok-voice

Simple Python script to interact with the TikTok TTS API

40
Emerging
1510 RafalWilinski/serverless-medium-text-to-speech

🔊 Serverless-based, text-to-speech service for Medium articles

40
Emerging
1511 QiBowen2008/SuperTextToolBox

一个免费的文字处理工具箱

40
Emerging
1512 SadeghKrmi/pertts-streamlit

Persian text-to-speech streamlit interface

40
Emerging
1513 Saganaki22/ComfyUI-KittenTTS

😻 A simple ComfyUI custom node for KittenTTS - an ultra-lightweight...

40
Emerging
1514 gladchinda/web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the...

40
Emerging
1515 MicheleYin/misaki-rs

Rust port of Misaki

40
Emerging
1516 HerbertHe/edge-tts-server

Server for edge-tts

40
Emerging
1517 jscrane/TTS

Arduino Text-to-Speech Library

40
Emerging
1518 kaushiknishchay/ComfyUI-Qwen3-ASR

ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports...

40
Emerging
1519 lucasnewman/vocos-mlx

Implementation of 'Vocos: Closing the gap between time-domain and...

40
Emerging
1520 IceFog72/pocket-tts-openapi

Fast, local, OpenAI-compatible TTS server with voice cloning support powered...

40
Emerging
1521 coqui-ai/STT-models

Open models for Coqui STT

40
Emerging
1522 soundhound/houndify-sdk-go

The official Houndify SDK for Go

40
Emerging
1523 satyam9090/Automatic-Indian-Sign-Language-Translator-ISL

I created an application which takes in live speech or audio recording as...

40
Emerging
1524 nerdaxic/glados-voice-assistant

DIY Voice Assistant based on the GLaDOS character from Portal video game...

40
Emerging
1525 saadbutt32/Conversion-of-Pakistan-Sign-Languag-into-Text-and-Speech-using-OpenPose-and-Machine-Learning

Real-time translation of Pakistan sign language into text and speech using...

40
Emerging
1526 naschorr/hawking

The retro text-to-speech bot for Discord

40
Emerging
1527 RoySheffer/im2wav

Official implementation of the pipeline presented in I hear your true...

40
Emerging
1528 AEmotionStudio/ComfyUI-FFMPEGA

Intelligent FFMPEG agent node for ComfyUI - transforms natural language...

40
Emerging
1529 akinsella/yt-transcript-rs

🎬️ A Rust library for accessing YouTube Video Infos & Transcripts

40
Emerging
1530 AndroidMaryTTS/AndroidMaryTTS

Android MARY TTS - an open-source, offline HMM-Based text-to-speech...

40
Emerging
1531 RapidAI/RapidTTS

A cross platform implementation of Text-to-Speech based on ONNXRuntime.

40
Emerging
1532 PhuocElec/zipformer-asr-api

REST-API implementation of ZipFormer for automatic speech recognition (ASR)...

40
Emerging
1533 moeru-ai/ortts

𖣘🔊 Simple and Easy-to-use local TTS inference server, Powered by ONNX Runtime

40
Emerging
1534 myuan19/voiceInput

Windows AI 语音输入🎙 — 按快捷键说话即输入,支持润色。摆脱打字限制,实现无拘束、高效率的表达。

40
Emerging
1535 dmatekenya/Chichewa-Speech2Text

Automated Speech Recognition for Chichewa.

40
Emerging
1536 CoffeeMethod/KokoroGUI

An advanced TTS software, built for audiobooks, podcasts, videos, and more.

40
Emerging
1537 keonlee9420/Robust_Fine_Grained_Prosody_Control

PyTorch Implementation of Robust and fine-grained prosody control of...

40
Emerging
1538 skshadan/WhisCall

A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo,...

40
Emerging
1539 speechio/BigCiDian

Pronunciation lexicon covering both English and Chinese languages for...

40
Emerging
1540 mapluisch/OpenAI-Text-To-Speech-for-Unity

Implementation of OpenAI's Text-To-Speech in Unity. Synthesize any text and...

40
Emerging
1541 rapidaai/rapida-go

Open-source Golang SDK for Rapida to build real-time, observable Voice AI...

40
Emerging
1542 robmsmt/ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR...

40
Emerging
1543 soupslurpr/Transcribro

Private and on-device speech recognition keyboard and service for Android.

40
Emerging
1544 Hritikraj8804/Autotube

🤖 Automated YouTube Shorts creation using n8n, AI script generation, and...

40
Emerging
1545 foamliu/Listen-Attend-Spell-v2

PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).

40
Emerging
1546 eellak/gsoc2019-sphinx

Creation of an online Greek mail dictation system, using Sphinx and...

40
Emerging
1547 FaceOnLive/Spleeter-Android-iOS

On-device, Offline Spleeter Solution For Mobile

40
Emerging
1548 DmitryRyumin/INTERSPEECH-2023-24-Papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and...

40
Emerging
1549 zw76859420/ASR_Syllable

基于卷积神经网络的语音识别声学模型的研究

40
Emerging
1550 pymike00/YouTube-Tutorials

:open_file_folder: Source Code for (some of) the Programming Tutorials from...

40
Emerging
1551 alan890104/sumi

Sumi — Free, open-source voice dictation for macOS. Local-first Whisper +...

40
Emerging
1552 hcy71o/SNAC

Unofficial Pytorch implementation of SNAC: Speaker-normalized affine...

40
Emerging
1553 atakanakin/TutunSabri

He is not our hero. He is a silent guardian. A watchful protector.

40
Emerging
1554 Warma10032/easytts

打造最简单的TTS前端集合,最简单的有声小说制作工作流。基于正则规则对小说进行分句,基于RoBERTa对小说中的对话进行说话人识别,从而实现一键式生成多人...

40
Emerging
1555 zh217/torch-asg

Auto Segmentation Criterion (ASG) implemented in pytorch

40
Emerging
1556 tristan-mcinnis/Multimodal-voice-assistant

This project is a multi-modal AI voice assistant that uses LM Studio, OpenAI...

40
Emerging
1557 Igorcbraz/Calculadora

📐 Calculadora simples e intuitiva com suporte a comandos de voz e temas...

40
Emerging
1558 apluka34/Bud500

Bud500: A Comprehensive Vietnamese ASR Dataset

40
Emerging
1559 WeiChiaChang/happy-halloween

🗣 Say "happy halloween" to your browser 🎃

40
Emerging
1560 markmiddo/synthia

AI-powered voice assistant that respects your privacy. Control your desktop,...

40
Emerging
1561 FedericaPaoli1/stm32-speech-recognition-and-traduction

stm32-speech-recognition-and-traduction is a project developed for the...

40
Emerging
1562 marytts/gradle-marytts-voicebuilding-plugin

A replacement for the legacy VoiceImportTools in MaryTTS

40
Emerging
1563 lokkelvin2/tacotron2-tts-GUI

Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom...

40
Emerging
1564 AcTePuKc/Kokoro-Local-Gui

Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included.

40
Emerging
1565 grebtsew/Text_To_Speech_Server_Node

A super simple speaking server node that receives requests and reads them...

40
Emerging
1566 Allan-Nava/fakeyou.go

A powerful golang sdk library for interacting with the FakeYouAPI easily

40
Emerging
1567 Jdreioe/Wingmate

A project to make people who cannot speak, speak!

40
Emerging
1568 vkosuri/dialogflow-lite

[Maintainer Required] A light-weight python library REST agent for Dialogflow

40
Emerging
1569 yeyupiaoling/VITS-Pytorch

本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,...

40
Emerging
1570 user3301/ssml_builder

:sound: a general SSML(Speech Synthesis Markup Language) builder

40
Emerging
1571 sunshine0523/MNNServer

A third-party MNN server supporting external calls, embedding model, TTS...

40
Emerging
1572 pschatzmann/arduino-espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than...

40
Emerging
1573 FlooferLand/ttvoice-mod

A Minecraft mod that lets you type to speak!

40
Emerging
1574 shahules786/mayavoz

Pytorch based speech enhancement toolkit.

40
Emerging
1575 daanzu/speech-training-recorder

Simple GUI application to help record audio dictated from given text...

40
Emerging
1576 maum-ai/nuwave2

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling...

40
Emerging
1577 ShadowForests/VoiceToSpeech

Live speech recognition to synthesized speech with hundreds of voices, TTS,...

40
Emerging
1578 sophiefy/StellaVoiceChanger

Deep-learning-based voice changer, supporting local inference.

40
Emerging
1579 weimeng23/speech-recognition-learning-resources

:white_check_mark: A list of speech recognition learning resources including...

40
Emerging
1580 felivalencia3/RealVoiceGPT

RealVoiceGPT is a web application that lets you have voice conversations...

40
Emerging
1581 itspyguru/Tkinter-Applications

A collection of small tkinter apps made by me

40
Emerging
1582 Adamiito0909/mlx-swift-audio

🎤 Enhance your apps with MLX Swift Audio, offering robust text-to-speech and...

40
Emerging
1583 reybahl/Assistant

A machine learning powered, voice-based virtual assistant for Raspberry Pi....

40
Emerging
1584 smx-smx/KodiSharp

Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono

40
Emerging
1585 1ytic/pytorch-edit-distance

Levenshtein edit-distance on PyTorch and CUDA

40
Emerging
1586 MattePalte/Verbify-TTS

Simple and free Text-to-Speech (TTS) engine that reads for you any text on...

40
Emerging
1587 aks-devs/mod_google_asr

Freeswitch Speech-to-Text module

40
Emerging
1588 TeaPoly/Conformer-Athena

Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

40
Emerging
1589 andi611/TTS-Tacotron-Pytorch

Pytorch implementation of Tacotron, a speech synthesis end-to-end generative...

40
Emerging
1590 pviotti/sayit

A text-to-speech command line tool backed by Azure Cognitive Services.

40
Emerging
1591 hyeonsangjeon/computing-Korean-STT-error-rates

STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지

40
Emerging
1592 LetovKai/call-translator

Real-time voice translator for video calls. Speak your language on Google...

40
Emerging
1593 TigreGotico/phoonnx

A Python library for multilingual phonemization and Text-to-Speech (TTS)...

40
Emerging
1594 shi-gg/Auditional-Text

The source code of the Auditional Text discord Boat

40
Emerging
1595 double22a/asr_nlp_paper_code

Papers of ASR, Tools of ASR

40
Emerging
1596 johunsang/octo-captures

화면 녹화의 모든 것 — Auto Zoom, 아바타, 음성 변조, BGM, 타임라인 편집을 지원하는 무료 오픈소스 macOS 앱....

40
Emerging
1597 racai-ai/RobinASR

Romanian Automatic Speech Recognition from the ROBIN project

40
Emerging
1598 abus-aikorea/kara-audio

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports...

40
Emerging
1599 bnsantoso/sub-to-audio

Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS...

40
Emerging
1600 dusty-nv/jetson-voice

ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch...

40
Emerging
« Prev 1 2 3 14 15 16 17 18 80 81 82 Next »