All Voice AI Tools

8,165 tools ranked by quality score · Page 15 of 82

Showing 1401–1500 of 8,165
# Tool Score Tier
1401 tihu-nlp/tihu

Persian Text-To-Speech

41
Emerging
1402 markokosticdev/cloud_text_to_speech_flutter

Single interface to Google, Microsoft, and Amazon Text-To-Speech.

41
Emerging
1403 orange2ai/youtube-subtitle-translator

🌐 Real-time YouTube subtitle translator browser extension. Translate...

41
Emerging
1404 rudrankriyam/Glosik

Sample project for F5-TTS using MLX Swift

41
Emerging
1405 lucko515/speech-recognition-neural-network

This is the end-to-end Speech Recognition neural network, deployed in Keras....

41
Emerging
1406 cameronking4/VapiBlocks

Vapi Blocks is a library of components & api snips to copy and paste into...

41
Emerging
1407 Lunarien/Lunariens-Mental-Math-Trainer

Mental math trainer made in C#.

41
Emerging
1408 holm-aune-bachelor2018/ctc

Speech recognition with CTC in Keras with Tensorflow backend

41
Emerging
1409 AryanVBW/AiVoiceClonerPRO

Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into...

41
Emerging
1410 Emotional-Text-to-Speech/hmm-for-emo-tts

:computer: A repository with comprehensive instructions for using the...

41
Emerging
1411 declare-lab/speech-adapters

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient...

41
Emerging
1412 modelscope/FunCodec

FunCodec is a research-oriented toolkit for audio quantization and...

41
Emerging
1413 Kini218/speech-to-text

Speech to text script on python

41
Emerging
1414 alias454/YATSEE

YATSEE - Yet Another Tool for Speech Extraction & Enrichment

41
Emerging
1415 MHaggis/ASRGEN

ASR Configurator, Essentials and Atomic Testing

41
Emerging
1416 nl8590687/ASRT_SDK_Python3

ASRT语音识别系统的Python版SDK

41
Emerging
1417 1038lab/ComfyUI-SparkTTS

ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an...

41
Emerging
1418 Dostoyewski/django_voice_bot

Package for django onpage support bot with speech recognition and voice commands

41
Emerging
1419 iBrammm/qwen-asr

🎙️ Implement fast, dependency-free C inference for Qwen3-ASR speech-to-text...

41
Emerging
1420 yl4579/HiFTNet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...

41
Emerging
1421 titilambert/pynuance

Wrapper for Nuance Communications services

41
Emerging
1422 Andrewcpu/elevenlabs-api

🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs...

41
Emerging
1423 Frikallo/parakeet.cpp

Ultra fast and portable Parakeet implementation for on-device inference in...

41
Emerging
1424 tktcorporation/discord-tts-bot

A discord bot to use tts in your voice channel.

41
Emerging
1425 janewu77/ela-extension

English Learner Assistant

41
Emerging
1426 1neReality/MITSUHA

World's First Multilingual Inexpensive Therapeutic Sophisticated...

41
Emerging
1427 bhattbhavesh91/wav2vec2-huggingface-demo

Speech to Text with self-supervised learning based on wav2vec 2.0 framework...

41
Emerging
1428 kokimame/joytan

Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel

41
Emerging
1429 serpapps/ai-voice-cloner

AI Voice Cloning Desktop Application that runs locally on your computer and...

41
Emerging
1430 ssssssilver/sherpa-ncnn-unity

在Unity环境下,借助sherpa-ncnn框架,实现实时并准确的中英双语语音识别功能。

41
Emerging
1431 Kaljurand/Arvutaja

An Android app for voice actions in Estonian and English

41
Emerging
1432 quangvu3/coqui-xtts

Coqui XTTS model with Vietnamese added

41
Emerging
1433 yzfly/awesome-voice-agents

A curated list of voice AI agent frameworks, tools, resources, and best practices

41
Emerging
1434 zhangzijie-pro/Speaker-Verification

Dual-model speech AI toolkit for speaker verification and speaker-aware...

41
Emerging
1435 pika-online/AESRC2020

a deep accent recognition network

41
Emerging
1436 zeropointnine/tts-audiobook-tool

Audiobook creation tool with support for multiple TTS models (Qwen3-TTS,...

41
Emerging
1437 Edw590/VISOR---A-Voice-Assistant

V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!

41
Emerging
1438 CodeBySonu95/VoxSherpa-TTS

🎙️ VoxSherpa TTS Offline Neural Text-to-Speech Engine for Android ⚡...

41
Emerging
1439 renorari/VoiceJP-Discord

A discord-app can text-to-speech and speech-to-text

41
Emerging
1440 TETYYS/SAPI4

Web interface for Microsoft Sam & friends

41
Emerging
1441 mattmireles/kokoro-coreml

PyTorch → CoreML conversion pipeline for Kokoro TTS. Unlocks fast on-device...

41
Emerging
1442 mapluisch/OpenAI-Realtime-API-for-Unity

Implementation of OpenAI's Realtime API in Unity. Easily integrate...

41
Emerging
1443 shenbengit/TTSTool

科大讯飞离线语音,Text to Speech,TTS

41
Emerging
1444 aditya-an1l/RILearn

Reinventing Reading with a touch of Interactivity aided Learning

41
Emerging
1445 leprosus/golang-tts

Text-to-Speach golang package based in Amazon Polly service

41
Emerging
1446 cherts/mspeech

Program for speech recognition using the Google Speech API, voice commands,...

41
Emerging
1447 nithincvpoyyil/voice-listener

An reusable angular component for voice based input using web speech API

41
Emerging
1448 aboda-dirbas/whisperclip

🎤 Enhance your voice-to-text transcriptions with WhisperClip, prioritizing...

41
Emerging
1449 Renovamen/Speech-and-Text

Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech...

41
Emerging
1450 antifield/vmt

Discord App for Transcribing & Translating Voice Messages

41
Emerging
1451 smaranjitghose/AIAudioTranscriber

A minimalistic web app to generate transciption for audio built using Python

41
Emerging
1452 N6UDP/SteamDiscordTTSBot

A steam chat to Discord TTS bridge

41
Emerging
1453 deepgram-starters/php-transcription

Get started using Deepgram's speech-to-text with this PHP demo app

41
Emerging
1454 doveg/whisper-real-time

A real time offline transcriber with gui, based on OpenAI whisper

41
Emerging
1455 rishikksh20/gmvae_tacotron

Gaussian Mixture VAE Tacotron

41
Emerging
1456 EndlessReform/fish-speech.rs

A Fish Speech implementation in Rust, with Candle.rs

40
Emerging
1457 gillesdemey/google-speech-v2

:speech_balloon: Reverse Engineering Google's Speech To Text API (v2)

40
Emerging
1458 mramshaw/Speech-Recognition

Speech recognition with Python

40
Emerging
1459 yapit-tts/yapit

Listen to anything. TTS for documents, papers, and web pages.

40
Emerging
1460 PhilippeRo/IBus-Speech-To-Text

A speech to text IBus engine using VOSK

40
Emerging
1461 rishikksh20/Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

40
Emerging
1462 Alex-Tremayne/LaTeXt

Python package for converting LaTeX to text which can be read by text to...

40
Emerging
1463 Harshit-shrivastav/TikTok-TTS-Bot

A python TikTok Text to speech generator telegram bot.

40
Emerging
1464 jing332/tts-server-android

这是一个Android系统TTS应用,内置微软演示接口,可自定义HTTP请求,可导入其他本地TTS引擎,以及根据中文双引号的简单旁白/对话识别朗读...

40
Emerging
1465 saurabhdaware/bol

Slightly more consistent Text-to-speech for Web and a wrapper around speechSynthesis

40
Emerging
1466 danielclough/vibevoice-rs

Rust implementation of VibeVoice text-to-speech with voice cloning and...

40
Emerging
1467 ehtisham91/Django-Speech-to-text-Chat

This App allows users to convert their speech into text and send that text...

40
Emerging
1468 0xPD33/sonori

Sonori is a fully local STT app for Linux (Wayland).

40
Emerging
1469 gheyret/UQSpeechDataset

Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット

40
Emerging
1470 izwi-ai/izwi

On-device AI engine for transcription, TTS, and voice workflows.

40
Emerging
1471 Nighthawk42/mOrpheus

Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.

40
Emerging
1472 aws-samples/sample-voicebot-nova-sonic

A sample implementation of real-time voice assistant using Amazon Nova 2...

40
Emerging
1473 dsi-icl/do-voice-interaction

The goal of this project is to provide a voice assistant to the Data...

40
Emerging
1474 kaituoxu/Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End...

40
Emerging
1475 bgArray/ZhiYin

知音 - AI音频听觉功能集成软件。提供声乐技术识别分析、伴奏分离等伴奏多种工具。

40
Emerging
1476 Labmem-Zhouyx/CDFSE_FastSpeech2

The Official Implementation of “Content-Dependent Fine-Grained Speaker...

40
Emerging
1477 speechly/speechly

Client libraries, examples and demos of Speechly API for the Web.

40
Emerging
1478 domesticatedviking/TextyMcSpeechy

Easily create Piper text-to-speech models in any voice. Make a...

40
Emerging
1479 thinh-vu/ur_audio_sub

Generate text captions for audio files & youtube video using OpenAI Whisper...

40
Emerging
1480 lucascamillomd/anki-tts

A free, open-source app for Anki text-to-speech in MacOS.

40
Emerging
1481 tugstugi/mongolian-speech-recognition

Mongolian speech recognition with PyTorch

40
Emerging
1482 loretoparisi/wave2vec-recognize-docker

Wave2vec 2.0 Recognize pipeline

40
Emerging
1483 Baidu-AIP/speech-tts-cors

百度语音 语音合成 跨域demo以及支持库

40
Emerging
1484 HeyHeyChicken/NOVA-Python

NOVA is a customizable voice assistant made with Python.

40
Emerging
1485 mmpneo/curses

Speech to Text and KB input captions for OBS, VRChat, Twitch chat and Discord

40
Emerging
1486 Umbaji/NMTMD

Official repository for the Opensource Textdataset for NMT for local langues...

40
Emerging
1487 ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent

A lightweight voice companion, optimized for macOS.

40
Emerging
1488 p1an-lin-jung/teochew-g2p

这是一个潮州话文本端的处理工具和正字标准,主要为潮州方言的语音合成服务

40
Emerging
1489 FR33TR1ST/VoiceAssistant

A VoiceAsistant with WhisperAI speech recognition

40
Emerging
1490 wwdok/faster-whisper-webui-cn

Cloned from https://huggingface.co/spaces/aadnk/faster-whisper-webui, and...

40
Emerging
1491 tsensei/OpenReels

Open-source AI pipeline that turns any topic into a fully rendered...

40
Emerging
1492 yui-mhcp/text_to_speech

(Multi Speaker) Text-To-Speech (TTS) project

40
Emerging
1493 ritazh/EchoML

🔉 A web app to play, visualize, and annotate your audio files for machine learning

40
Emerging
1494 ahaocd/davinci-voice-clone

DaVinci Subtitle Alignment + Voice Clone + AI Emotion Optimization | CosyVoice2 TTS

40
Emerging
1495 eellak/gsoc2021-audio-annotation-tool

Creation of a multi user audio first annotation tool - GSoC 2021

40
Emerging
1496 small-cactus/Jarvis-ChatGPT-VoiceAssistant

Jarvis powered by GPT-3.5/GPT-4

40
Emerging
1497 ibm-self-serve-assets/Watson-Speech

This collection demonstrates how to help you to quickly embed Watson Speech...

40
Emerging
1498 maum-ai/wavegrad2

Unofficial Pytorch Implementation of WaveGrad2

40
Emerging
1499 carleeno/elevenlabs_tts

Custom TTS Integration using ElevenLabs API

40
Emerging
1500 awslabs/speech-representations

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

40
Emerging
« Prev 1 2 3 13 14 15 16 17 80 81 82 Next »