All Voice AI Tools

8,165 tools ranked by quality score · Page 14 of 82

Showing 1301–1400 of 8,165
# Tool Score Tier
1301 rafaballerini/AssistentePessoal

Virtual personal assistant developed with Python 🤖

42
Emerging
1302 repodiac/german_transliterate

Python module to clean and transliterate (i.e. normalize) German text...

42
Emerging
1303 lancejames221b/jarvis-voice

OpenJarvis — Real-time AI voice assistant for Discord. Talk to the same...

42
Emerging
1304 ranchlai/mandarin-tts

Chinese Mandarin TTS (text-to-speech), Chinese (Mandarin) speech synthesis, by FastSpeech 2,...

42
Emerging
1305 atomicoo/PTTS-WebAPP

Parallel TTS web demo based on Flask + Vue (Vuetify). A single-page speech synthesis demo project based on Flask + Vue.

42
Emerging
1306 Skeli010/GaryTTS

Powerful, free local text-to-speech software

42
Emerging
1307 puff-dayo/Kokoro-82M-Android

A minimal Android demo app for Kokoro-TTS

42
Emerging
1308 NateRickard/Xamarin.Cognitive.Speech

A client library that makes it easy to work with the Microsoft Cognitive...

42
Emerging
1309 sksalahuddin2828/AI_Personal_Digital_Assistant

AI Personal Voice Assistant Project (Male - Female version)

42
Emerging
1310 Youdef20/voxtral.c

🔊 Streamline audio processing with Voxtral.c, a pure C implementation for...

42
Emerging
1311 aahl/qwen-tts2api

🗣️ Qwen TTS to OpenAI Speech API

42
Emerging
1312 wq2012/SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

42
Emerging
1313 tikhonp/yandex-speechkit-lib-python

Python SDK for Yandex Speechkit API.

42
Emerging
1314 BlinkTagInc/gtfs-tts

Review GTFS stop pronunciations to determine which stops need a tts_stop_name value.

42
Emerging
1315 scart97/thunder-speech

A hackable speech recognition library.

42
Emerging
1316 showlab/whisperVideo

Find out who said what in the video.

42
Emerging
1317 PyThaiNLP/tts-thai

Thai TTS

42
Emerging
1318 googlecreativelab/obvi

A Polymer 3+ webcomponent / button for doing speech recognition

42
Emerging
1319 twilio-labs/sample-autopilot-voice-ivr

Voice-Powered IVR Chatbot with Autopilot

42
Emerging
1320 ErcinDedeoglu/WhisperDock

Dockerized Whisper C++ speech-to-text API for easy deployment and rapid...

42
Emerging
1321 SteTR/Emost-Bot

Discord Music Bot using Voice Recognition to receive commands.

42
Emerging
1322 kamiazya/ngx-speech-recognition

Angular 5+ speech recognition service (based on browser implementation such...

42
Emerging
1323 jordicor/santa-claus-is-calling

A magical Christmas experience where Santa Claus (AI with Santa's voice)...

42
Emerging
1324 hcy71o/AutoVocoder

Autovocoder: Fast Waveform Generation from a Learned Speech Representation...

42
Emerging
1325 nipponjo/tts_arabic

🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python...

42
Emerging
1326 everydaycodings/MimicMania

MimicMania is a web application that allows you to generate speech and clone...

42
Emerging
1327 linagora-labs/ssak

SSAK contains helpers and tools to process data and train/infer ASR models.

42
Emerging
1328 kristofferv98/VoiceProcessingToolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for...

42
Emerging
1329 ringger/transcribe-critic

Multi-source transcript merging inspired by textual criticism — LLM...

41
Emerging
1330 WilleIshere/SimplerKokoro

A Python package that makes it easy to use the Kokoro voice synthesis library.

41
Emerging
1331 huckiyang/Voice2Series-Reprogramming

ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...

41
Emerging
1332 AkojimaSLP/Beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR and CGMM-MVDR

41
Emerging
1333 gittyeric/FAlexa

Create your own verbal commands that fuzzily map to custom Javascript /...

41
Emerging
1334 book000/audio-transcriber-docker

Automatically transcribe the audio of video / audio files using Speech Recognition.

41
Emerging
1335 jing332/tts-server-go

Forwards the Microsoft TTS service so that Microsoft TTS / Edge Read Aloud can be listened to in a reading app via network import

41
Emerging
1336 Saganaki22/ComfyUI-Step_Audio_EditX_TTS

ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice...

41
Emerging
1337 gianpaj/sexyvoice

Voice Cloning, Voice Call and Text to Speech platform. Perfect for content...

41
Emerging
1338 CoffeeVampir3/audiocraft-webui

Quick webui for audiocraft

41
Emerging
1339 seven-io/net-client

Official .NET API Client for seven

41
Emerging
1340 nabz0r/mac-local-translator

Local translation app for Mac using speech recognition and offline translation

41
Emerging
1341 mostafa-kermaninia/speech-processing-toolkit

A comprehensive machine learning pipeline for robust Speaker Identification...

41
Emerging
1342 sotelo/parrot

RNN-based generative models for speech.

41
Emerging
1343 TeamAudio/reaspeech

Speech recognition for REAPER

41
Emerging
1344 bishop-ai/bishop-ai

Voice and text virtual assistant

41
Emerging
1345 Lastorder-DC/chatreader-kor

A robot that reads chat aloud

41
Emerging
1346 spokestack/spokestack-ios

Spokestack: give your iOS app a voice interface!

41
Emerging
1347 HenestrosaDev/audiotext

A desktop application that transcribes audio from files, microphone input or...

41
Emerging
1348 jianchang512/fireredasr-ui

A Chinese speech-to-text project, wrapping FireRedASR

41
Emerging
1349 WangHelin1997/SSR-Speech

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

41
Emerging
1350 COBACOBAINI/vibe

Transcribe audio and video offline with OpenAI Whisper on your device,...

41
Emerging
1351 hubendubler/gTTS.js

A Promise based Node.js/TypeScript port of the gTTS Google-Text-To-Speech...

41
Emerging
1352 FontaineRiant/wrAIter

AI writing assistant with voiced narrator and characters and an illustrator

41
Emerging
1353 JasonLovesDoggo/Flow

Native macOS dictation that captures audio, transcribes speech, and formats...

41
Emerging
1354 DeeepMaker/subtitle-to-audio

A Python script to generate .wav audio files for .srt subtitle files

41
Emerging
1355 alsrb0607/KoreanSTT

Development of a Korean speech recognition model using kospeech

41
Emerging
1356 MikeyParton/react-speech-kit

React hooks for Speech Recognition and Speech Synthesis

41
Emerging
1357 botbahlul/pyvosklivesubtitle

PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23...

41
Emerging
1358 botbahlul/VOSK-Powered-Live-Subtitle-V3

ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free...

41
Emerging
1359 OwenEdwards/videojs-speak-descriptions-track

A Video.js 7 middleware that uses browser speech synthesis to speak...

41
Emerging
1360 Johnson145/voxtral_wyoming

Offline Speech-to-Text (STT) service using Mistral's Voxtral model with...

41
Emerging
1361 gdoudeng/react-native-baidu-asr

The react-native Baidu voice library provides voice recognition, voice...

41
Emerging
1362 XimilalaXiang/DeLive

DeLive is a cross-platform desktop app that captures system audio output and...

41
Emerging
1363 OpenMOSS/MOSS-Audio-Tokenizer

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on...

41
Emerging
1364 georgezhao2010/apple_airplayer

Use your AirPlay devices as TTS speakers

41
Emerging
1365 totalvoice/totalvoice-php

PHP client for the Totalvoice API

41
Emerging
1366 MainRo/docker-deepspeech-server

A Dockerfile to run deepspeech-server

41
Emerging
1367 aks-devs/mod_openai_asr

Freeswitch Speech-To-Text module

41
Emerging
1368 hhguo/MSMC-TTS

Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS

41
Emerging
1369 TartuNLP/text-to-speech-api

REST API for neural text-to-speech synthesis

41
Emerging
1370 finos/greenkey-asrtoolkit

A collection of useful tools for handling speech recognition data

41
Emerging
1371 AIFSH/ComfyUI-FishSpeech

A custom ComfyUI node for fish-speech

41
Emerging
1372 OwenTyme/voice-zero

Collection of samples suitable for use with zero-shot text to speech engines.

41
Emerging
1373 revdotcom/reverb

Open source inference code for Rev's model

41
Emerging
1374 yxshee/speech-command-recognition

Speech command recognition using CNNs, with preprocessing, model training,...

41
Emerging
1375 kapi2800/qwen3-tts-apple-silicon

Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning,...

41
Emerging
1376 kgnlp/allophant

A multilingual phoneme recognizer capable of generalizing zero-shot to...

41
Emerging
1377 fqueis/pollinationsai

🔥 TypeScript SDK wrapper for Pollinations AI services

41
Emerging
1378 HectorPulido/chatbot-with-voice

Jarvis-like chatbot with voice

41
Emerging
1379 rtzr/Awesome-Korean-Speech-Recognition

List of Korean speech recognition (STT) APIs, with performance benchmarks for each.

41
Emerging
1380 amitdev01/awesome-voice-ai

Awesome Voice AI

41
Emerging
1381 petewarden/spchcat

Speech recognition tool to convert audio to text transcripts, for Linux and...

41
Emerging
1382 tuan3w/cnn_vocoder

A fast CNN-based vocoder

41
Emerging
1383 alamparelli/mcp-claude-say

Voice interaction for Claude Code - Talk to Claude and hear responses using...

41
Emerging
1384 kahne/SpeechTransProgress

Tracking the progress in end-to-end speech translation

41
Emerging
1385 forfrt/SteerMoE

SteerMoE: Efficient Audio-Language Models with Preserved Reasoning Capabilities

41
Emerging
1386 Edw590/VISOR---Android-Version-Assistant

V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!

41
Emerging
1387 mobassir94/comprehensive-bangla-tts

Aiming to achieve the ultimate multilingual TTS pipeline with a main focus on...

41
Emerging
1388 dpm76/QuickRouteMap

Simple route guidance application.

41
Emerging
1389 18F/dol-whd-14c

The 14(c) system will become a modern, digital-first service. Applicants...

41
Emerging
1390 priyanujgogoi-28/flowery-tts

Wrapper of Flowery Text to Speech API for Dart

41
Emerging
1391 Yuan-ManX/audio-development-tools

Audio Development Tools (ADT) is a project for advancing sound, speech, and...

41
Emerging
1392 solaoi/lycoris

Real-time speech recognition & AI-powered note-taking app for macOS with...

41
Emerging
1393 arpy8/ESP32_Voice_Assistant

This project combines embedded systems and AI inference to create an...

41
Emerging
1394 dsfsi/dsfsi-datasets

Official DSFSI Public Datasets Registry - Comprehensive catalog of 50+...

41
Emerging
1395 TheMorpheus407/OpenAI-Audiobook-Generator

This project is a web-based application that converts text into audio,...

41
Emerging
1396 TartuNLP/text-to-speech-worker

Estonian multi-speaker neural text-to-speech worker that processes requests...

41
Emerging
1397 Pranjalya/tts-tortoise-gradio

A Gradio setup for Tortoise TTS.

41
Emerging
1398 ardha27/AI-Waifu-Vtuber

AI Vtuber for Streaming on Youtube/Twitch

41
Emerging
1399 yeahhe365/PageTalk

A Chrome extension that provides seamless speech-to-text on any web page, using an advanced ASR API.

41
Emerging
1400 JoelShine/Jarvis-v2.0

This is a major update of my project JARVIS-The-Ultimate-Project. You can...

41
Emerging