All Voice AI Tools

8,165 tools ranked by quality score · Page 13 of 82

Showing 1201–1300 of 8,165
# Tool Score Tier
1201 HordRicJr/HordVoice

HordVoice - AI-powered voice assistant built with Flutter and Azure AI...

43
Emerging
1202 Baidu-AIP/speech-demo

语音api示例

43
Emerging
1203 teamsudocode/dexter

Let your talking do the code

43
Emerging
1204 Markfryazino/wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

43
Emerging
1205 ZeroneBit/Edge-TTS-Net

Use Microsoft Edge's online text-to-speech service from .NET WITHOUT needing...

43
Emerging
1206 youmebangbang/TTS-dataset-tools

Automatically generates TTS dataset using audio and associated text. Make...

43
Emerging
1207 IBM/watson-streaming-stt

Example of using Watson's Streaming Speech to Text websockets interface for...

43
Emerging
1208 gunarakulangunaretnam/real-time-language-translator

A voice recognition-based tool for translating languages in real-time.

43
Emerging
1209 jianchang512/chatterbox-api

一个基于 Chatterbox-TTS的文字转语音(TTS)服务。提供与 OpenAI TTS 兼容的 API 接口并支持声音克隆,附带简洁的 Web 用户界面。

43
Emerging
1210 hddevteam/speechify

🎧 Text-to-speech VS Code extension with 200+ Azure voices, TypeScript...

42
Emerging
1211 jackaduma/LAS_Mandarin_PyTorch

Listen, attend and spell Model and a Chinese Mandarin Pretrained model ...

42
Emerging
1212 kamilc/speech-recognition

Companion repository for the blog article:...

42
Emerging
1213 amd/LIRA

This tool helps you easily deploy ASR models on NPUs on AMD's Ryzen AI 300...

42
Emerging
1214 lperezmo/real-time-translator

A quick app to translate speech in real time using the Whisper API for...

42
Emerging
1215 USStateDept/State-TalentMAP

A comprehensive research, bidding, and matching system to match Foreign...

42
Emerging
1216 vb000/Waveformer

A deep neural network architecture for low-latency audio processing

42
Emerging
1217 Gauff/EpubToAudioBookConverter

Convert EPUB files to MP3 audio books with ease using this intuitive and...

42
Emerging
1218 Bebra777228/PolGen-RVC

Преобразование голоса на основе VITS. Ориентировано на простоту, качество и...

42
Emerging
1219 cxyfer/GeminiASR

A Python tool that uses Google Gemini API to transcribe video or audio files...

42
Emerging
1220 Rongjiehuang/Multi-Singer

PyTorch Implementation of Multi-Singer (ACM-MM'21)

42
Emerging
1221 botany-labs/voice-ai-js-starter

Starter project for building real-time AI Voice Assistants

42
Emerging
1222 ProsusAI/project-echo

An AI-powered voice director assistant for creating engaging audio content...

42
Emerging
1223 mrtozner/vox

Local voice AI framework for Rust. Whisper + LLM + TTS with no cloud dependencies.

42
Emerging
1224 IBM/BigLittleNet

Official repository for Big-Little Net

42
Emerging
1225 sc0ty/subsync

Subtitle Speech Synchronizer

42
Emerging
1226 tempo-riz/deepgram_speech_to_text

A Deepgram client for Dart and Flutter, supporting all Speech-to-Text and...

42
Emerging
1227 Saganaki22/ComfyUI-Maya1_TTS

A ComfyUI node for Maya1, a 3B-parameter speech model built for expressive...

42
Emerging
1228 CiscoDevNet/g2p_seq2seq_pytorch

Grapheme to phoneme model for PyTorch

42
Emerging
1229 Gyyyn/OpenWebTTS

Open source Speechify alternative. Read PDFs and EPUBs with local models.

42
Emerging
1230 keonlee9420/Daft-Exprt

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across...

42
Emerging
1231 LitoMore/mac-say

The macOS built-in `say` interface for JavaScript

42
Emerging
1232 keonlee9420/FastPitchFormant

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based...

42
Emerging
1233 NeuralFalconYT/Video-Dubbing

Since most video dubbing services are paid, this project explores an...

42
Emerging
1234 codyw912/open-asr-server

OpenAI-compatible ASR server with pluggable local backends (Parakeet,...

42
Emerging
1235 team-telnyx/ai

Official one-stop shop for AI Agents and developers building with Telnyx.

42
Emerging
1236 seven-io/js-client

Official JavaScript API Client for seven.io

42
Emerging
1237 GoogleCloudPlatform/text-to-speech-epg-demo

This repository contains a reference implementation demonstrating how the...

42
Emerging
1238 BogiHsu/WG-WaveNet

Real-Time High-Fidelity Speech Synthesis without GPU

42
Emerging
1239 aviaryan/voice-writing-electron

A real-time, instant dictation desktop application built on Electron that...

42
Emerging
1240 WangYixuan12/openai_tts

OpenAI Text-to-Speech Interface

42
Emerging
1241 34j/neural-source-filter

Python package for NSF and NSF-HiFi-GAN (unofficial)

42
Emerging
1242 mush42/optispeech

A lightweight end-to-end text-to-speech model

42
Emerging
1243 jinserk/pytorch-asr

ASR with PyTorch

42
Emerging
1244 spokestack/react-native-spokestack

Spokestack: give your React Native app a voice interface!

42
Emerging
1245 sberdevices/assistant-client

Инструмент для тестирования и отладки СanvasApps — навыков семейства...

42
Emerging
1246 DanRuta/xVA-Synth

Machine learning based speech synthesis Electron app, with voices from...

42
Emerging
1247 deepgram-starters/go-voice-agent

Get started using Deepgram's Voice Agent with this Go demo app

42
Emerging
1248 mailong25/self-supervised-speech-recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

42
Emerging
1249 astramind-ai/Auralis

A Fast TTS Engine

42
Emerging
1250 primaryobjects/voice-gender

Gender recognition by voice and speech analysis

42
Emerging
1251 googlecreativelab/morse-speak-demo

Text-to-Speech (TTS) demo web app that converts written text into spoken...

42
Emerging
1252 Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to...

42
Emerging
1253 MyrtleSoftware/deepspeech

A PyTorch implementation of DeepSpeech and DeepSpeech2.

42
Emerging
1254 apinge/MeloTTS.cpp

A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO,...

42
Emerging
1255 VoXera/VoXera

An Open-Source Persian Language Techs Toolkit with Python

42
Emerging
1256 moulish-dev/vita

Plug-and-play TTS integration toolkit powered by Kokoro-82M. Python + CLI...

42
Emerging
1257 ayutaz/uPiper

Unity TTS plugin: Piper neural synthesis + pure C# G2P (Japanese/English) +...

42
Emerging
1258 patrickmonteiro/quasar-speech-api

🎤 🔉 Projeto de um SPA desenvolvido com Quasar Framework 1.0 + Speech API...

42
Emerging
1259 spotify/basic-pitch-ts

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection.

42
Emerging
1260 nodef/wikipedia-tts

Crawl Wikipedia pages and upload TTS to Youtube.

42
Emerging
1261 weespin/WillFromAfarDownloader

acapellabox pwned.

42
Emerging
1262 mravanelli/pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid...

42
Emerging
1263 34j/mecab-text-cleaner

Simple Python package (CLI/Python API) for getting japanese readings...

42
Emerging
1264 ybouhjira/claude-code-tts

🔊 Text-to-Speech MCP plugin for Claude Code - hear audio feedback while...

42
Emerging
1265 ActiveNick/Unity-MS-SpeechSDK

Sample Unity project used to demonstrate Speech Recognition using the new...

42
Emerging
1266 phyce/Narration-Studio

Narration Studio, your all in one TTS Solution!

42
Emerging
1267 sljavi/handsfree-for-web-zoom-module

Zoom module implementation for Handsfree for web

42
Emerging
1268 mobilequickie/AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and...

42
Emerging
1269 keonlee9420/StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive...

42
Emerging
1270 belambert/asr-tools

Libraries and scripts for manipulating and handling ASR output/n-bests/etc.

42
Emerging
1271 fizamusthafa/whisper-app

This repository contains a web application for multi-lingual transcription...

42
Emerging
1272 Bunlong/react-webspeech

The official WebSpeech for React.

42
Emerging
1273 ioBroker/ioBroker.sonus

Control ioBroker with voice

42
Emerging
1274 SameeraMurthy/sanskrit-tts

Generate Text-to-Speech for Sanskrit

42
Emerging
1275 gachi0/konishiTTS

VOICEVOXを使用したのDiscordの読み上げbot

42
Emerging
1276 EvilFreelancer/docker-whisper-server

whisper.cpp HTTP transcription server with OpenAI-like API in Docker

42
Emerging
1277 litongjava/whisper-cpp-server

whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper...

42
Emerging
1278 sebastienrousseau/akande

An innovative, open-source voice assistant powered by OpenAI's GPT-3,...

42
Emerging
1279 charlesliucn/awesome-end2end-asr

💬 A list of End-to-End speech recognition, including papers, codes and other...

42
Emerging
1280 LynxLine/qtspeech

QtSpeech is cross-platform library based on Qt to provide common...

42
Emerging
1281 neosapience/editts

Official implementation of EdiTTS: Score-based Editing for Controllable...

42
Emerging
1282 michaelzhang-ai/Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with...

42
Emerging
1283 Detoxfox4234/Qwen3-Voice-Factory

Local, portable GUI for Qwen3-TTS. Optimized for NVIDIA RTX 50 Series (CUDA...

42
Emerging
1284 wdbm/deep_throat

speech synthesis program

42
Emerging
1285 sberdevices/smart_app_framework

SmartApp Framework для создания навыков семейства Виртуальных Ассистентов...

42
Emerging
1286 keonlee9420/Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning...

42
Emerging
1287 Navatusein/Silero-TTS-Service

Silero TTS backend service. Can be used with Home Assistant and Rhasspy.

42
Emerging
1288 declare-lab/jamify

JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and...

42
Emerging
1289 shekit/alexa-sign-language-translator

A project to make Amazon Echo respond to sign language using your webcam

42
Emerging
1290 lemonadeforlife/nerminal

A simple lightweight & efficient voice assistant built with Python & Vosk.

42
Emerging
1291 DangerDaza/Dooms-Enhancement-Suite

An immersive RPG enhancement extension for SillyTavern — character tracking,...

42
Emerging
1292 mapbox/mapbox-speech-swift

Natural-sounding text-to-speech in Swift or Objective-C on iOS, macOS, tvOS,...

42
Emerging
1293 BonifacioCalindoro/whatsapp-AI-assistant

AI assistant that reads you whatsapp conversations and audio messages, and...

42
Emerging
1294 sayksii/Aria

ARIA - AI Realtime Intelligent Audio | Universal real-time AI subtitles for Windows

42
Emerging
1295 voice-engine/make-a-smart-speaker

A collection of resources to make a smart speaker

42
Emerging
1296 Mobile-Artificial-Intelligence/babylon

Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and...

42
Emerging
1297 coqui-ai/stt-model-manager

Coqui STT Model Manager - install, manage and try out Coqui STT models from...

42
Emerging
1298 skirdey/voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

42
Emerging
1299 trungnguyen21/AutomatedYoutubeShorts

Automatically Generate video based on given content!

42
Emerging
1300 SlashNephy/SimpleVoiceroid2Proxy

VOICEROID 2 を HTTP API で操作できます

42
Emerging
« Prev 1 2 3 11 12 13 14 15 80 81 82 Next »