All Voice AI Tools
8,165 tools ranked by quality score · Page 13 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 1201 |
HordRicJr/HordVoice
HordVoice - AI-powered voice assistant built with Flutter and Azure AI... |
|
Emerging |
| 1202 |
Baidu-AIP/speech-demo
语音api示例 |
|
Emerging |
| 1203 |
teamsudocode/dexter
Let your talking do the code |
|
Emerging |
| 1204 |
Markfryazino/wav2lip-hq
Extension of Wav2Lip repository for processing high-quality videos. |
|
Emerging |
| 1205 |
ZeroneBit/Edge-TTS-Net
Use Microsoft Edge's online text-to-speech service from .NET WITHOUT needing... |
|
Emerging |
| 1206 |
youmebangbang/TTS-dataset-tools
Automatically generates TTS dataset using audio and associated text. Make... |
|
Emerging |
| 1207 |
IBM/watson-streaming-stt
Example of using Watson's Streaming Speech to Text websockets interface for... |
|
Emerging |
| 1208 |
gunarakulangunaretnam/real-time-language-translator
A voice recognition-based tool for translating languages in real-time. |
|
Emerging |
| 1209 |
jianchang512/chatterbox-api
一个基于 Chatterbox-TTS的文字转语音(TTS)服务。提供与 OpenAI TTS 兼容的 API 接口并支持声音克隆,附带简洁的 Web 用户界面。 |
|
Emerging |
| 1210 |
hddevteam/speechify
🎧 Text-to-speech VS Code extension with 200+ Azure voices, TypeScript... |
|
Emerging |
| 1211 |
jackaduma/LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model ... |
|
Emerging |
| 1212 |
kamilc/speech-recognition
Companion repository for the blog article:... |
|
Emerging |
| 1213 |
amd/LIRA
This tool helps you easily deploy ASR models on NPUs on AMD's Ryzen AI 300... |
|
Emerging |
| 1214 |
lperezmo/real-time-translator
A quick app to translate speech in real time using the Whisper API for... |
|
Emerging |
| 1215 |
USStateDept/State-TalentMAP
A comprehensive research, bidding, and matching system to match Foreign... |
|
Emerging |
| 1216 |
vb000/Waveformer
A deep neural network architecture for low-latency audio processing |
|
Emerging |
| 1217 |
Gauff/EpubToAudioBookConverter
Convert EPUB files to MP3 audio books with ease using this intuitive and... |
|
Emerging |
| 1218 |
Bebra777228/PolGen-RVC
Преобразование голоса на основе VITS. Ориентировано на простоту, качество и... |
|
Emerging |
| 1219 |
cxyfer/GeminiASR
A Python tool that uses Google Gemini API to transcribe video or audio files... |
|
Emerging |
| 1220 |
Rongjiehuang/Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21) |
|
Emerging |
| 1221 |
botany-labs/voice-ai-js-starter
Starter project for building real-time AI Voice Assistants |
|
Emerging |
| 1222 |
ProsusAI/project-echo
An AI-powered voice director assistant for creating engaging audio content... |
|
Emerging |
| 1223 |
mrtozner/vox
Local voice AI framework for Rust. Whisper + LLM + TTS with no cloud dependencies. |
|
Emerging |
| 1224 |
IBM/BigLittleNet
Official repository for Big-Little Net |
|
Emerging |
| 1225 |
sc0ty/subsync
Subtitle Speech Synchronizer |
|
Emerging |
| 1226 |
tempo-riz/deepgram_speech_to_text
A Deepgram client for Dart and Flutter, supporting all Speech-to-Text and... |
|
Emerging |
| 1227 |
Saganaki22/ComfyUI-Maya1_TTS
A ComfyUI node for Maya1, a 3B-parameter speech model built for expressive... |
|
Emerging |
| 1228 |
CiscoDevNet/g2p_seq2seq_pytorch
Grapheme to phoneme model for PyTorch |
|
Emerging |
| 1229 |
Gyyyn/OpenWebTTS
Open source Speechify alternative. Read PDFs and EPUBs with local models. |
|
Emerging |
| 1230 |
keonlee9420/Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across... |
|
Emerging |
| 1231 |
LitoMore/mac-say
The macOS built-in `say` interface for JavaScript |
|
Emerging |
| 1232 |
keonlee9420/FastPitchFormant
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based... |
|
Emerging |
| 1233 |
NeuralFalconYT/Video-Dubbing
Since most video dubbing services are paid, this project explores an... |
|
Emerging |
| 1234 |
codyw912/open-asr-server
OpenAI-compatible ASR server with pluggable local backends (Parakeet,... |
|
Emerging |
| 1235 |
team-telnyx/ai
Official one-stop shop for AI Agents and developers building with Telnyx. |
|
Emerging |
| 1236 |
seven-io/js-client
Official JavaScript API Client for seven.io |
|
Emerging |
| 1237 |
GoogleCloudPlatform/text-to-speech-epg-demo
This repository contains a reference implementation demonstrating how the... |
|
Emerging |
| 1238 |
BogiHsu/WG-WaveNet
Real-Time High-Fidelity Speech Synthesis without GPU |
|
Emerging |
| 1239 |
aviaryan/voice-writing-electron
A real-time, instant dictation desktop application built on Electron that... |
|
Emerging |
| 1240 |
WangYixuan12/openai_tts
OpenAI Text-to-Speech Interface |
|
Emerging |
| 1241 |
34j/neural-source-filter
Python package for NSF and NSF-HiFi-GAN (unofficial) |
|
Emerging |
| 1242 |
mush42/optispeech
A lightweight end-to-end text-to-speech model |
|
Emerging |
| 1243 |
jinserk/pytorch-asr
ASR with PyTorch |
|
Emerging |
| 1244 |
spokestack/react-native-spokestack
Spokestack: give your React Native app a voice interface! |
|
Emerging |
| 1245 |
sberdevices/assistant-client
Инструмент для тестирования и отладки СanvasApps — навыков семейства... |
|
Emerging |
| 1246 |
DanRuta/xVA-Synth
Machine learning based speech synthesis Electron app, with voices from... |
|
Emerging |
| 1247 |
deepgram-starters/go-voice-agent
Get started using Deepgram's Voice Agent with this Go demo app |
|
Emerging |
| 1248 |
mailong25/self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework |
|
Emerging |
| 1249 |
astramind-ai/Auralis
A Fast TTS Engine |
|
Emerging |
| 1250 |
primaryobjects/voice-gender
Gender recognition by voice and speech analysis |
|
Emerging |
| 1251 |
googlecreativelab/morse-speak-demo
Text-to-Speech (TTS) demo web app that converts written text into spoken... |
|
Emerging |
| 1252 |
Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to... |
|
Emerging |
| 1253 |
MyrtleSoftware/deepspeech
A PyTorch implementation of DeepSpeech and DeepSpeech2. |
|
Emerging |
| 1254 |
apinge/MeloTTS.cpp
A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO,... |
|
Emerging |
| 1255 |
VoXera/VoXera
An Open-Source Persian Language Techs Toolkit with Python |
|
Emerging |
| 1256 |
moulish-dev/vita
Plug-and-play TTS integration toolkit powered by Kokoro-82M. Python + CLI... |
|
Emerging |
| 1257 |
ayutaz/uPiper
Unity TTS plugin: Piper neural synthesis + pure C# G2P (Japanese/English) +... |
|
Emerging |
| 1258 |
patrickmonteiro/quasar-speech-api
🎤 🔉 Projeto de um SPA desenvolvido com Quasar Framework 1.0 + Speech API... |
|
Emerging |
| 1259 |
spotify/basic-pitch-ts
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection. |
|
Emerging |
| 1260 |
nodef/wikipedia-tts
Crawl Wikipedia pages and upload TTS to Youtube. |
|
Emerging |
| 1261 |
weespin/WillFromAfarDownloader
acapellabox pwned. |
|
Emerging |
| 1262 |
mravanelli/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid... |
|
Emerging |
| 1263 |
34j/mecab-text-cleaner
Simple Python package (CLI/Python API) for getting japanese readings... |
|
Emerging |
| 1264 |
ybouhjira/claude-code-tts
🔊 Text-to-Speech MCP plugin for Claude Code - hear audio feedback while... |
|
Emerging |
| 1265 |
ActiveNick/Unity-MS-SpeechSDK
Sample Unity project used to demonstrate Speech Recognition using the new... |
|
Emerging |
| 1266 |
phyce/Narration-Studio
Narration Studio, your all in one TTS Solution! |
|
Emerging |
| 1267 |
sljavi/handsfree-for-web-zoom-module
Zoom module implementation for Handsfree for web |
|
Emerging |
| 1268 |
mobilequickie/AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and... |
|
Emerging |
| 1269 |
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive... |
|
Emerging |
| 1270 |
belambert/asr-tools
Libraries and scripts for manipulating and handling ASR output/n-bests/etc. |
|
Emerging |
| 1271 |
fizamusthafa/whisper-app
This repository contains a web application for multi-lingual transcription... |
|
Emerging |
| 1272 |
Bunlong/react-webspeech
The official WebSpeech for React. |
|
Emerging |
| 1273 |
ioBroker/ioBroker.sonus
Control ioBroker with voice |
|
Emerging |
| 1274 |
SameeraMurthy/sanskrit-tts
Generate Text-to-Speech for Sanskrit |
|
Emerging |
| 1275 |
gachi0/konishiTTS
VOICEVOXを使用したのDiscordの読み上げbot |
|
Emerging |
| 1276 |
EvilFreelancer/docker-whisper-server
whisper.cpp HTTP transcription server with OpenAI-like API in Docker |
|
Emerging |
| 1277 |
litongjava/whisper-cpp-server
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper... |
|
Emerging |
| 1278 |
sebastienrousseau/akande
An innovative, open-source voice assistant powered by OpenAI's GPT-3,... |
|
Emerging |
| 1279 |
charlesliucn/awesome-end2end-asr
💬 A list of End-to-End speech recognition, including papers, codes and other... |
|
Emerging |
| 1280 |
LynxLine/qtspeech
QtSpeech is cross-platform library based on Qt to provide common... |
|
Emerging |
| 1281 |
neosapience/editts
Official implementation of EdiTTS: Score-based Editing for Controllable... |
|
Emerging |
| 1282 |
michaelzhang-ai/Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with... |
|
Emerging |
| 1283 |
Detoxfox4234/Qwen3-Voice-Factory
Local, portable GUI for Qwen3-TTS. Optimized for NVIDIA RTX 50 Series (CUDA... |
|
Emerging |
| 1284 |
wdbm/deep_throat
speech synthesis program |
|
Emerging |
| 1285 |
sberdevices/smart_app_framework
SmartApp Framework для создания навыков семейства Виртуальных Ассистентов... |
|
Emerging |
| 1286 |
keonlee9420/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning... |
|
Emerging |
| 1287 |
Navatusein/Silero-TTS-Service
Silero TTS backend service. Can be used with Home Assistant and Rhasspy. |
|
Emerging |
| 1288 |
declare-lab/jamify
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and... |
|
Emerging |
| 1289 |
shekit/alexa-sign-language-translator
A project to make Amazon Echo respond to sign language using your webcam |
|
Emerging |
| 1290 |
lemonadeforlife/nerminal
A simple lightweight & efficient voice assistant built with Python & Vosk. |
|
Emerging |
| 1291 |
DangerDaza/Dooms-Enhancement-Suite
An immersive RPG enhancement extension for SillyTavern — character tracking,... |
|
Emerging |
| 1292 |
mapbox/mapbox-speech-swift
Natural-sounding text-to-speech in Swift or Objective-C on iOS, macOS, tvOS,... |
|
Emerging |
| 1293 |
BonifacioCalindoro/whatsapp-AI-assistant
AI assistant that reads you whatsapp conversations and audio messages, and... |
|
Emerging |
| 1294 |
sayksii/Aria
ARIA - AI Realtime Intelligent Audio | Universal real-time AI subtitles for Windows |
|
Emerging |
| 1295 |
voice-engine/make-a-smart-speaker
A collection of resources to make a smart speaker |
|
Emerging |
| 1296 |
Mobile-Artificial-Intelligence/babylon
Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and... |
|
Emerging |
| 1297 |
coqui-ai/stt-model-manager
Coqui STT Model Manager - install, manage and try out Coqui STT models from... |
|
Emerging |
| 1298 |
skirdey/voicerestore
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration |
|
Emerging |
| 1299 |
trungnguyen21/AutomatedYoutubeShorts
Automatically Generate video based on given content! |
|
Emerging |
| 1300 |
SlashNephy/SimpleVoiceroid2Proxy
VOICEROID 2 を HTTP API で操作できます |
|
Emerging |