All Voice AI Tools
8,165 tools ranked by quality score · Page 25 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2401 |
mascotbot/elevenlabs-avatar
Open-source example for integrating ElevenLabs conversational AI with... |
|
Emerging |
| 2402 |
adeepak7/Speech-To-Code
Speech To Code is Google Chrome Extension to convert Speech into Code. |
|
Emerging |
| 2403 |
Ggorets0dev/rantovox-telegram-bot
Telegram bot for text-to-speech and speech-to-speech translation, works with... |
|
Emerging |
| 2404 |
LuluW8071/VocalMind
Automatic Speech Recognition using Conformer with Speech Sentiment Analysis... |
|
Emerging |
| 2405 |
nuhmanpk/PyttsBot
A Pyrogram Bot for gtts module, Text to speech Telegram bot. |
|
Emerging |
| 2406 |
trabdlkarim/voce-browser
Voice Controlled Chromium Web Browser |
|
Emerging |
| 2407 |
agentvoiceresponse/avr-asr-vosk
This repository provides a real-time speech-to-text transcription service... |
|
Emerging |
| 2408 |
candlewill/AiVoice
Deep CNN networks for Speech Synthesis |
|
Emerging |
| 2409 |
nickpending/clarvis
Jarvis-style voice notifications for Claude Code that transforms AI... |
|
Emerging |
| 2410 |
philsyn/DiffWave-Vocoder
Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and... |
|
Emerging |
| 2411 |
FlutterHack20/FlutterBand
Flutter built retro cyberpunk CB Radio App for Hack20 Flutter Hackathon.... |
|
Emerging |
| 2412 |
vliu15/adversarial-tts
End-to-end Text-to-Speech with Generative Adversarial Networks |
|
Emerging |
| 2413 |
edde746/tiktok-askreddit
A content generation & posting bot for TikTok, scraping posts from r/AskReddit |
|
Emerging |
| 2414 |
berk76/words
Voice vocabulary :gb: :de: :fr: :es: :ru: :jp: :cn: ... |
|
Emerging |
| 2415 |
audo-ai/magic-mic
Open Source Noise Cancellation App for Virtual Meetings |
|
Emerging |
| 2416 |
heymrhayes/text-to-speech
A basic Text-to-Speech app |
|
Emerging |
| 2417 |
OpenTSLab/BELLE
Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn... |
|
Emerging |
| 2418 |
messiaen/full-lattice-search
Full Text Search Over Probabilistic Lattices with Elasticsearch! |
|
Emerging |
| 2419 |
techiaith/docker-marytts
Lleisiau synthetig cadwynedig Cymraeg gyda MaryTTS a Docker // Welsh... |
|
Emerging |
| 2420 |
ReneeYe/XSTNet
This is an implementation of paper "End-to-end Speech Translation via... |
|
Emerging |
| 2421 |
akashmjn/cs224n-gpu-that-talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18) |
|
Emerging |
| 2422 |
decasteljau/waapi-text-to-speech
Wwise text-to-speech integration using external editors. |
|
Emerging |
| 2423 |
RodneyKoolman/Azure-Speech-TextToSpeech
Written in Python using the Azure Speech SDK. App.py provides an easy way to... |
|
Emerging |
| 2424 |
Blackwood416/AstraTTS
基于 ONNX Runtime 的跨平台高性能 TTS 合成方案,支持流式输出与低延迟播放,支持自定义音色与中英混合生成。 |
|
Emerging |
| 2425 |
Asaayu/integrated-voice-control-system
Integrated AI Voice Control System allows players to give commands to AI... |
|
Emerging |
| 2426 |
GlobalTechInfo/gspeak
Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies. |
|
Emerging |
| 2427 |
lpalbou/VoiceLLM
A modular Python library for voice interactions with AI systems, featuring... |
|
Emerging |
| 2428 |
luongnv89/voice-cast
Your words, any voice. Voice cloning and text-to-speech with multiple TTS... |
|
Emerging |
| 2429 |
ArdaGnsrn/elevenlabs-js
This is an Open Source NodeJS package for ElevenLabs Text to Speech API. |
|
Emerging |
| 2430 |
phanxuanphucnd/wav2asr
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task. |
|
Emerging |
| 2431 |
kssteven418/Q-ASR
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition |
|
Emerging |
| 2432 |
khuangaf/ITRI-speech-recognition-dataset-generation
Automatic Speech Recognition Dataset Generation |
|
Emerging |
| 2433 |
nvmoyar/aind2-speech-recognition
Some approaches based on deep learning to build the acoustic model for an... |
|
Emerging |
| 2434 |
botbahlul/Live-Subtitle-V2
ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free... |
|
Emerging |
| 2435 |
ShivamRajSharma/Transformer-Text-To-Speech
Pytorch implementation of Transformer-TTS for converting text into speech. |
|
Emerging |
| 2436 |
PRITHIVSAKTHIUR/Vision-to-VibeVoice-en
A Gradio-based demo for end-to-end vision-to-speech inference: Extract text... |
|
Emerging |
| 2437 |
AndreDalwin/Whisper2Summarize
Whisper2Summarize is an application that uses Whisper for audio processing... |
|
Emerging |
| 2438 |
heezes/Hand-gesture-to-speech
This project aims at providing speech to the mute people. |
|
Emerging |
| 2439 |
OpenVoiceOS/status
Open Voice OS Server Status Page |
|
Emerging |
| 2440 |
Fatma-Chaouech/audioverse
Breathe Life Into Your Books! 📚🌱 |
|
Emerging |
| 2441 |
C0NZZ/better-teletask
Browser extension that adds useful features like subtitles to HPI Tele-Task. |
|
Emerging |
| 2442 |
FNBUBBLES420-ORG/Speech-to-Text-Application
🎙️ Welcome to the Speech to Text Application! 📝 This tool converts your... |
|
Emerging |
| 2443 |
kaiidams/Voice100AndroidApp
Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and... |
|
Emerging |
| 2444 |
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on... |
|
Emerging |
| 2445 |
cjhoward/cedict-tts
TTS audio files for the CC-CEDICT Chinese-English dictionary |
|
Emerging |
| 2446 |
MichaelGrafnetter/defender-asr-admx
Administrative Template (ADMX) for Microsoft Defender Attack Surface Reduction (ASR) |
|
Emerging |
| 2447 |
LucaLuke13/TalkyBotty
Simply forward a video or voice message in any language to the bot, and it... |
|
Emerging |
| 2448 |
snowy-0wl/piper-mode
A vibe-coded text-to-speech for Emacs using the Piper TTS engine. Features... |
|
Emerging |
| 2449 |
mmpneo/simple-obs-stt
Speech-to-text and keyboard input captions for OBS. |
|
Emerging |
| 2450 |
lepisma/emacs-speech-input
Set of packages for speech and voice inputs in Emacs |
|
Emerging |
| 2451 |
khakers/go-subgen
Automatically generate subtitles for your media using whisper.cpp via... |
|
Emerging |
| 2452 |
ThetaOne-AI/HiKE
Hierarchical Korean-English Code-Switching Speech Recognition Benchmark... |
|
Emerging |
| 2453 |
kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo... |
|
Emerging |
| 2454 |
naskopw/read_aloud
A cross-platform text-to-speech library |
|
Emerging |
| 2455 |
pevers/parkiet
Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS) |
|
Emerging |
| 2456 |
ivan770/ems
EMS (External Media Server) |
|
Emerging |
| 2457 |
hacktronaut/azure-avatar-demo
Text To Speech Demo in ReactJS Application using Azure Avatar AI Service. |
|
Emerging |
| 2458 |
jeantimex/F5-TTS-Server
F5-TTS server APIs for voice cloning and text-to-speech generation with... |
|
Emerging |
| 2459 |
m-nathani/speech_to_text
how to use the Google Cloud Speech API to transcribe audio/video files. |
|
Emerging |
| 2460 |
yufan-aslp/AliMeeting
The project is associated with the recently-launched ICASSP 2022... |
|
Emerging |
| 2461 |
A-Jacobson/tacotron2
pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf |
|
Emerging |
| 2462 |
Aman22sharma/Python-AI-Virtual-Assistant
This is python AI Virtual Assistant. |
|
Emerging |
| 2463 |
ACT900/faster-whisper-railway
Deploy Faster Whisper on Railway — Speech-to-Text & Text-to-Speech API with 52 voices |
|
Emerging |
| 2464 |
yuyq96/pyshengyun
A Python converter for Chinese Pinyin and Shengyun (initials and finals) |
|
Emerging |
| 2465 |
DragonDiffusionbyBoyo/Boyonodes
A set of Comfyui nodes |
|
Emerging |
| 2466 |
go-restream/zipenhancer-rs
🚀 High-Performance Real-Time Audio Noise Reduction Library - Rust... |
|
Emerging |
| 2467 |
jorcelinojunior/whisper-vtt2srt
A robust WebVTT to SRT converter optimized for AI transcriptions (Whisper,... |
|
Emerging |
| 2468 |
jianchang512/parakeet-api
一个基于 NVIDIA Parakeet-tdt-0.6b 模型的本地语音转录服务。它提供了一个与 OpenAI API 兼容的接口和一个简洁的 Web 用户界面 |
|
Emerging |
| 2469 |
cdyangbo/end2endASR
implement end-to-end asr algorithm with tensorflow |
|
Emerging |
| 2470 |
iotjin/JhPrivacyAuthTool
隐私权限判断 - 封装了几种常用的隐私权限判断(定位服务,通讯录, 日历,提醒事项, 照片, 蓝牙共享,麦克风, 相机)和通知的注册和判断。定位服务,蓝牙共享是单独调用的 |
|
Emerging |
| 2471 |
De-Technocrats/simple-text-to-speech-javascript
Simple text to speech with javascript. |
|
Emerging |
| 2472 |
msjsc001/Anki-TTS-Edge
A modern text-to-speech tool powered by Microsoft Edge TTS. Creates Anki... |
|
Emerging |
| 2473 |
vhanagwal/speech-recognition
A speech-to-text app using AVAudioEngine. |
|
Emerging |
| 2474 |
rishikksh20/VQ-TTS-pytorch
Unofficial Pytorch implementation of paper VQTTS: High-Fidelity... |
|
Emerging |
| 2475 |
deepkyu/ml-talking-face
Cloned repository from Hugging Face Spaces (CVPR 2022 Demo) |
|
Emerging |
| 2476 |
Pzc-Neo/vue-web-reader
城墨网页小说朗读 ( Novel read aloud on web. ) |
|
Emerging |
| 2477 |
blakkd/faster-whisper-hotkey
Effortless Push-to-Talk Transcription, Anywhere. |
|
Emerging |
| 2478 |
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a... |
|
Emerging |
| 2479 |
EvilFreelancer/docker-fish-speech-server
OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model. |
|
Emerging |
| 2480 |
keonlee9420/Stepwise_Monotonic_Multihead_Attention
PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to... |
|
Emerging |
| 2481 |
mmahdibarghi/finglish-dataset
Persian to Finglish dataset with all the sentences voice for TTS dataset... |
|
Emerging |
| 2482 |
aditya-joglekar/FS02_Scoring_Toolkit
Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks |
|
Emerging |
| 2483 |
brailcom/festival-freebsoft-utils
Festival extensions and utilities, focused on interaction with Speech Dispatcher |
|
Emerging |
| 2484 |
cyberboysumanjay/VoiceAssistant
Python Project |
|
Emerging |
| 2485 |
GeorgiosIoannouCoder/vera
Voice Emotion Recognition of Audio (VERA) is an open-source project created... |
|
Emerging |
| 2486 |
Arbazkhan4712/Speech-To-Text
A program that can convert Speech into Text using python |
|
Emerging |
| 2487 |
gowtham4545/Project
Sign2Sound is dedicated to revolutionizing communication for non-verbal... |
|
Emerging |
| 2488 |
soheil-mp/Speech-Recognition
End-to-End Speech Recognition using Neural Networks. |
|
Emerging |
| 2489 |
keenresearch/keenasr-swift-poc
Proof-of-concept app that showcases use of KeenASR SDK in a Swift app. WE... |
|
Emerging |
| 2490 |
buddyeorl/deep-talk
Deep-speech react app to test trained models,to visualize the speech to text... |
|
Emerging |
| 2491 |
KilianB/GoogleTranslatorTTS
Converts a string of text to mp3 files utilizing the google translator text... |
|
Emerging |
| 2492 |
stgloorious/stm32-speech-recognition
Speech Recognition using STM32 and Machine Learning |
|
Emerging |
| 2493 |
slp-rl/HebTTS
The official implementation of "A Language Modeling Approach to... |
|
Emerging |
| 2494 |
rishiskhare/parrot
A free, offline, private AI text-to-speech desktop app built on Rust 🦜 |
|
Emerging |
| 2495 |
tiansztiansz/voice-assistant
重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。 |
|
Emerging |
| 2496 |
SynHub/syn-speech-samples
An application that demostrate the usage of Syn.Speech library for Speech Recognition |
|
Emerging |
| 2497 |
c99koder/AudioClassifier-MQTT
Use the yamnet TensorFlow model to classify live audio from a microphone and... |
|
Emerging |
| 2498 |
grammatek/simaromur
Icelandic TTS (text-to-speech) service for Android |
|
Emerging |
| 2499 |
tasmirz/EyeWear
Eyewear with OCR and live WebRTC based calling for the visually impaired.... |
|
Emerging |
| 2500 |
veralvx/xtts-finetune
XTTS fine-tuning via CLI |
|
Emerging |