All Voice AI Tools
8,165 tools ranked by quality score · Page 28 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2701 | JhaAyush01/Multimodal-AI-Assistant<br>Multimodal AI Assistant with Google Gemini-1.5-pro, gTTS, PIL, and... | | Emerging |
| 2702 | jindongwang/EasyEspnet<br>Making Espnet easier to use | | Emerging |
| 2703 | aks-devs/mod_piper_tts<br>Freeswitch Text-to-Speech module | | Emerging |
| 2704 | mvanzulli/MeetingAssistant.py<br>A locally deployable version of an AI meeting assistant | | Emerging |
| 2705 | b7s/whisper-php<br>State-of-the-art speech recognition to your PHP/Laravel applications | | Emerging |
| 2706 | KickerMix/Discord-Local-LLM-VoiceChat-Bot<br>Saya Voice Assistant for Discord AI voice bot: listens, detects keywords,... | | Emerging |
| 2707 | hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker<br>Multi-Speaker FastSpeech2 applicable to Korean. Description about train and... | | Emerging |
| 2708 | TheProfessorsLab/Oracle-VocalAI-Interface-DISCONTINUED<br>A custom version of J.A.R.V.I.S. made to be my personal digital assistant... | | Emerging |
| 2709 | MatheusKProt/SpeechToText<br>This Telegram bot's main function is to transform your audio... | | Emerging |
| 2710 | MysteryPancake/Discord-Lyrebird<br>[DEPRECATED] Text to speech Discord bot using the Lyrebird API | | Emerging |
| 2711 | vroomai/vst<br>🎹 Generate sounds from words. Directly in your DAW. | | Emerging |
| 2712 | TBETool/ibm-watson-tts-php<br>IBM Watson Text to Speech PHP Library to convert written text into... | | Emerging |
| 2713 | kapi2800/qwen3-tts-mac<br>Optimized implementation of Qwen3-TTS for Apple Silicon (M1-M4) | | Emerging |
| 2714 | TSG405/Automated-Email--BOT<br>This Bot can send emails to anyone, any number of times from a USER's... | | Emerging |
| 2715 | t0mer/ttsbot<br>ttsbot is a Telepot-powered, easy-to-use Telegram bot allowing you to convert... | | Emerging |
| 2716 | fvarrui/PowerPointToVideo<br>:clapper: PowerPoint to MP4 converter with synthesized interlocutor voice. | | Emerging |
| 2717 | teamtee/LLM-ASR-Error-Correction<br>This is a framework for using large language models to improve ASR... | | Emerging |
| 2718 | EvilFreelancer/docker-canary-serve<br>Canary-Serve is a FastAPI server with Docker support that provides an HTTP... | | Emerging |
| 2719 | madzadev/voice-cue<br>📣 Find sentiments, tags, entities, and actions in your voice recordings instantly | | Emerging |
| 2720 | Workplace-Futurists/DiScribe<br>An automated meeting transcriber which autonomously connects to scheduled... | | Emerging |
| 2721 | surfaceyu/edge-tts-go<br>Use Microsoft Edge's online text-to-speech service from golang WITHOUT... | | Emerging |
| 2722 | hannabdul/ldasr<br>Official repo for the paper "LDASR: An Experimental Study on Layer Drop... | | Emerging |
| 2723 | bauyrzhanospan/VirtualAssistant<br>Virtual Assistant project done at Middlesex University with Dr. Nawaz... | | Emerging |
| 2724 | Rubiksman78/RenAI-Chat<br>VN Like Interface for Chatbots | | Emerging |
| 2725 | koudounasalkis/Audio-Speech-Tutorial<br>This repository contains a short introduction on the topic of audio and... | | Emerging |
| 2726 | egorsmkv/asr-corpus-creator<br>This app is intended to automatically create a corpus for ASR systems using... | | Emerging |
| 2727 | DannyBen/voicemaker<br>Create Text to Speech files with the Voicemaker API from Ruby or the command line | | Emerging |
| 2728 | Acelogic/Retrieval-based-Voice-Conversion-MLX<br>A pure MLX implementation of RVC for Apple Silicon, delivering 8.71x faster... | | Emerging |
| 2729 | AI-TOOLKIT/VoiceData<br>Automatic Speech Recognition (ASR) Data Generator Toolkit | | Emerging |
| 2730 | Tech-Cravers/Gesture-Speech<br>To develop an application which could be used by specially abled people to... | | Emerging |
| 2731 | Yashkapure06/TextToSpeech-ChromeExtension<br>Text To Speech - Chrome Extension | | Emerging |
| 2732 | arnobt78/In-Browser-ML-Speech-Transcription-Translation--NextJS-Frontend<br>An open-source, educational app for speech-to-text & text translation that... | | Emerging |
| 2733 | Lev-etd/Multimodal-emotion-recognition<br>Audio-Visual Group Emotion Recognition in the wild using cross-modal attention | | Emerging |
| 2734 | Ordyns/TextToSpeech-TikTokAPI<br>Small program that uses the TikTok API to convert text to speech | | Emerging |
| 2735 | rn0x/TelegramWhisperer<br>A Telegram bot that converts audio to text using the Whisper model, with improvements... | | Emerging |
| 2736 | lord-lethris/ComfyUI-lethris-dia2<br>ComfyUI custom nodes for the Dia2 TTS model: generate speech, timestamps,... | | Emerging |
| 2737 | swarms/mozilla-common-voice<br>Swarms supports the Common Voice Project from Mozilla! This repo contains... | | Emerging |
| 2738 | PranavMishra17/VoicePersona-Dataset<br>A comprehensive voice persona dataset for character consistency in voice... | | Emerging |
| 2739 | amritsinghcse/Say-Hi<br>This Android app pronounces a word in different languages using TTS and... | | Emerging |
| 2740 | zozonteq/yomiage-bot<br>Text-to-speech Discord bot with RVC support | | Emerging |
| 2741 | speechly/slu-client<br>Interact with Speechly SLU API from the command line | | Emerging |
| 2742 | ShawnPi233/SynParaSpeech<br>Official Repository of Paper: "SynParaSpeech: Automated Synthesis of... | | Emerging |
| 2743 | Gust4voSales/Marvin-VirtualAssistent<br>A dynamic virtual assistant made with Python, you can easily add more voice... | | Emerging |
| 2744 | gheyret/uyghur-asr-ctc<br>Speech Recognition for Uyghur using deep learning | | Emerging |
| 2745 | tsukumijima/TarakoTalk<br>Cross-platform CLI TTS Tools for Hiroyuki's Voice | | Emerging |
| 2746 | ALERTua/styletts2-ukrainian-openai-tts-api<br>OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline | | Emerging |
| 2747 | habitual69/speakify<br>Speakify is a web application that uses Edge TTS to convert text to speech... | | Emerging |
| 2748 | Foxify52/RVG_tts<br>A retrieval based voice generation text to speech | | Emerging |
| 2749 | RapDoodle/Web-Real-Time-Speech-Recognition-with-Azure<br>An example project that provides a web interface to real-time speech-to-text... | | Emerging |
| 2750 | DeepSwissVoice/DeepVoice<br>A TensorFlow implementation of Baidu's DeepSpeech architecture | | Emerging |
| 2751 | kaiidams/Voice100Sharp<br>Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost... | | Emerging |
| 2752 | Shyguy99/Whatsapp-bot<br>A simple WhatsApp Bot made using open-wa library with some additional features. | | Emerging |
| 2753 | kromme/Teams-Notetaker<br>Let AI create the notes of your Teams Meeting | | Emerging |
| 2754 | Unicorn-Commander/Unicorn-Orator<br>🦄 Text-to-Speech offloaded to iGPU and/or NPU | | Emerging |
| 2755 | JesusGautamah/chatgpt_assistant<br>ChatGPT Virtual Assistant to Telegram and Discord with Voice Recognition | | Emerging |
| 2756 | EX3exp/MiriVoice<br>Open-Free TTS Platform For All | | Emerging |
| 2757 | arunk140/serve-piper-tts<br>Go Lang API Wrapper around Piper TTS - Supports TTS Inference and List of Voices | | Emerging |
| 2758 | Leapward-Koex/Namida-OCR<br>A purely browser-based OCR tool designed for recognizing, copying, and... | | Emerging |
| 2759 | amanda-emerick/guess-the-animal<br>:monkey_face: Guess the Animal :frog: is a didactic game developed for... | | Emerging |
| 2760 | speechpro/speechpro-cloud-asr-examples<br>Examples of using the beta version of the gRPC API for streaming speech recognition in the ЦРТ (Speech Technology Center) Cloud | | Emerging |
| 2761 | Jor02/DectalkNET<br>Use the Dectalk voice synthesizer directly in .NET applications | | Emerging |
| 2762 | Syduan0921/Muliti-Role_Cosyvoice2<br>🤖 One-click deployment; uses TTS and an LLM to convert long-form novels into multi-character audio/video. | | Emerging |
| 2763 | Cabbagito/Fine-Tuning-Whisper-on-LibriSpeech<br>The code for fine-tuning OpenAI's Whisper model on the LibriSpeech dataset. | | Emerging |
| 2764 | codejs-kr/stt.js<br>Speech To Text library for browser 🎤 | | Emerging |
| 2765 | arthurfortes/speech2text_keras<br>This repository reports how to build a speech to text model to recognize... | | Emerging |
| 2766 | shinchanat/Py<br>Pyreader is a python project created for reading pdf and text files by applying tts. | | Emerging |
| 2767 | mush42/leanspeech<br>Unofficial pytorch implementation of LeanSpeech: The Microsoft Lightweight... | | Emerging |
| 2768 | mathquis/node-picotts<br>SVOX PicoTTS binding for Node.js | | Emerging |
| 2769 | biaji/kokoro-tts<br>Android TTS engine based on Kokoro | | Emerging |
| 2770 | osteele/speech-provider<br>A unified TypeScript interface for browser speech synthesis and Eleven Labs... | | Emerging |
| 2771 | zhongyuchen/speech-classification<br>CNN and VGG speech classification with interactive website for testing | | Emerging |
| 2772 | ArthurBabkin/Parimate<br>A Telegram bot for validating audio and video content using CV models, SR... | | Emerging |
| 2773 | Anwarvic/Arabic-Speech-Recognition<br>This repository contains my attempt to use two famous speech recognition... | | Emerging |
| 2774 | Deimos-M/DL-Virtual-Assistant<br>It is a virtual assistant for the visually impaired which includes models like... | | Emerging |
| 2775 | arthurxlw/cytonNss<br>Cyton Online Neural Sentence Segmentation for Simultaneous Interpretation | | Emerging |
| 2776 | KiLJ4EdeN/Persian_Speech_To_Text<br>Simple speech-to-text prototype using the Google API | | Emerging |
| 2777 | manascb1344/zonos-api<br>Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration,... | | Emerging |
| 2778 | egorsmkv/speech-recognition-uk<br>🇺🇦 Speech Recognition & Synthesis for Ukrainian | | Emerging |
| 2779 | cyrta/broadcast-news-videos-dataset<br>Collection of broadcast news video clips | | Emerging |
| 2780 | MahtaFetrat/VirgoolInformal-Speech-Dataset<br>A dataset of informal Persian audio and text chunks, along with a fully open... | | Emerging |
| 2781 | debelopumento/phaser-test<br>A voice controlled runner game for Chrome | | Emerging |
| 2782 | IbrokhimN/IJAI<br>IJAI is a modular AI assistant that supports text and voice interactions... | | Emerging |
| 2783 | KennethanCeyer/awesome-audio-speech<br>Awesome list of Audio, Speech, and DSP (Digital signal processing) | | Emerging |
| 2784 | ibelgin/Text-To-Speech-App<br>This App is Made Using React Native. | | Emerging |
| 2785 | ShawnPi233/HQ-SVC<br>Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice... | | Emerging |
| 2786 | consulfedor/VoiceGrab<br>🎙️ Voice-to-Text Bridge for AI & Any Application. Record voice → Get text →... | | Emerging |
| 2787 | bougieL/tts-fluent<br>Text to speech | | Emerging |
| 2788 | DavidBradbury/tts-assistant<br>TTS Assistant: A front-end app utilizing OpenAI's TTS API. Easily input text... | | Emerging |
| 2789 | Baibhav-nag/SER-using-MLP-and-CNN<br>Speech emotion recognition using MLP and CNN on four benchmark datasets... | | Emerging |
| 2790 | csikasote/bembaspeech-exps<br>Bemba ASR model obtained by fine-tuning a well performing DeepSpeech English... | | Emerging |
| 2791 | karkranikhil/voice-notes<br>Voice Note taking app using Svelte. | | Emerging |
| 2792 | n0an/VivaDicta<br>Voice Transcription, Reimagined | | Emerging |
| 2793 | korniichuk/google-speech<br>QuickStart. Google Cloud Speech-to-Text API with Python | | Emerging |
| 2794 | helemanc/ambient-intelligence<br>Application for Disruptive Situations Detection in public transports through... | | Emerging |
| 2795 | isthistechsupport/tts_for_discord<br>Using Discord.py and the Azure Cognitive Services Python SDK to bring Azure... | | Emerging |
| 2796 | nilakshdas/ADAGIO<br>Adversarial Defense for Audio in a Gadget with Interactive Operations | | Emerging |
| 2797 | daymade/chattts-seed-example<br>A ChatTTS audio repository containing different voice timbres generated with different seeds, so you can easily pick the seed you like. | | Emerging |
| 2798 | SharkyRawr/go-tiktok-tts<br>Go library for TikTok's Text2Speech engine | | Emerging |
| 2799 | othneildrew/open-whisperer<br>AI Video Translator and Subtitler | | Emerging |
| 2800 | jarmitage/tts-cli<br>Simple CLI app for TTS | | Emerging |