All Voice AI Tools
8,165 tools ranked by quality score · Page 29 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 2801 | LiaTemplates/Speech-Recognition-Quiz · Create quizzes that check spoken text | | Emerging |
| 2802 | ScottishFold007/TTSAudioNormalizer · TTSAudioNormalizer is a specialized tool for TTS data production,... | | Emerging |
| 2803 | Medvedu/Yandex-Speech-API · Text-to-speech translation. Supports the following languages: English, Turkish,... | | Emerging |
| 2804 | madushan1000/voxcpm_rs · Rust (using burn) implementation of VoxCPM | | Emerging |
| 2805 | ThaaoBlues/Blue · An open source vocal assistant for Windows and Linux. Made to be upgraded... | | Emerging |
| 2806 | EnjiRouz/Habr-Reader-Extension · A simple reader extension for Chrome/Opera that lets you play back... | | Emerging |
| 2807 | SnappsiSnappes/Jarvis-free-bingGPT-voice-assistant · Voice assistant: chat with bingGPT / Bard (in Russian) / ChatGPT 3.5 for... | | Emerging |
| 2808 | kubo/ruby-flite · A small speech synthesis library for Ruby using CMU Flite (http://cmuflite.org) | | Emerging |
| 2809 | Issac-Moses/liebea · AI voice-activated girlfriend assistant with wake word detection, speech... | | Emerging |
| 2810 | Sgvkamalakar/Azure_AI_Speech_Services · This repository contains a Streamlit-based application that leverages Azure... | | Emerging |
| 2811 | JN513/Ana · Assistant built in Python using Speech_recognition and Google APIs | | Emerging |
| 2812 | Snesnopic/Morser · SwiftUI recreation of my UIKit Morse Code experiment | | Emerging |
| 2813 | sipeter/CloneTTS · A lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless... | | Emerging |
| 2814 | jianchang512/kokoro-uiapi · Web UI and OpenAI-compatible API for kokoro TTS | | Emerging |
| 2815 | 152334H/CTN-webapp · Refactored ControllableTalkNet with Flask/uwsgi | | Emerging |
| 2816 | ErnestAroozoo/GPT-Discord-Chatbot · Discord chatbot powered by OpenAI and ElevenLabs that enables natural and... | | Emerging |
| 2817 | turinaf/Sagalee · Automatic Speech Recognition Dataset for Oromo Language | | Emerging |
| 2818 | YizheZhang-Ervin/AI_FinTech · Artificial Intelligence (React+Flask RESTful+Sqlite+Antd+Echarts) | | Emerging |
| 2819 | gokhaneraslan/tacotron2-tts-training · Training Tacotron 2 Text-to-Speech (TTS) | | Emerging |
| 2820 | QuantiusBenignus/NoteWhispers · Voice memos recorded from the microphone, transcribed offline to text and... | | Emerging |
| 2821 | YChenL/DS-TDNN · Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic... | | Emerging |
| 2822 | super13/tensorflow-speech-recognition-pai · Speech recognition using TensorFlow on Aliyun PAI. | | Emerging |
| 2823 | DominicTWHV/LJSpeech_Dataset_Generator · LJSpeech dataset generator for TTS model training/fine tuning | | Emerging |
| 2824 | dsrivastavv/Android-Continuous-SpeechRecognition · Code to continuously detect spoken language and convert to text using Google... | | Emerging |
| 2825 | Aadv1k/reddit-tts-gui · A GUI to auto-generate TTS videos from Reddit posts and comments | | Emerging |
| 2826 | harshil748/VoiceAPI · A lightweight, multi-lingual Text-to-Speech system supporting 11 Indian... | | Emerging |
| 2827 | kaiidams/NeMoOnnxSharp · Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime... | | Emerging |
| 2828 | ShihabYasin/Isolated-Bengali-Word-and-Speaker-Recognition. · Isolated Bengali word and speaker recognition. | | Emerging |
| 2829 | royangkr/BabyReady · CNN to predict the reason why a baby is crying | | Emerging |
| 2830 | 6Morpheus6/alltalk-tts · [NVIDIA ONLY] AllTalk-TTS is a unified UI for F5-TTS, XTTS, Vite TTS, Piper... | | Emerging |
| 2831 | sera619/S4M-2.0 · German supported VoiceAssist without BigData | | Emerging |
| 2832 | pinkpixel-dev/comeback-ai · 🎤🔥 AI-powered clapback machine that transforms mean comments into witty... | | Emerging |
| 2833 | Goblincomet/digitaltwin · Using a single image and just 10 seconds of sample audio, our project... | | Emerging |
| 2834 | NICEElevateAI/ElevateAIPythonSDK · ElevateAI - Speech-to-text API Python SDK | | Emerging |
| 2835 | KinglittleQ/Tacotron · An implementation of Tacotron with PyTorch 0.4 | | Emerging |
| 2836 | rohanprichard/fastrtc-demo · A simple POC of FastRTC, a framework to use voice mode in Python! | | Emerging |
| 2837 | mazzasaverio/youtube-auto-dub · Automated voice dubbing for YouTube videos using Docker, OpenVoice, and... | | Emerging |
| 2838 | aiyu-ayaan/tts-engine · The TTS-Engine is a simple and efficient library that provides... | | Emerging |
| 2839 | JuJu2181/Automatic-Nepali-Speech-Recognition-and-Summarizer · A system capable of converting Nepali speech to text and generating a summary of the text | | Emerging |
| 2840 | yandex-cloud-examples/yc-speechkit-web-ui · SpeechKit Web UI Example | | Emerging |
| 2841 | guan-yuan/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion · A paper and project list about the cutting edge Speech Synthesis,... | | Emerging |
| 2842 | Ephrem-ETH/E2E-KWS · End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM | | Emerging |
| 2843 | R3tr0gh057/Celeste · A voice-activated desktop assistant and automation toolkit built with... | | Emerging |
| 2844 | chase-west/VocaSpanish · Python app using TTS and speech recognition to memorize Spanish vocabulary | | Emerging |
| 2845 | The-Swarm-Corporation/Voice-Agents · Voice-Agents is a production-ready Python library for building... | | Emerging |
| 2846 | SCRN-VRC/Voice-Recognition-Shader · Audio detection with visemes in a fragment shader | | Emerging |
| 2847 | TheVoxProject/calcvox · Accessible and open-source talking calculator for everyone. | | Emerging |
| 2848 | Miihir79/Messaging_app · This is an advanced messaging app which has smart log in options smart... | | Emerging |
| 2849 | Yangyangii/TPGST-Tacotron · Google's TPGST reimplementation. | | Emerging |
| 2850 | biyoml/End-to-End-Mandarin-ASR · End-to-end speech recognition on AISHELL dataset. | | Emerging |
| 2851 | EtienneAb3d/WhisperHallu · Experimental code: sound file preprocessing to optimize Whisper... | | Emerging |
| 2852 | Forne/ha-yandexcloudtts · Yandex.Cloud SpeechKit for Home Assistant | | Emerging |
| 2853 | ancs21/awesome-openai-whisper · A curated list of awesome resources for OpenAI's Whisper | | Emerging |
| 2854 | zzw922cn/LPC_for_TTS · Linear Prediction Coefficients estimation from mel-spectrogram implemented... | | Emerging |
| 2855 | mrmanna/Nvidia_Nemo_FastPitch_TTS_Example · How to Build a High-Quality Text-to-Speech (TTS) System Locally with Nvidia... | | Emerging |
| 2856 | TranHuuDat2004/tts-flask-app · Text-to-Speech Generator Powered by Python, Flask, and Piper TTS | | Emerging |
| 2857 | Chelsea486MHz/debat-politique-ia · Automatic generation of political debates by AI. Audio + video. | | Emerging |
| 2858 | Wookie-VUI/Wokiee · Cross-platform Voice User Interface for your Desktop | | Emerging |
| 2859 | birros/pico2wave.js · JS port of pico2wave (Emscripten) | | Emerging |
| 2860 | csikasote/bigc · This repository contains the data resources for the LacunaFund supported... | | Emerging |
| 2861 | botbahlul/VOSK-Powered-LIVE-SUBTITLE-V2 · ANDROID APP that can RECOGNIZE LIVE AUDIO/VIDEO STREAMING (using free VOSK... | | Emerging |
| 2862 | arpabot/ohno-bot · Discord Japanese text-to-speech bot | | Emerging |
| 2863 | shreyasnisal/VoiceQuiz-v2 · Version 2 of the quiz-app; this is the repository for the voice-based quiz.... | | Emerging |
| 2864 | snaraya7/Ok_Eclipse · CSC 510 Software Engineering (Spring 2018) project - Group 'O' | | Emerging |
| 2865 | muqadasejaz/Text-to-Speech-Converter- · A simple Python project that converts text into speech using different... | | Emerging |
| 2866 | kaiidams/voice100 · Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost... | | Emerging |
| 2867 | louischen737/PodCast-Master · AI-powered podcast generation tool with line-level script editing and multi-voice text-to-speech synthesis | | Emerging |
| 2868 | thewh1teagle/israwave · Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet | | Emerging |
| 2869 | CoffreLv/ASR_CNN_CTC · Build a CNN+CTC-based speech recognition system from scratch. | | Emerging |
| 2870 | ace19-dev/tensorflow-speech-recognition-challenge · Kaggle Competitions: TensorFlow Speech Recognition Challenge | | Emerging |
| 2871 | aloproducao/Live-captions-for-broadcast · The Real-Time Speech Recognition System is an innovative tool designed to... | | Emerging |
| 2872 | akukerang/StudySurfer · Subway Surfer TikTok Study Tool | | Emerging |
| 2873 | pranayjoshi/speech_to_text · This is a speech_to_text script by Pranay Joshi | | Emerging |
| 2874 | ye-kyaw-thu/myG2P · Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary... | | Emerging |
| 2875 | rock3125/tts · Simple text to speech server in docker using coqui-ai/TTS | | Emerging |
| 2876 | sap1119/voice-agent-0.01 · A self-hosted, AI-powered voice assistant system with real-time voice... | | Emerging |
| 2877 | ckaznable/yt-cli-live · YouTube Text Live Streaming in CLI | | Emerging |
| 2878 | pnkvalavala/multivoice · Multivoice: Enhance your foreign-language movie and TV show experience with... | | Emerging |
| 2879 | siva-sub/NekoTTS · 🔊 Local Text-to-Speech service for Android with system-wide integration.... | | Emerging |
| 2880 | NullEnt1ty/GCloudSpeech · Transcribe voice data to text using Google Cloud Speech-to-Text | | Emerging |
| 2881 | FragJage/PicoVoiceCpp · PicoVoiceCpp is a simple TTS (text to speech) class based on picovoice (svox). | | Emerging |
| 2882 | dgnsrekt/Discorgeous · Discord + GTTS = a Discord bot that sends Google text to speech voice... | | Emerging |
| 2883 | AntoBrandi/Robotics-and-ROS-Learn-by-Doing-Manipulators · 3D Printed robot arm powered by ROS and Arduino and controlled via MoveIt!... | | Emerging |
| 2884 | TeaPoly/CTC-OptimizedLoss · Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge... | | Emerging |
| 2885 | abumubaarak/Wellbeing-Doctor · Doctor management app | | Emerging |
| 2886 | 1038lab/ComfyUI-FireRedTTS · A ComfyUI integration for FireRedTTS‑2, a real-time multi-speaker TTS system... | | Emerging |
| 2887 | alfianlosari/flutter_cloud_text_to_speech · Flutter project that uses the Google Cloud Text to Speech API to synthesize... | | Emerging |
| 2888 | sberdevices/smartspeech · SmartSpeech is a service for speech synthesis and recognition | | Emerging |
| 2889 | ArkS0001/IIT-Bombay-Whisper-Hindi-ASR-Model-Machine-Learning-Intern · Whisper is an automatic speech recognition (ASR) system trained on 680,000... | | Emerging |
| 2890 | atahanuz/yt2text · Extract text from a YouTube video in a single command, using OpenAI's... | | Emerging |
| 2891 | linagora-labs/asr_benchmark · Toolkit to benchmark various speech recognition APIs (NeMo, Whisper...) and... | | Emerging |
| 2892 | whiteSHADOW1234/WhisperTranscriber · 🎙️ Effortlessly transcribe YouTube videos, MP4, and MP3 files to text using... | | Emerging |
| 2893 | Cosmos-Break/asr · Shanghainese (Shanghai dialect) ASR (speech recognition) model | | Emerging |
| 2894 | SALT-Research/SHALLOW · SHALLOW, the first hallucination benchmark for ASR models | | Emerging |
| 2895 | ZoraizQ/urdu-speech-recognition · Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs... | | Emerging |
| 2896 | yuvraj108c/ComfyUI-PiperTTS · ComfyUI Piper TTS Custom Node | | Emerging |
| 2897 | praweshd/speech_emotion_recognition · In this project, the performance of speech emotion recognition is compared... | | Emerging |
| 2898 | srvk/srvk-eesen-offline-transcriber · Top level code to transcribe English audio/video files into text/subtitles | | Emerging |
| 2899 | slayerrr12/WaveSlayer · AI chatbot that uses speech to operate and respond | | Emerging |
| 2900 | SladkyCitron/gotau · Work-in-progress UTAU-compatible singing voice synthesizer, written in Go | | Emerging |