All Voice AI Tools
8,165 tools ranked by quality score · Page 55 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 5401 |
nvjob/rain-voice-note
Rain Voice Note (Speech To Text). CW Frame App. JavaScript. |
|
Experimental |
| 5402 |
kdorichev/text2speech
Text-To-Speech Dataset Preparation and Architecture |
|
Experimental |
| 5403 |
Murlors/VITS_Japanese
VITS implementation of Japanese |
|
Experimental |
| 5404 |
AlinaBaber/Arabic-Speech-Recognition-by-Machine-learning-and-feature-extraction
This project implements an Arabic Speech Recognition system using an... |
|
Experimental |
| 5405 |
erendogan6/Translateify
An interactive English learning app with personalized daily word... |
|
Experimental |
| 5406 |
ruslanmv/VRSecretary
VRSecretary is a production-ready reference implementation for building... |
|
Experimental |
| 5407 |
jeswanthmukesh20/VocalText-Contrastive-Embedding
This repository features a CLIP-inspired contrastive model that aligns audio... |
|
Experimental |
| 5408 |
Obstacleee/StreamVoice
Ce projet permet de convertir des flux RSS en format audio, offrant ainsi la... |
|
Experimental |
| 5409 |
privapps/TTS-Parakeet
an easy to use English Text To Speech tool |
|
Experimental |
| 5410 |
theolepage/wavlm_ssl_sv
SOTA method for self-supervised speaker verification leveraging a... |
|
Experimental |
| 5411 |
Aqib121201/YOLO-R-CNN-Vision-Assistant-for-Visually-Impaired-Navigation
Edge-deployed assistive vision system with object detection + audio for... |
|
Experimental |
| 5412 |
eLearningHub/text2talk
Making training videos |
|
Experimental |
| 5413 |
lsa-pucrs-old/donnie-assistive-robot-sw
Donnie's software (Arduino Firmware, Player Drivers, Stage simulation... |
|
Experimental |
| 5414 |
d4rkmen/flatsphere
Clock with TTS for WaveShare ESP32-S3 Touch LCD 1.85" |
|
Experimental |
| 5415 |
volltin/xiaodou-bot
A simple voice-to-voice chatbot. |
|
Experimental |
| 5416 |
InuInu2022/LibSasara
The utility library for CeVIO project file (.ccs / .ccst) and timing label... |
|
Experimental |
| 5417 |
Soumo-git-hub/AI-News-Aggregator
An intelligent news aggregator (Python/JS) using spaCy for NLP topic... |
|
Experimental |
| 5418 |
arshc0der/Javscript-Mini-Projects
🧩 JavaScript Mini Projects – Beginner-Friendly Practice Projects This... |
|
Experimental |
| 5419 |
Nazmul0005/Personal_Voice_Assistant_Mili
Mili is a smart voice assistant built with Python and Google Gemini AI. It... |
|
Experimental |
| 5420 |
dreamerc/twitch-tts
Twitch Text-To-Speech Tool |
|
Experimental |
| 5421 |
dfgHiatus/NeosVoiceRecognition
Speech to text for NeosVR |
|
Experimental |
| 5422 |
NOime22/Web-listen
🎧 AI语音朗读助手 - Chrome浏览器扩展,支持划词朗读和截图OCR朗读 |
|
Experimental |
| 5423 |
chicogong/ffvoice-engine
🎙️ 高性能 C++ 语音引擎 - 实时音频处理 + AI 语音识别 + 边录边转写 | High-performance C++ voice... |
|
Experimental |
| 5424 |
Nostalgiaaa/CyberClone
快速构建数字仿生人并存储在 Relic ( PC ) 中 |
|
Experimental |
| 5425 |
petitwhito/Speech_to_text_project
Complete Speech-to-Text pipeline: from-scratch architectures (MLP, CNN, RNN,... |
|
Experimental |
| 5426 |
rodrigues-aline/wav2vec2_interpretation
Investigating wav2vec2 context representations and the effects of fine-tuning |
|
Experimental |
| 5427 |
mostafabahaa25/mediguide_MVP
AI-powered accessibility app that helps blind and low-vision users manage... |
|
Experimental |
| 5428 |
BillDuke13/cosyvoice-ray-serve-api
This project provides a Ray Serve-based HTTP API wrapper around CosyVoice, a... |
|
Experimental |
| 5429 |
danilop/easy-sonic
A simple, high-level Python SDK for Amazon Nova 2 Sonic speech-to-speech... |
|
Experimental |
| 5430 |
emre-guler/jarvis
A sophisticated AI-powered personal assistant inspired by Iron Man's JARVIS,... |
|
Experimental |
| 5431 |
thedigitalchief/voice-command-assistant
Powerful assistant performing powerful automated tasks from user’s voice... |
|
Experimental |
| 5432 |
jshperalta/ai-englishTutor
Artificial Intelligence English Tutor |
|
Experimental |
| 5433 |
alessandropec/data_driven_ai_voice_cloning
This repository contain the code of the main part of my master thesis degree... |
|
Experimental |
| 5434 |
Arnav3241/WebSpeechRecognition
v0.1.4 released: A Python library for speech-to-text integration using... |
|
Experimental |
| 5435 |
will-rice/diffwave
TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for... |
|
Experimental |
| 5436 |
anikashawarma/Silent-Voice-Lip-Reader
This is an AI enhanced lip reading application based on real-world videos... |
|
Experimental |
| 5437 |
webKing021/VoiceFlow-An-Automatic-NLP-Transcriber
VoiceFlow is a Windows push-to-talk voice-to-text application that... |
|
Experimental |
| 5438 |
LENSS/EMSAssist
This is the official artifact for EMSAssist paper on MobiSys'23. EMSAssist:... |
|
Experimental |
| 5439 |
elllusion/calibre
为linux发行版的Calibre添加Edge TTS | Add Edge TTS for calibre of linux |
|
Experimental |
| 5440 |
FioPio/pepper-language-grounding-system
This repo contains the implelemtation for a simple language grounding in... |
|
Experimental |
| 5441 |
lucylow/Yeezy-Taught-Me
Yeezy Taught Me Text Generation. Training next character predictions RNN... |
|
Experimental |
| 5442 |
giribabu22/assistant-Nikki-python
i developed this assistant using speech-recognition, selenium,... |
|
Experimental |
| 5443 |
ragibson/MFCC-speech-recognition
Real-time speech recognition via "Mel-Frequency Cepstral Coefficients"... |
|
Experimental |
| 5444 |
criadacasa/podcastfy-saas
SaaS platform for generating AI podcasts from multimodal content - Built... |
|
Experimental |
| 5445 |
lhg96/stt-demo-korean
Korean Speech-to-Text app with Whisper & Vosk | 한국어 음성인식 데모 애플리케이션 |
|
Experimental |
| 5446 |
Oldes/Rebol-Speak
Rebol text-to-speech extension |
|
Experimental |
| 5447 |
awaseem/2day-api
Transform your writing into engaging AI-generated podcasts. Ditch the mics... |
|
Experimental |
| 5448 |
agent-whisper/grpc-whisper
gRPC server for OpenAI's Whisper Models |
|
Experimental |
| 5449 |
daisy/tobi
Tobi is a free, open source, multimedia book production authoring tool for... |
|
Experimental |
| 5450 |
ayzem88/text-to-speech-converter
أداة لتحويل النصوص العربية إلى ملفات صوتية باستخدام OpenAI TTS / Tool for... |
|
Experimental |
| 5451 |
cvcwebsolutions/vibe-local
Local voice-to-text with AI-powered text cleanup. Privacy-focused... |
|
Experimental |
| 5452 |
ugyenn-tsheringg/Image-Captioning-System-for-Visually-Impaired-Individals-using-CNN-LSTM-VQA-TTS
Developed a web-based image captioning system that evaluates feature... |
|
Experimental |
| 5453 |
Pendrokar/xVA-Synth-HFSpace
HuggingFace Space for xVASynth |
|
Experimental |
| 5454 |
dibasdauliya/better-speech-recognition
An improved speech recognition library with TypeScript support |
|
Experimental |
| 5455 |
Sxriptor/Whispra-Download
Whispra's Offical Download | AI-powered real-time voice and subtitle... |
|
Experimental |
| 5456 |
ShadowLp174/stt-example-bot
A basic discord bot but with voice commands |
|
Experimental |
| 5457 |
Entity047/Voice_AI_Creator
Python TTS and voice cloning framework for educational AI/ML demonstrations. |
|
Experimental |
| 5458 |
Sang-Buster/AeroLex-Editor
A powerful web-based editor for transcription and subtitle files with... |
|
Experimental |
| 5459 |
sanastasiou/dictation-service
GPU-accelerated speech-to-text service that types what you say, powered by... |
|
Experimental |
| 5460 |
neosun100/llasa-tts-8b-webui
🎙️ High-quality Text-to-Speech system based on Llasa-8B with intelligent GPU... |
|
Experimental |
| 5461 |
hd996/material-local
🎬 素材本地化 |
|
Experimental |
| 5462 |
alpereee/SpeakerRecognition
🎙️ Makine öğrenmesi ile konuşmacı tanıma, sesten duygu analizi ve metne... |
|
Experimental |
| 5463 |
LucaBallan/wikipedia-aloud-reader
Read aloud wikipedia pages |
|
Experimental |
| 5464 |
gregunger-microsoft/Jarvis
AI-powered Microsoft Teams meeting assistant with voice interaction,... |
|
Experimental |
| 5465 |
wq2012/mdeval
Python implementation of the NIST md-eval.pl script for evaluating rich... |
|
Experimental |
| 5466 |
kolonist/edgetts
Use free Microsoft Edge's online text-to-speech service from golang |
|
Experimental |
| 5467 |
SirCryptic/cli-sms
use clicksend to send either sms or text to speech to a phone number via the... |
|
Experimental |
| 5468 |
Praneeth-Gandodi/Tars
TARS is a voice AI assistant that listens to your voice and responds in... |
|
Experimental |
| 5469 |
gikonyob/speake
Speake library provides a wrapper around Espeak to easily write efficient... |
|
Experimental |
| 5470 |
amirmohammadraei/cloud-services
Familiarity with some cloud services |
|
Experimental |
| 5471 |
RiteshGenAI/openai_whisper_transcribe_yt_videos
This project is a Streamlit-based application that allows users to download... |
|
Experimental |
| 5472 |
mastashake08/OCRTTS
Javascript package that uses the TextDetector API and Speech Synthesis to... |
|
Experimental |
| 5473 |
IG-onGit/TexeT
TexeT is the tool you need to take your interaction and content control to... |
|
Experimental |
| 5474 |
techiaith/docker-deepspeech-cy
Hyfforddi modelau adnabod lleferydd Cymraeg gyda Mozilla DeepSpeech // Train... |
|
Experimental |
| 5475 |
juancarlospaco/nim-espeak
Nim Espeak NG wrapper, for super easy Voice and Text-To-Speech |
|
Experimental |
| 5476 |
CSroseX/PizzAI-EmbeddableAI-Project
Experiments with building an AI-powered web app using Flask, integrating... |
|
Experimental |
| 5477 |
sap1119/voice_agent_0.02
An open‑source voice AI platform for building real‑time, scalable, and... |
|
Experimental |
| 5478 |
neosun100/orpheus-tts-docker
Production-ready Docker deployment for Orpheus TTS with GPU management,... |
|
Experimental |
| 5479 |
ArpitaChatterjee/Covid-19-Tracker-with-VoiceAssistant
Built a Covid-19 tracker in python, where data of total no. of cases, total... |
|
Experimental |
| 5480 |
matthiaaas/otto-assistant
Voice assistant called "Otto" |
|
Experimental |
| 5481 |
egorsmkv/flashlight-ukrainian
The Ukrainian Acoustic Model for Flashlight |
|
Experimental |
| 5482 |
osandadeshan/MySight
Android application for blind community to read books, papers, shopping... |
|
Experimental |
| 5483 |
TTomas65/Text-to-Speech-with-AI
A simple web application that uses OpenAI's GPT-4o mini TTS (text-to-speech)... |
|
Experimental |
| 5484 |
neosun100/fish-speech
🐟 Advanced multilingual Text-to-Speech system with speaker management,... |
|
Experimental |
| 5485 |
Vagabond-K/Speechabler
루게릭병 환우의 목소리 프로젝트 |
|
Experimental |
| 5486 |
awesome-german/speaking
Resources and methods to improve spoken German, pronunciation, and real-life... |
|
Experimental |
| 5487 |
serkanyasr/vocavoice
AI-powered podcast generator for language learners. Creates custom scripts... |
|
Experimental |
| 5488 |
MohammadarefAhmadpoor/Speech-translation
Speech recognition, language detection, translation, and speech synthesis |
|
Experimental |
| 5489 |
sudarsan15/speech-sentiment-analyser
Speech Sentiment Analyser is a ML & AI based tool to help analyse the user... |
|
Experimental |
| 5490 |
lwdovico/zonos
Basic Zonos setup for seamless integration with multiple sentence inference tasks. |
|
Experimental |
| 5491 |
lukinhas-programando/ace-step-studio
🎵 Create and manage local-first AI-powered music with a fast, self-hosted... |
|
Experimental |
| 5492 |
kyegomez/AST
Implementation of AST from the paper: "AST: Audio Spectrogram Transformer'... |
|
Experimental |
| 5493 |
belambert/cl-asr
A (not entirely working) stand-alone speech recognizer written in Common Lisp |
|
Experimental |
| 5494 |
xAlpharax/whisper-stt-gradio
Gradio Interface for Transcription and Translation using the Whisper Large... |
|
Experimental |
| 5495 |
victorwoo/transcript-video
A PowerShell script that automatically generates subtitles in bulk for video... |
|
Experimental |
| 5496 |
darshkaushik/cough-it
Cough It is an android app that leverages deep learning and acoustics to... |
|
Experimental |
| 5497 |
dbry/skipper
Detection and selective purging of talk or music in audio streams |
|
Experimental |
| 5498 |
NeuralForge6000/steve-voice-assistant
Secure voice assistant powered by OpenAI Whisper & Google Gemini AI.... |
|
Experimental |
| 5499 |
lzfelipe/discord-ai-tts-bot
Discord Bot that combines functionalities from Eleven Labs and OpenAI API. |
|
Experimental |
| 5500 |
kamtasingh27/minor
BAE - Being Assistant Eyes - An App for the Visually Impaired People with... |
|
Experimental |