All Voice AI Tools
8,165 tools ranked by quality score · Page 46 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 4501 |
satiseason/Chatbot-with-text-voice-chatting
Telegram bot is developed by AI techniques(Speech-to-Text, Text-to-Speech,... |
|
Experimental |
| 4502 |
ABashir88/enterprise-voice-ai-architectures
Reference architectures, cost models, and sales-engineering playbooks for... |
|
Experimental |
| 4503 |
MotivationalSpeechSynthesis/motivational-speech-synthesis
Artistic research deconstructing the performative excess of motivational... |
|
Experimental |
| 4504 |
abhijhacodes/PDF_to_AudioBook_converter
Python code that converts any pdf file into audiobook |
|
Experimental |
| 4505 |
toshalpatel/AudioSimilarity
When two audio files compared, the result is giving the similar part from... |
|
Experimental |
| 4506 |
Rishabh1925/voiceforge
AI-powered voice automation platform with text-to-speech and automated... |
|
Experimental |
| 4507 |
sky-flutter/Python-Jarvis
Voice-based assistant to make task automated |
|
Experimental |
| 4508 |
temp3rr0r/Longsword-Data-MQTT-Publisher
Working demo: https://www.youtube.com/watch?v=v7hvOyPQ0EM. The main IoT app.... |
|
Experimental |
| 4509 |
Iroha-P/MiniBox
Character voice chatbot with GPT-SoVITS TTS + LLM role-playing, supports Web... |
|
Experimental |
| 4510 |
Ryadel/ClawTalk
Chrome side panel extension (MV3) that connects to an OpenClaw Gateway and... |
|
Experimental |
| 4511 |
Jay113910/Speech-to-Text-Vosk
A real time speech recognition program using microphone based on "Vosk" - an... |
|
Experimental |
| 4512 |
vishudhiman/TEXT-N-SPEECH
Small project with the help of javascript and speech synthesis web API. |
|
Experimental |
| 4513 |
dangvansam/deepxi-flask-server
DeepXi with Flask Server |
|
Experimental |
| 4514 |
clarenceluo78/singer-adaptive-svc
This repository is the implementation of project Converting to Realistic... |
|
Experimental |
| 4515 |
flexhub77/piper-tts-call
🎙️ Generate high-quality audio from text in real-time with Piperin, the... |
|
Experimental |
| 4516 |
EGWeeks/translate_tts_api
AWS Translate & Text to Speech API Javascript Example |
|
Experimental |
| 4517 |
brailcom/speechd-java
Java client library for Speech Dispatcher |
|
Experimental |
| 4518 |
woofie/woof
AR Unity virtual pet app that recognises voice commands, performs NLP on... |
|
Experimental |
| 4519 |
noErrdev/python-speech-ai-forge
Speech-AI-Forge is a project developed around TTS generation model,... |
|
Experimental |
| 4520 |
Herobrine25mcpe/text-to-speech_Tkinter
So this is a project in which I am working on a simple text to speech... |
|
Experimental |
| 4521 |
smsraj2001/PYEDIT-PRO-THE-ULTIMATE-ADVANCED-TEXT-EDITOR
An Advanced text editor in python with enhanced and amazing features |
|
Experimental |
| 4522 |
mllpresearch/ESO-dataset
ESO speech dataset: an English-language speech corpus of the oncology domain... |
|
Experimental |
| 4523 |
RamR3R/InterviewAuto
This is openAi powered interview site where the user can join and take in... |
|
Experimental |
| 4524 |
ELITA04/HackHealth2021
HelpVu: An AI-powered narration application for the visually impaired.... |
|
Experimental |
| 4525 |
UserJoo9/Noura-Assistant-Free
AI voice assistant for Windows with English/Arabic support. Control apps,... |
|
Experimental |
| 4526 |
Ani0202/Speech-Translation-with-Python
Translate your speech to many languages using Google Translate API |
|
Experimental |
| 4527 |
danielrosehill/Speech-To-Text-System-Prompt-Library
An updated skeleton library of system prompts for using LLMs to refine STT output |
|
Experimental |
| 4528 |
polterguy/magic-menu
An alternative input module for Phosphorus Five, allowing you to use natural... |
|
Experimental |
| 4529 |
Akash-Apturkar/Sentiment-Analysis-of-speech-using-NLP-with-Android-Connect-feature-and-web-scraping
We aim to develop a ‘Smart Speech Ecosystem’ that takes audio input,... |
|
Experimental |
| 4530 |
wujunwei928/go-zero-tts
基于微软edge大声朗读接口开发的语音合成服务, 后端 go-zero, 前端 vuetify |
|
Experimental |
| 4531 |
limbang/text-to-speech
基于 Azure 文本转语音 |
|
Experimental |
| 4532 |
NeptuneHub/AudioMuse-AI-DCLAP
AudioMuse-AI-DCLAP is a lightweight, high-speed distilled version of LAION... |
|
Experimental |
| 4533 |
thewh1teagle/heb-piper-tts-gemma-g2p-onnx
Text to speech with Hebrew G2P and TTS models based on Piper/Gemma3 |
|
Experimental |
| 4534 |
mklement0/voices
macOS CLI for changing the default TTS (text-to-speech) voice and printing... |
|
Experimental |
| 4535 |
tfm000/diana
Locally hosted Text-to-Speech Document Converter |
|
Experimental |
| 4536 |
vpdl-sys/vpdl-public
Proprietary AI Voice Script Writer for turning written text into natural,... |
|
Experimental |
| 4537 |
synesthesiam/pt-synesthesiam
CMU Sphinx acoustic model for Portugese (pt-br) |
|
Experimental |
| 4538 |
sovse/base_rus_whisper_stt
Fine tuning of the base model from OpenAI Whisper in Russian language on the... |
|
Experimental |
| 4539 |
NassimaOULDOUALI/Prosody-Control-French-TTS
An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control |
|
Experimental |
| 4540 |
x2agi/x2agi-speechkit
🎧 X2AGI speech services: ASR, diarization, AI reports (gRPC, REST clients) |
|
Experimental |
| 4541 |
Zhima-Mochi/whisper-v3-server
A robust backend server for audio processing, delivering high-accuracy... |
|
Experimental |
| 4542 |
aquatiko/Image-Text-Speech-Synthesizer-Converter
Converts image to speech to text using python and it's GUI feature |
|
Experimental |
| 4543 |
devikamanoj/Speech-emotion-recogniser
Recognize human emotion and affective states from speech |
|
Experimental |
| 4544 |
italogsfernandes/mtp-xadrez-de-bruxo
Chess game controlled by voice commands and with physical pieces moving by itself. |
|
Experimental |
| 4545 |
deckarep/DrSbaitsoUi
A front-end for Dr. Sbaitso done in Zig and Raylib. |
|
Experimental |
| 4546 |
EasyAI-France/Audiobook-Simplifier
Audiobook Simplifier is a tool that creates audiobooks from text documents... |
|
Experimental |
| 4547 |
wis/speak
a browser extension designed for minimal clicks or presses to start reading... |
|
Experimental |
| 4548 |
spokestack/spokestack-tray-android
A UI component that makes it easy to add voice interaction to your app. |
|
Experimental |
| 4549 |
Rajvardhman05/openwhisper-app
Free, open-source voice-to-text for macOS — 100% local, offline... |
|
Experimental |
| 4550 |
krithicswaroopan/AI-Voice-Assistance-Pipeline
A real-time voice-to-text and text-to-speech AI pipeline using Whisper, an... |
|
Experimental |
| 4551 |
mihiriart/-Traductor-de-Voz-en-Tiempo-Real-con-Voz-Clonada-Espanol-Ingles
Traductor de voz en tiempo real con clonación de voz – Español ⇄ Inglés.... |
|
Experimental |
| 4552 |
vantu5z/PyBookReaderTTS
Читалка для книг на Gtk через синтезаторы TTS |
|
Experimental |
| 4553 |
sujalrajpoot/openai-tts
A powerful and easy-to-use Python library for generating natural-sounding... |
|
Experimental |
| 4554 |
TodiwalaVentures/phantom-voices-api
10 FREE professional AI voice clones for instant API integration. Zero cost.... |
|
Experimental |
| 4555 |
eauchs/speech-to-speech-pipeline
A real-time, interruptible (barge-in) conversational AI pipeline... |
|
Experimental |
| 4556 |
BluShooz/text-to-video-generator
SOTA Text-to-Video Generator with MuseTalk 1.5, LivePortrait, and LTX-Video.... |
|
Experimental |
| 4557 |
xibn/http-openai-tts
An HTTP microservice using OpenAI to generate text-to-speech. |
|
Experimental |
| 4558 |
TexasInstrumentsDIY/SpiceRack
Voice controlled turntable using the beaglebone black wireless. |
|
Experimental |
| 4559 |
Nishant-15/TTS
Text To Speech in regional languages like English, Hindi and Marathi using python |
|
Experimental |
| 4560 |
deepgram-starters/django-text-to-speech
Get started using Deepgram's Transcription with this Django demo app |
|
Experimental |
| 4561 |
skystone011/migpt-tts-api
让小爱音箱「按需播报」,openclaw可以说话了——通过简单的 HTTP API 触发播报 |
|
Experimental |
| 4562 |
zigzag1001/LLM-to-TTS
Live voice chat with LLM through discord |
|
Experimental |
| 4563 |
fardin-sabid/NeuTTS-Studio
On-Device Text-to-Speech · Voice Cloning · Real-Time Streaming |
|
Experimental |
| 4564 |
dyankov91/a2pod
Convert articles into podcast-quality audio on Apple Silicon. Local TTS, LLM... |
|
Experimental |
| 4565 |
mbrotos/SoundSeg
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation |
|
Experimental |
| 4566 |
rosealexander/react-tts
A flexible SpeechSynthesis adapter for React. |
|
Experimental |
| 4567 |
scrappylabsai/scrappy-radio
AI-powered radio station — generates original music, DJ commentary, and... |
|
Experimental |
| 4568 |
caimari/vtts
Continuous batching for TTS — like vLLM, but for voice. Serve 10+... |
|
Experimental |
| 4569 |
ElmiraGhorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4)... |
|
Experimental |
| 4570 |
NIteshx2/PyAssistant
A Project that gets you up and running with using speech recognition and... |
|
Experimental |
| 4571 |
omenius/epub2mp3
Converts epub e-book files to mp3 audiobook files. |
|
Experimental |
| 4572 |
emon555502/sonori
Sonori is a fully local STT app for linux (wayland). |
|
Experimental |
| 4573 |
huzaifa-fullstack/eduvox-ai
EduVox AI is an AI-powered educational voice companion that delivers... |
|
Experimental |
| 4574 |
977106024/note-wechat-app
微信小程序全栈项目 语音识别 图片识别 |
|
Experimental |
| 4575 |
sebheron/TikTok-Reddit-Text-To-Speech
Reddit TTS generator designed for TikTok |
|
Experimental |
| 4576 |
thiswillbeyourgithub/Spotify_tts
Reads title of spotify songs aloud using AI |
|
Experimental |
| 4577 |
madalena-rocha/nlw-expert
Aplicação de notas de áudio que se convertem em texto. |
|
Experimental |
| 4578 |
leihuazhe/shine-crafts
A smart text-to-speech (TTS) web tool with the feature of downloading... |
|
Experimental |
| 4579 |
brailcom/singing-computer
Computer singing synthesis |
|
Experimental |
| 4580 |
jacksonkasi0/simple-speech-recognition-with-deepgram-in-reactjs
ai speech recognition |
|
Experimental |
| 4581 |
Slothologist/AudioSegmenter
Segmentation of audio for a speech pipeline |
|
Experimental |
| 4582 |
brenomfviana/rita
RITA (Rapid Interaction Assistant for Tasks) is a voice-controlled virtual... |
|
Experimental |
| 4583 |
zhaoyi2/Classical-Speech-Algorithms
Classical speech recognition and speaker recognition algorithms |
|
Experimental |
| 4584 |
Clebson-Torres/WinVoice
An offline voice assistant for Windows, utilizing local AI (Ollama) and... |
|
Experimental |
| 4585 |
Nazmul0005/Text2Audio_Audio2Text_Conversion_Using_HuggingFace
A demo project showcasing text-to-speech and speech-to-text conversions... |
|
Experimental |
| 4586 |
vicentezaror/js-web-t2v
Web text to voice utility functions that allows to customize the behavior,... |
|
Experimental |
| 4587 |
OpenVoiceOS/ovos-audio-transformer-plugin-speechbrain-langdetect
speech language detection plugin |
|
Experimental |
| 4588 |
VARCOVoice/VARCOVoice_UNITYSDK
Official Unity SDK for VARCO Voice API. High-quality AI text-to-speech,... |
|
Experimental |
| 4589 |
pig-mesh/volcengine-tts-spring-boot-starter
火山引擎语音合成(TTS)服务集成 |
|
Experimental |
| 4590 |
nikita9604/Automated-Voice-Controlled-Email-Sender
Simple Automated Voice Controlled Email Sender using SMTP in python |
|
Experimental |
| 4591 |
LuisMiSanVe/AiCursorHelper
AI Assistant that helps you move around your Desktop with voice command |
|
Experimental |
| 4592 |
hakunamatata1997/Speech-to-Text-WebApp
This is a web application that performs speech recognition on audio files.... |
|
Experimental |
| 4593 |
Hayder-IRAQ/SubLab
🎬 Auto-generate & translate video subtitles using Whisper AI — offline,... |
|
Experimental |
| 4594 |
ShahabAthar25/speech-assistant-python
A simple speech assistance in python made with the help of pyttsx3,... |
|
Experimental |
| 4595 |
Ronnie-Leon76/Swahili-ASR
This repository contains the code for fine-tuning the XLS-R Wav2Vec2 model... |
|
Experimental |
| 4596 |
ranchlai/wav2vec-2.0
Wav2vec2 English speech recognition in PaddlePaddle |
|
Experimental |
| 4597 |
babadue/seamless-m4t-v2-large-demo
Demonstration features of seamless-m4t-v2-large model |
|
Experimental |
| 4598 |
openvoicepacks/openvoicepacks
Generate and customize complete voice packs for OpenTX and EdgeTX radios. |
|
Experimental |
| 4599 |
AmirHoseein99/Persian_ASR
a ASR(automatic speech recognition) model for Persian language based on... |
|
Experimental |
| 4600 |
bjornbytes/lua-deepspeech
Lua Library for Speech Recognition |
|
Experimental |