All Voice AI Tools

8,165 tools ranked by quality score · Page 46 of 82

Showing 4501–4600 of 8,165
# Tool Score Tier
4501 satiseason/Chatbot-with-text-voice-chatting

Telegram bot is developed by AI techniques(Speech-to-Text, Text-to-Speech,...

24
Experimental
4502 ABashir88/enterprise-voice-ai-architectures

Reference architectures, cost models, and sales-engineering playbooks for...

24
Experimental
4503 MotivationalSpeechSynthesis/motivational-speech-synthesis

Artistic research deconstructing the performative excess of motivational...

24
Experimental
4504 abhijhacodes/PDF_to_AudioBook_converter

Python code that converts any pdf file into audiobook

24
Experimental
4505 toshalpatel/AudioSimilarity

When two audio files compared, the result is giving the similar part from...

24
Experimental
4506 Rishabh1925/voiceforge

AI-powered voice automation platform with text-to-speech and automated...

24
Experimental
4507 sky-flutter/Python-Jarvis

Voice-based assistant to make task automated

24
Experimental
4508 temp3rr0r/Longsword-Data-MQTT-Publisher

Working demo: https://www.youtube.com/watch?v=v7hvOyPQ0EM. The main IoT app....

24
Experimental
4509 Iroha-P/MiniBox

Character voice chatbot with GPT-SoVITS TTS + LLM role-playing, supports Web...

24
Experimental
4510 Ryadel/ClawTalk

Chrome side panel extension (MV3) that connects to an OpenClaw Gateway and...

24
Experimental
4511 Jay113910/Speech-to-Text-Vosk

A real time speech recognition program using microphone based on "Vosk" - an...

24
Experimental
4512 vishudhiman/TEXT-N-SPEECH

Small project with the help of javascript and speech synthesis web API.

24
Experimental
4513 dangvansam/deepxi-flask-server

DeepXi with Flask Server

24
Experimental
4514 clarenceluo78/singer-adaptive-svc

This repository is the implementation of project Converting to Realistic...

24
Experimental
4515 flexhub77/piper-tts-call

🎙️ Generate high-quality audio from text in real-time with Piperin, the...

24
Experimental
4516 EGWeeks/translate_tts_api

AWS Translate & Text to Speech API Javascript Example

24
Experimental
4517 brailcom/speechd-java

Java client library for Speech Dispatcher

24
Experimental
4518 woofie/woof

AR Unity virtual pet app that recognises voice commands, performs NLP on...

24
Experimental
4519 noErrdev/python-speech-ai-forge

Speech-AI-Forge is a project developed around TTS generation model,...

24
Experimental
4520 Herobrine25mcpe/text-to-speech_Tkinter

So this is a project in which I am working on a simple text to speech...

24
Experimental
4521 smsraj2001/PYEDIT-PRO-THE-ULTIMATE-ADVANCED-TEXT-EDITOR

An Advanced text editor in python with enhanced and amazing features

24
Experimental
4522 mllpresearch/ESO-dataset

ESO speech dataset: an English-language speech corpus of the oncology domain...

24
Experimental
4523 RamR3R/InterviewAuto

This is openAi powered interview site where the user can join and take in...

24
Experimental
4524 ELITA04/HackHealth2021

HelpVu: An AI-powered narration application for the visually impaired....

24
Experimental
4525 UserJoo9/Noura-Assistant-Free

AI voice assistant for Windows with English/Arabic support. Control apps,...

24
Experimental
4526 Ani0202/Speech-Translation-with-Python

Translate your speech to many languages using Google Translate API

24
Experimental
4527 danielrosehill/Speech-To-Text-System-Prompt-Library

An updated skeleton library of system prompts for using LLMs to refine STT output

24
Experimental
4528 polterguy/magic-menu

An alternative input module for Phosphorus Five, allowing you to use natural...

24
Experimental
4529 Akash-Apturkar/Sentiment-Analysis-of-speech-using-NLP-with-Android-Connect-feature-and-web-scraping

We aim to develop a ‘Smart Speech Ecosystem’ that takes audio input,...

24
Experimental
4530 wujunwei928/go-zero-tts

基于微软edge大声朗读接口开发的语音合成服务, 后端 go-zero, 前端 vuetify

24
Experimental
4531 limbang/text-to-speech

基于 Azure 文本转语音

24
Experimental
4532 NeptuneHub/AudioMuse-AI-DCLAP

AudioMuse-AI-DCLAP is a lightweight, high-speed distilled version of LAION...

24
Experimental
4533 thewh1teagle/heb-piper-tts-gemma-g2p-onnx

Text to speech with Hebrew G2P and TTS models based on Piper/Gemma3

24
Experimental
4534 mklement0/voices

macOS CLI for changing the default TTS (text-to-speech) voice and printing...

24
Experimental
4535 tfm000/diana

Locally hosted Text-to-Speech Document Converter

24
Experimental
4536 vpdl-sys/vpdl-public

Proprietary AI Voice Script Writer for turning written text into natural,...

24
Experimental
4537 synesthesiam/pt-synesthesiam

CMU Sphinx acoustic model for Portugese (pt-br)

24
Experimental
4538 sovse/base_rus_whisper_stt

Fine tuning of the base model from OpenAI Whisper in Russian language on the...

23
Experimental
4539 NassimaOULDOUALI/Prosody-Control-French-TTS

An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control

23
Experimental
4540 x2agi/x2agi-speechkit

🎧 X2AGI speech services: ASR, diarization, AI reports (gRPC, REST clients)

23
Experimental
4541 Zhima-Mochi/whisper-v3-server

A robust backend server for audio processing, delivering high-accuracy...

23
Experimental
4542 aquatiko/Image-Text-Speech-Synthesizer-Converter

Converts image to speech to text using python and it's GUI feature

23
Experimental
4543 devikamanoj/Speech-emotion-recogniser

Recognize human emotion and affective states from speech

23
Experimental
4544 italogsfernandes/mtp-xadrez-de-bruxo

Chess game controlled by voice commands and with physical pieces moving by itself.

23
Experimental
4545 deckarep/DrSbaitsoUi

A front-end for Dr. Sbaitso done in Zig and Raylib.

23
Experimental
4546 EasyAI-France/Audiobook-Simplifier

Audiobook Simplifier is a tool that creates audiobooks from text documents...

23
Experimental
4547 wis/speak

a browser extension designed for minimal clicks or presses to start reading...

23
Experimental
4548 spokestack/spokestack-tray-android

A UI component that makes it easy to add voice interaction to your app.

23
Experimental
4549 Rajvardhman05/openwhisper-app

Free, open-source voice-to-text for macOS — 100% local, offline...

23
Experimental
4550 krithicswaroopan/AI-Voice-Assistance-Pipeline

A real-time voice-to-text and text-to-speech AI pipeline using Whisper, an...

23
Experimental
4551 mihiriart/-Traductor-de-Voz-en-Tiempo-Real-con-Voz-Clonada-Espanol-Ingles

Traductor de voz en tiempo real con clonación de voz – Español ⇄ Inglés....

23
Experimental
4552 vantu5z/PyBookReaderTTS

Читалка для книг на Gtk через синтезаторы TTS

23
Experimental
4553 sujalrajpoot/openai-tts

A powerful and easy-to-use Python library for generating natural-sounding...

23
Experimental
4554 TodiwalaVentures/phantom-voices-api

10 FREE professional AI voice clones for instant API integration. Zero cost....

23
Experimental
4555 eauchs/speech-to-speech-pipeline

A real-time, interruptible (barge-in) conversational AI pipeline...

23
Experimental
4556 BluShooz/text-to-video-generator

SOTA Text-to-Video Generator with MuseTalk 1.5, LivePortrait, and LTX-Video....

23
Experimental
4557 xibn/http-openai-tts

An HTTP microservice using OpenAI to generate text-to-speech.

23
Experimental
4558 TexasInstrumentsDIY/SpiceRack

Voice controlled turntable using the beaglebone black wireless.

23
Experimental
4559 Nishant-15/TTS

Text To Speech in regional languages like English, Hindi and Marathi using python

23
Experimental
4560 deepgram-starters/django-text-to-speech

Get started using Deepgram's Transcription with this Django demo app

23
Experimental
4561 skystone011/migpt-tts-api

让小爱音箱「按需播报」,openclaw可以说话了——通过简单的 HTTP API 触发播报

23
Experimental
4562 zigzag1001/LLM-to-TTS

Live voice chat with LLM through discord

23
Experimental
4563 fardin-sabid/NeuTTS-Studio

On-Device Text-to-Speech · Voice Cloning · Real-Time Streaming

23
Experimental
4564 dyankov91/a2pod

Convert articles into podcast-quality audio on Apple Silicon. Local TTS, LLM...

23
Experimental
4565 mbrotos/SoundSeg

Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation

23
Experimental
4566 rosealexander/react-tts

A flexible SpeechSynthesis adapter for React.

23
Experimental
4567 scrappylabsai/scrappy-radio

AI-powered radio station — generates original music, DJ commentary, and...

23
Experimental
4568 caimari/vtts

Continuous batching for TTS — like vLLM, but for voice. Serve 10+...

23
Experimental
4569 ElmiraGhorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4)...

23
Experimental
4570 NIteshx2/PyAssistant

A Project that gets you up and running with using speech recognition and...

23
Experimental
4571 omenius/epub2mp3

Converts epub e-book files to mp3 audiobook files.

23
Experimental
4572 emon555502/sonori

Sonori is a fully local STT app for linux (wayland).

23
Experimental
4573 huzaifa-fullstack/eduvox-ai

EduVox AI is an AI-powered educational voice companion that delivers...

23
Experimental
4574 977106024/note-wechat-app

微信小程序全栈项目 语音识别 图片识别

23
Experimental
4575 sebheron/TikTok-Reddit-Text-To-Speech

Reddit TTS generator designed for TikTok

23
Experimental
4576 thiswillbeyourgithub/Spotify_tts

Reads title of spotify songs aloud using AI

23
Experimental
4577 madalena-rocha/nlw-expert

Aplicação de notas de áudio que se convertem em texto.

23
Experimental
4578 leihuazhe/shine-crafts

A smart text-to-speech (TTS) web tool with the feature of downloading...

23
Experimental
4579 brailcom/singing-computer

Computer singing synthesis

23
Experimental
4580 jacksonkasi0/simple-speech-recognition-with-deepgram-in-reactjs

ai speech recognition

23
Experimental
4581 Slothologist/AudioSegmenter

Segmentation of audio for a speech pipeline

23
Experimental
4582 brenomfviana/rita

RITA (Rapid Interaction Assistant for Tasks) is a voice-controlled virtual...

23
Experimental
4583 zhaoyi2/Classical-Speech-Algorithms

Classical speech recognition and speaker recognition algorithms

23
Experimental
4584 Clebson-Torres/WinVoice

An offline voice assistant for Windows, utilizing local AI (Ollama) and...

23
Experimental
4585 Nazmul0005/Text2Audio_Audio2Text_Conversion_Using_HuggingFace

A demo project showcasing text-to-speech and speech-to-text conversions...

23
Experimental
4586 vicentezaror/js-web-t2v

Web text to voice utility functions that allows to customize the behavior,...

23
Experimental
4587 OpenVoiceOS/ovos-audio-transformer-plugin-speechbrain-langdetect

speech language detection plugin

23
Experimental
4588 VARCOVoice/VARCOVoice_UNITYSDK

Official Unity SDK for VARCO Voice API. High-quality AI text-to-speech,...

23
Experimental
4589 pig-mesh/volcengine-tts-spring-boot-starter

火山引擎语音合成(TTS)服务集成

23
Experimental
4590 nikita9604/Automated-Voice-Controlled-Email-Sender

Simple Automated Voice Controlled Email Sender using SMTP in python

23
Experimental
4591 LuisMiSanVe/AiCursorHelper

AI Assistant that helps you move around your Desktop with voice command

23
Experimental
4592 hakunamatata1997/Speech-to-Text-WebApp

This is a web application that performs speech recognition on audio files....

23
Experimental
4593 Hayder-IRAQ/SubLab

🎬 Auto-generate & translate video subtitles using Whisper AI — offline,...

23
Experimental
4594 ShahabAthar25/speech-assistant-python

A simple speech assistance in python made with the help of pyttsx3,...

23
Experimental
4595 Ronnie-Leon76/Swahili-ASR

This repository contains the code for fine-tuning the XLS-R Wav2Vec2 model...

23
Experimental
4596 ranchlai/wav2vec-2.0

Wav2vec2 English speech recognition in PaddlePaddle

23
Experimental
4597 babadue/seamless-m4t-v2-large-demo

Demonstration features of seamless-m4t-v2-large model

23
Experimental
4598 openvoicepacks/openvoicepacks

Generate and customize complete voice packs for OpenTX and EdgeTX radios.

23
Experimental
4599 AmirHoseein99/Persian_ASR

a ASR(automatic speech recognition) model for Persian language based on...

23
Experimental
4600 bjornbytes/lua-deepspeech

Lua Library for Speech Recognition

23
Experimental
« Prev 1 2 3 44 45 46 47 48 80 81 82 Next »