All Voice AI Tools

8,165 tools ranked by quality score · Page 60 of 82

Showing 5901–6000 of 8,165
# Tool Score Tier
5901 Masihtabaei/reswhis

A lightweight, WebSocket-based server for real-time, remote audio...

19
Experimental
5902 fatehmtd/gladiapp

C++ Client Library for Gladia API

19
Experimental
5903 dudarev/speechdown

CLI tool to transcribe your spoken audio notes into timestamped,...

19
Experimental
5904 moziarnj07-sys/doubaoime-asr

🎤 Enable voice recognition for the Doubao input method using Python; ideal...

19
Experimental
5905 Joyeah/videomaker

Batch-generate videos from images

19
Experimental
5906 HoangLayor/LiveTranslator

LiveTranslator is a real-time speech translation system that captures spoken...

19
Experimental
5907 Maksim-Goncharovskiy/video-dubbing

Dubbing English videos into Russian.

19
Experimental
5908 patelritiq/CodeClause-Internship-Projects

A comprehensive collection of 4 Python applications developed during a...

19
Experimental
5909 001kenji/Text_To_Speech_AI

A modern web application that converts text to speech using advanced TTS...

19
Experimental
5910 porcelluscavia/audio-model

My Master's thesis project in audio classification using PyTorch and...

19
Experimental
5911 shrey802/PyTTSeval

Evaluation tool for TTS systems

19
Experimental
5912 skye-cyber/ttskit3

A lightweight text-to-speech toolkit

19
Experimental
5913 deepgram-starters/ruby-text-to-speech

Get started using Deepgram's Text-to-Speech with this Ruby demo app

19
Experimental
5914 JuanJRA20/Conversor-Texto-a-Voz

🎙️ Intelligent text-to-audio conversion system with detection...

19
Experimental
5915 laustke/jimlet_classic

Offline text-to-speech GUI converter with drag-and-drop support,...

19
Experimental
5916 James-P-D/SDRTranscriber

SDR audio transcriber in Python

19
Experimental
5917 crrrowz/Vosk-STT-Chrome-Extension

Real-time Speech-to-Text Chrome Extension — dictate into any input field...

19
Experimental
5918 deepgram-starters/php-text-to-speech

Get started using Deepgram's Text-to-Speech with this PHP demo app

19
Experimental
5919 Sergey004/Phone_Guy

An AI phone character based on Phone Guy from FNAF

19
Experimental
5920 Sgvkamalakar/Gita_Summarizer

Gita Summarizer extracts key insights from the Bhagavad Gita, aiding...

19
Experimental
5921 sknadig/ASR_2018_T01

Example repository for 2018 DS/NC 821 / Automatic Speech Recognition projects

19
Experimental
5922 madhura-23/ai-voice-assistant

🎤 AI Voice Assistant - Real-time speech recognition, production-ready, fully...

19
Experimental
5923 Pchambet/tp-hmm-markov

Markov Chains and Hidden Markov Models: weather modeling with discrete...

19
Experimental
5924 dsalnikov/wav2vec

pure numpy implementation of wav2vec 2.0

19
Experimental
5925 YoungloLee/tf2-speech-recognition-transformer

TensorFlow 2 Speech Recognition Code (Transformer)

19
Experimental
5926 deepgram-starters/csharp-text-to-speech

Get started using Deepgram's Text-to-Speech with this C# demo app

19
Experimental
5927 bryanstevensacosta/tts-studio

Personal voice cloning CLI tool using XTTS-v2

19
Experimental
5928 shantoshdurai/GhostTalker

AI voice cloning and text-to-speech using XTTS — talk to historical figures...

19
Experimental
5929 ErenBalkis/rvc-tts-studio

A Streamlit-based web interface that converts text to speech using edge-tts...

19
Experimental
5930 karthikrshet/text-to-speech

Convert any text into lifelike speech. Choose your language and voice.

19
Experimental
5931 rookiemann/portable-tts-server

Portable multi-GPU text-to-speech server for Windows — 10 AI models, gateway...

19
Experimental
5932 Uchastnick/malisa

Malisa, the voice assistant robot

19
Experimental
5933 ikeoffiah/kokoro_tts

On-device Kokoro TTS for Flutter — high-quality text-to-speech using ONNX...

19
Experimental
5934 MrThinkins/text-to-speach-native-to-web

A TTS that runs natively in the browser using the kokoro.js library.

19
Experimental
5935 Tharindu-Senanayake12/Sign-Language-Interpreter

Real-time AI sign language interpreter with gesture recognition, NLP...

19
Experimental
5936 indaco/md2audio

Convert markdown sections to audio files using multiple TTS providers - a...

19
Experimental
5937 vroomfondel/sipstuff

SIP telephony automation toolkit — place calls via PJSIP, play WAV/TTS...

19
Experimental
5938 Jhanwi/Intelligent-Desktop-Companion

This project developed a personalized Python-based voice controlled...

19
Experimental
5939 shervinnd/Persian-Voice-Assistant-for-Home-Appliance-Repairs

🛠️ A Persian voice assistant to help with diagnosing and repairing home...

19
Experimental
5940 200-DevelopersFound/Havo

The mobile application you envision is designed to facilitate the conversion...

19
Experimental
5941 chirag127/SystemAudioTranscriber-RealTime-SystemAudio-To-Text-Windows-App

Real-time transcription of Windows system audio to text via a floating,...

19
Experimental
5942 zefie/multi-tts

Docker for multiple TTS Engines with a Gradio interface

19
Experimental
5943 gcryptonlabs/FlowCue

FlowCue — native macOS teleprompter with real-time speech tracking, AI...

19
Experimental
5944 wehomemove/WhisprByTheo.spoon

Push-to-talk voice transcription for macOS using MLX Whisper. Beautiful UI,...

19
Experimental
5945 jswallez/jetvoice

Voice to text for macOS. Press a hotkey, speak, get instant transcription.

19
Experimental
5946 myl7/doubao-voice-input-electron

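Doubao real-time speech-to-text desktop app: press a hotkey or hold a designated key, and the recognition result is typed automatically into the current application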

19
Experimental
5947 Revocalize/revocalize-docs

🎤 Revocalize AI API: Sing like your favorite artist with our powerful AI...

19
Experimental
5948 lukeocodes/clarion

macOS menu bar app that reads text aloud using Deepgram TTS

19
Experimental
5949 giefferre/texttospeech

Google Cloud Text-to-Speech API Client Library for Go

19
Experimental
5950 lask3802/live-translator

Real-time AI-powered transcription and translation Chrome extension for live...

19
Experimental
5951 qora-protocol/QORA-TTS-12Hz-0.6B

Pure Rust TTS engine with 9 built-in speakers. No Python, no CUDA, no...

19
Experimental
5952 GustasG/vits

VITS Text-to-Speech Model for Lithuanian Language

19
Experimental
5953 davideferrari95/alexa_voice_control

This repository allows you to establish a communication between ROS / ROS2...

19
Experimental
5954 SurveAditya/StudentManagementSystem

A student management system with graph plotting and voice recognition implemented.

19
Experimental
5955 GlobussBiogestion/text-to-signals-and-voice

This API works 100% in HTML with JavaScript so it is very light and easy to...

19
Experimental
5956 mneme-verse/mneme

Open-source mobile app for memorizing poetry using Spaced Repetition and...

19
Experimental
5957 Ask149/friday

A macOS desktop companion with an animated face, voice I/O, and personality...

19
Experimental
5958 Robertinoos13/PyroSpeak-Library

PyroSpeak is a small Python wrapper library that uses big technologies like...

19
Experimental
5959 dgaida/text2speech

Provides text2speech capabilities using ElevenLabs and Kokoro TTS

19
Experimental
5960 KF-R/turk-chat

Lightweight speech-to-speech web-based chat app combining speech...

19
Experimental
5961 Bsh54/AI_Phone_Call

Web application that transforms traditional speech synthesis into...

19
Experimental
5962 Eleven1111/groq-whisper

Groq-powered OpenClaw speech tools for local audio transcription and...

19
Experimental
5963 trentw/script-to-speech

Convert screenplays into multi-voiced audiobooks using various...

19
Experimental
5964 martins-vds/my-assistant

A voice-driven personal task-tracking assistant for tech workers who...

19
Experimental
5965 BedirT/NarratorX

📖 NarratorX: Turn your PDFs into captivating audiobooks in 16 languages,...

19
Experimental
5966 zyascend/End-to-End-Speech-Recognition-Learning

ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition

19
Experimental
5967 avreliusdante-web-creator/voice-input

Browser extension: convert voice to text and send it with one click in open...

19
Experimental
5968 noly24/spoken-subtitles

Chrome extension that reads subtitles aloud on streaming sites for accessibility

19
Experimental
5969 fromis-9/audio-fm

Create narrated countdowns of your top tracks from Last.fm

19
Experimental
5970 MnAkash/aalap

A speech to speech dialogue management package using faster-whisper ASR,...

19
Experimental
5971 LiiLk/Local-AI-Companion

A private, offline AI assistant running entirely on your local machine.

19
Experimental
5972 punyamodi/Speech-to-Speech-Local-LLM

Local speech-to-speech AI assistant with voice cloning, Gradio UI,...

19
Experimental
5973 Bailie-L/VelaNova

Fully offline voice assistant powered by local LLMs — no cloud, no...

19
Experimental
5974 seanghay/vits.cpp

VITS Inference using ONNX Runtime on C++

19
Experimental
5975 michael-borck/talk-buddy

Provides AI-powered conversation practice with speech recognition and...

19
Experimental
5976 ebisuryu/vision-ai-intern-assignment

This repository contains my solution for the Vision AI intern assignment at...

19
Experimental
5977 okamyuji/HomeCareVoiceLog

Offline iOS voice-first care journal with automatic on-device transcription...

19
Experimental
5978 theubie/OpenTAAI

Read chat log from a Twitch channel and get a natural response from OpenAI. ...

19
Experimental
5979 eryk-mazus/sigh

Seamless Voice Interactions with LLMs

19
Experimental
5980 miranda1000/TwitchTTSBot

A Twitch bot that reads point redemptions with a custom trained voice.

19
Experimental
5981 ttsaigit/tts-ios

TTS.ai iOS app — 18 AI text-to-speech models, voice cloning, speech-to-text

19
Experimental
5982 msalhab96/Listen-Attend-and-Spell

PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper

19
Experimental
5983 nkm90/HearMeWhenYouCanNotSeeMe

Sign language recognition, using multihand tracking solution from Mediapipe,...

19
Experimental
5984 erich2s/native-speak

A simple text-to-speech library using system native tts engines for Node.js

19
Experimental
5985 pyzskw/meeting-teleprompter

Online meeting teleprompter - automatic speech-recognition tracking, screenshot protection, focus mode, offline model | Meeting Teleprompter with offline ASR

19
Experimental
5986 brlin-tw/whisper.cpp-snap

Provides easy access to the whisper.cpp application on snap-enabled OS distributions.

19
Experimental
5987 hubetcardenasi/SpeechApp

Turn your phone into a voice application.

19
Experimental
5988 laravieira/reddit-to-tiktok

This project is a Python rendering and publishing pipeline that takes Reddit...

19
Experimental
5989 mostlyvirtual/book-to-audiobook

Convert PDFs and EPUBs into MP3 audiobooks with a clean local web UI,...

19
Experimental
5990 upskyy/RNN-Transducer

PyTorch Implementation of RNN-Transducer

19
Experimental
5991 avrtt/MoE-speech-recognition

Mixture of experts architecture for speech-to-text and language...

19
Experimental
5992 Yacinewhatchandcode/VoiceCloning

🎙️ Real-Time TTS & Voice Cloning Pipeline — F5-TTS · PyTorch · Gradio · Voice Agent

19
Experimental
5993 xAlpharax/edge-tts-gradio

Gradio Interface for Text-To-Speech using Edge TTS.

19
Experimental
5994 wacumov/stttool

A command-line utility for converting audio files to text using a pretrained model.

19
Experimental
5995 Inexpli/Discord-Jarvis

A real-time Discord voice assistant powered by Llama 3, Whisper, and Web...

19
Experimental
5996 Matrixxboy/vermeil

Vermeil is a personal assistant just like Jarvis

19
Experimental
5997 cybernahx/urdu-voice-assistant

An Urdu language voice assistant built with Python for speech recognition and TTS

19
Experimental
5998 Dragon745/urdu-roman-dictionary

A growing open-source Urdu → Roman Urdu dictionary and lexicon for...

19
Experimental
5999 Largo-m/AutoCaption

AutoCaption is a complete, fully automated tool for generating video...

19
Experimental
6000 mcp-tool-shop-org/voice-soundboard

TTS library for AI agents — compiler/graph/engine architecture, swappable...

19
Experimental