All Voice AI Tools

8,165 tools ranked by quality score · Page 44 of 82

Showing 4301–4400 of 8,165
# Tool Score Tier
4301 BrunoHenrique00/ear

Ear is a desktop app that will help you transcribe what is playing on your computer!

25
Experimental
4302 mahirgul/Google-Cloud-Text-To-Speech-PHP

Google Cloud Text To Speech Application - PHP

25
Experimental
4303 BestSithInEU/cc-vox

Claude Code plugin that speaks a short summary aloud after every response

25
Experimental
4304 dictto-app/dictto

Voice-to-text for Windows — hold a hotkey, speak, release. Clean text...

25
Experimental
4305 bunyaminergen/awesome-speech-dataset

Awesome Speech Dataset, including download links and a brief explanation for...

25
Experimental
4306 chirag127/ZenRead-AI-Content-Reader-Browser-Extension

Privacy-first browser extension providing clean reader mode, AI...

25
Experimental
4307 Youtube-Transcript-Dev/Youtube-Transcript-API

YouTube Transcript API — Extract, transcribe, and translate YouTube videos...

25
Experimental
4308 habitual69/speakify-api

Speakify is a simple API that generates audio and subtitles from text using...

25
Experimental
4309 Bangla-Language-Processing/Bangla-Speech-Corpora

Bangla cleaned speech corpus, specially developed for Bangla Text to Speech

25
Experimental
4310 LinkonBSMRSTU/Speech-To-Text-App-iOS

A simple iOS App that can convert speech/voice into text. Only English voice...

25
Experimental
4311 klee-repos/dialogflow-voice-streaming

Intent mapping with real-time voice to text stream

25
Experimental
4312 Parseval-Labs/SoundML

A high level DSP library in the OCaml language

25
Experimental
4313 daaminashai/speech-assistant

speech assistant for individuals suffering from Aphasia

25
Experimental
4314 conbitin/htk3.5-install

Installation steps of HTK 3.5 under Ubuntu

25
Experimental
4315 tomasgoiba/diphone-synthesizer

Basic diphone-based concatenative speech synthesizer in English.

25
Experimental
4316 R3ner/Barrel-Timer

Advanced voice-controlled cooldown tracker for League of Legends. Tracking...

25
Experimental
4317 aws-samples/seq2seq-asr-misbehaves

Artifacts for the paper "Attentional Speech Recognition Models Misbehave on...

25
Experimental
4318 kyopark2014/demo-robo-soulmate

It is a repository to prepare a demo for dansing robot.

25
Experimental
4319 anuran-roy/vosk-demo

A simple offline voice recognition system purely built on Python3, that...

25
Experimental
4320 avikantz/Samaritan

Samaritan demo clone for iOS.

25
Experimental
4321 Aalwattar/ParrotInk

Professional-grade, real-time voice-to-text for Windows. Stream your voice...

25
Experimental
4322 celanthe/clarion

Your agents have things to say. Now they have a voice to say them with.

25
Experimental
4323 p-jacobo2012240/AI-Real-Time-Recognition

Tensorflow app for real-time environment sketching using text-to-speech and GCP

25
Experimental
4324 Mmesek/mUSh

Ultrastar Songs Creation/Management helper utils.

25
Experimental
4325 Leonqn/speech-to-text-bot

Speech to text telegram bot. It can convert voice and video note messages to...

25
Experimental
4326 verbio-technologies/rust-verbio-speech-center

Rust integration with Verbio Speech Center Cloud https://www.speech-center.verbio.com

25
Experimental
4327 palahsu/textspeech

A python program that helps you to read your text in lady robot voice at...

25
Experimental
4328 abhishekkr/lyrical-video-generator

lyrical-video-generator is supposed to help create lyrical videos from audio...

25
Experimental
4329 miikkij/Speechos

Local-first speech AI benchmarking — compare STT, TTS, emotion & diarization...

25
Experimental
4330 iChochy/mimo-tts-chat

MiMo TTS Chat

25
Experimental
4331 ammarasmro/Kurdish-Language

Applications of NLP on the Kurdish language

25
Experimental
4332 raym33/aiemoji

AI talking face

25
Experimental
4333 DIY-Engineering/Advanced-STS-Local-AI-Assistant

This is a fully local AI Assistant that uses Silero VAD, Faster-Whisper, LM...

25
Experimental
4334 SKLD-xm/speechy

A text-to-speech synthesizer based on C# that supports SSML

25
Experimental
4335 box-community/sample-audio-skills

🎼 Box Skills samples for processing audio files

25
Experimental
4336 PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI

Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI...

25
Experimental
4337 rvuyyuru2/supertonic-restapi

Supertonic FastAPI - High Performance OpenAI-Compatible TTS API

25
Experimental
4338 senigami/audiobook-studio

Professional local-first AI production pipeline for long-form narration....

25
Experimental
4339 harmindersinghnijjar/streamlit-punjabi-ai

Punjabi AI, ChatGPT with translation and Punjabi TTS using Narakeet's API.

25
Experimental
4340 x-phone/xbridge

Self-hosted voice gateway — WebSocket audio streaming and REST call control....

25
Experimental
4341 somosnlp/wav2vec2-spanish

Pre-train a Spanish Wav2Vec2 model using the Spanish portion of the Common...

25
Experimental
4342 tozalia/pocket-tts-openapi-gpu

🎤 Clone voices locally with Pocket TTS OpenAPI - GPU. Enjoy free,...

25
Experimental
4343 victoryangzhijie/stt-server

Real-time speech-to-text WebSocket server with pluggable ASR backends,...

24
Experimental
4344 spokestack/android-skeleton

A functionless Android app that demonstrates a basic integration with the...

24
Experimental
4345 xujiaao/BezierSpline

Android - Smooth Bézier Spline Through Prescribed Points

24
Experimental
4346 msalhab96/RNN-Transducer

PyTorch implementation of Sequence Transduction with Recurrent Neural...

24
Experimental
4347 syedjahangirpeeran/Optical-Character-Recognition-and-TTS

Written in MATLAB, the project aims to convert hand written or printed text...

24
Experimental
4348 artem-gorodetskii/long-form-voice-cloning

Audio samples from "Zero-Shot Long-Form Voice Cloning with Dynamic...

24
Experimental
4349 Salama1429/speech-to-speech-translation

cascaded speech-to-speech translation (STST), mapping from source speech in...

24
Experimental
4350 rohansx/convox

Open-source voice AI orchestration platform for India. Build production...

24
Experimental
4351 1Finn2me/Novery

A modern Android novel reader with multi-source support, TTS, and offline reading

24
Experimental
4352 gillan-krishna/meeting_notes

Hobby project to transcribe audio files from meetings to transcripts with a summary

24
Experimental
4353 baharudin-yusup/salingsapa

A video call apps to enable deaf people to communicate with normal people...

24
Experimental
4354 raminnakhli/HMM-DNN-Speech-Recognition

This repository is a Python implementation of HMM-DNN model.

24
Experimental
4355 Sneakyhydra/Sentinel

Voice Assistant using Whisper in python3

24
Experimental
4356 Shibli-Nomani/Open-Source-Models-with-Hugging-Face

Open Source Models With Hugging Face

24
Experimental
4357 vorojar/VibeVoice

Open-source AI audiobook studio. A free, private alternative to ElevenLabs....

24
Experimental
4358 OpenLake/Speech-Analyser

An App to help you improve your English fluency 🎤

24
Experimental
4359 Prakash2403/asl-recognizer

Sign language recognition using Hidden Markov Models

24
Experimental
4360 treychen-369/WallWhisper

🏠 Turn any IP camera into a smart English tutor for your family. AI-powered,...

24
Experimental
4361 prabormukherjee/Coursera_Helper_chatbot

A chatbot to help coursera student with their difficulty.

24
Experimental
4362 10raw/Prescription-Generator

android app to generate Doctor's Prescriptions faster using Deep Learning

24
Experimental
4363 FernandoLpz/SpeechRecognition

This repository contains the implementation of an Automatic Speech...

24
Experimental
4364 David-Antolick/REX_voice_assistant

Lightweight offline voice assistant for hands-free music control (YouTube...

24
Experimental
4365 SinhaRepo/nexus-ai-assistant

A distributed AI voice assistant built on a Raspberry Pi Zero W and a Flask...

24
Experimental
4366 manojsvgit/Voice_Based_Email_For_Blind

A Python-based application designed specifically for visually impaired...

24
Experimental
4367 h4l0anne/adv-companion-app

[🏆 Harm Reduction Category Winner, Top 5 @RUHacks] Drug-Venture Companion

24
Experimental
4368 EN10/Speech-to-Text-WaveNet

Speech to Text

24
Experimental
4369 richardr1126/KittenTTS-FastAPI

High-performance KittenTTS API server with a built-in web UI,...

24
Experimental
4370 Michaelrace/awesome-voice-agents

🗣️ Explore a curated list of voice AI agents, frameworks, tools, and best...

24
Experimental
4371 dalyanalytics/counselor

👑 voice-powered code review tool for R developers

24
Experimental
4372 AnuragGupta93/LocalEcho

**LocalEcho** is a fully local, open-source Text-to-Speech engine powered by...

24
Experimental
4373 Maxborland/mindtype-app

MindType — Voice-to-text with AI-powered summaries. 100+ languages, works...

24
Experimental
4374 IceDynamix/iceTTS

Twitch Chat TTS with no strings attached

24
Experimental
4375 Ziggx5/TalkToText

Speech-to-text app bulit with Python and Vosk speech recognition engine

24
Experimental
4376 erogol/TTS_tf

WIP Tensorflow implementation of https://github.com/mozilla/TTS

24
Experimental
4377 gupta-v/Eva

Eva - Desktop Assistant: A Python-based desktop assistant designed for...

24
Experimental
4378 meangrinch/spelling-bee

Spelling bee game with multiple difficulty tiers

24
Experimental
4379 371tti/Nelfie

A **standalone** Discord bot with LLM, VOICEVOX, and KaTeX support.

24
Experimental
4380 ryanblab1903n8/piperplus

Piperin is an efficient TTS tool that instantly creates high-quality audio...

24
Experimental
4381 angelinekeke/claude-awake-speak

让你的 Claude Code 会说话 — 自动语音朗读中文内容,8种微软官方音色可选,实时切换,免费无需API Key,跨平台支持

24
Experimental
4382 lianabisuna/spelltacular

Random word spelling skills test/practice (Vue.js 2 & Vuetify)

24
Experimental
4383 GrahamPellegrini/Machine-Learning-Noise-Cancellation

Bachelor Final Year Project exploring real-time speech denoising using...

24
Experimental
4384 Jaffe2718/qwen3asr4j

Java binding for Qwen3 ASR

24
Experimental
4385 analyticsinmotion/wake-word

Hands-free voice activation for VS Code, Cursor, and compatible editors....

24
Experimental
4386 YOUSSEF-BT/Ai-Summarizer

AI-powered summarizer for articles, PDFs, and Word documents with...

24
Experimental
4387 joypix-ai/joypix

AI Talking Video Generator: Talking Photo (AI lip-sync) + AI Avatar...

24
Experimental
4388 LiveTrad/livetrad-io-engine

A complete solution for Live meeting translations Extension+DesktopApp+Server

24
Experimental
4389 Sariel2018/audio-srt-aligner

Dual-mode subtitle tool: transcript-aware alignment and audio-only auto...

24
Experimental
4390 masasibata/t-one-rest-api

Production-ready REST API for Russian speech recognition using T-one model....

24
Experimental
4391 anatolykoptev/moonshine-whisper

Fast speech-to-text HTTP service powered by Moonshine + sherpa-onnx. Beats...

24
Experimental
4392 ComputerCampaign/contentflow-ai

一个功能强大的Python工具,集成网页图片爬取和博客自动生成功能。支持XPath规则配置、任务ID管理、Selenium动态加载、GitHub图床上传、...

24
Experimental
4393 ekdysis/Speech-POC

POC using Apple's Speech framework demonstrating real-time speech...

24
Experimental
4394 Argo-Robot/wake_word_detection

Step-by-step guide to implement a wake-word detection system for Argo, an AI...

24
Experimental
4395 ducnt18121997/Viet-Transformer-TTS

This is PyTorch Implementation of A Non-Autoregressive Transformer with...

24
Experimental
4396 ZhanpengWang96/pytorch-speech2vec

Pytorch implementation of the paper Speech2Vec: A Sequence-to-Sequence...

24
Experimental
4397 BinkyWong/speech-recognition

Centos 7 based container for speech recognition

24
Experimental
4398 PrarieComamile/speech-to-text

Convert your voice to text file with this program.

24
Experimental
4399 davidsuragan/elevenlabs-alpha-v3

Use ElevenLabs Alpha v3 TTS model in Python with this repo.

24
Experimental
4400 Nono-04/ChannelPoints-TTS

A simple TTS rewards script for Twitch channel points

24
Experimental
« Prev 1 2 3 42 43 44 45 46 80 81 82 Next »