All Voice AI Tools

8,165 tools ranked by quality score · Page 45 of 82

Showing 4401–4500 of 8,165
# Tool Score Tier
4401 inforkgodara/python-speech-to-text

A few lines of code which convert speech to text.

24
Experimental
4402 boned-fruitwood759/whisperx-asr-with-fastapi

🎤 Enable real-time speech recognition with WhisperX using FastAPI for...

24
Experimental
4403 hutchpd/AI-Medical-Scribe

Local-first AI medical scribe running entirely in the browser using Chrome...

24
Experimental
4404 Sergey004/silero_tts_rvc

A simple extension that allows LLM to speak in any voice, literally, based...

24
Experimental
4405 profdilley/markdown-speech-converter

This tool converts Markdown files into **speech-friendly plain text** files....

24
Experimental
4406 danielcorsano/reader-gui

Standalone app for creating audiobooks from ebooks using realistic AI voices...

24
Experimental
4407 mohammadhasananisi/Google-Speech-Recognition

Persian-Speech-Recognition

24
Experimental
4408 sindhura-pv/lip-reading

In this project, visual speech recognition has been attempted using 2 major...

24
Experimental
4409 ss87021456/mfcc_ctc_speech

Applies MFCC features of the waveform with an LSTM + CTC loss architecture

24
Experimental
4410 Dante9581/laravel-elevenlabs

🎤 Integrate ElevenLabs Text-to-Speech and Speech-to-Text APIs seamlessly...

24
Experimental
4411 JagratiVerma1408/ObjectDetectionApplication

Android app integrating a tflite model for object detection

24
Experimental
4412 Bacdong/virtual-assistant-v1

Learning to build a virtual assistant with Python and supporting Python libraries.

24
Experimental
4413 Manokero/face-recognition-and-tts-numbers

This project uses facial recognition to verify a person...

24
Experimental
4414 andydowsen/voice-assistant

🏳🌌♨ Simple voice assistant with minimal AI logic, includes Streamlit web...

24
Experimental
4415 swiss-ai-center/text-to-speech-service

Queries an API based on Edge-TTS and returns an audio file based on...

24
Experimental
4416 akashchaudhary-git/android-azure-speech-openai

An integration of Azure Speech Service and Azure OpenAI in Android. This...

24
Experimental
4417 NitinN77/ASL-To-Speech-Rpi

A pi setup to recognize ASL signs using a pre-trained CNN model and speak it...

24
Experimental
4418 yiwise/yiwise-asr-demo-java

Java client demo for the ASR developed in-house by Hangzhou Yizhi Intelligent Technology Co., Ltd.

24
Experimental
4419 aditeyabaral/natural-language-database-querying

A novel approach to data retrieval from tagged databases using only natural...

24
Experimental
4420 astrologos/libri-scraper

The Public Audiobook Scraper downloads full audiobook MP3s from...

24
Experimental
4421 imvladikon/wav2vec2-hebrew

Speech Recognition for Hebrew (using wav2vec2 models)

24
Experimental
4422 icosane/alstroemeria

Create and translate subtitles for any video, complete with voiceover capabilities.

24
Experimental
4423 DrewThomasson/ebook2audiobookEspeak

Easily create audiobooks with espeak in a Gradio GUI

24
Experimental
4424 tuanio/e2e-asr-toolkit

E2E Speech Recognition Toolkit with Hydra and Pytorch Lightning

24
Experimental
4425 vishal1patidar/TEXT-TO-SPEAK

🔖 Voices in 24 different languages. Add text 🗨️ and listen 👂

24
Experimental
4426 rupin/WrittenAudio

Written Audio Uses Google Text to Speech engine and a configuration file to...

24
Experimental
4427 techieinhouse/chatbot

python chatterbot using flask and speech recognition from html5

24
Experimental
4428 BBC-Esq/Elegant-Audio-Transcriber

Extremely fast and accurate audio transcriber surpassing Whisper. Optimized...

24
Experimental
4429 probablyagoodusername/vesper

Therapeutic audio pipeline. Faith meets science. Free, static, open source.

24
Experimental
4430 collinsuen/Local-Whisper-STT-Windows11-ZH

Local GPU-Accelerated Chinese Speech-to-Text for Windows 11 (Whisper-based,...

24
Experimental
4431 garconvacher/TextToSpeech_eBook

A test kit for eBook text-to-speech (EPUB + Kindle)

24
Experimental
4432 Ponyu-dev/Unity-Sherpa-ONNX

Unity plugin for sherpa-onnx — offline TTS, ASR, and VAD with one-click setup

24
Experimental
4433 atanu20/alan-ai-news-project

Here I build a conversational voice-controlled React news application using...

24
Experimental
4434 ckull/SUKI

A Node.JS Discord bot

24
Experimental
4435 YoRyan/obicaller

Talking caller ID for OBiTALK OBi200 and Raspberry Pi (or other Linux)

24
Experimental
4436 djleamen/renamer

Utility to rename mp3 files based on speech content

24
Experimental
4437 elvanselvano/streamlit-whisper

Empowering the visually impaired with equal financial access through...

24
Experimental
4438 dongheehand/Tacotron-PyTorch

PyTorch implementation of Tacotron

24
Experimental
4439 itscooleric/yap

Local-first speech I/O stack — privacy-preserving transcription, synthesis,...

24
Experimental
4440 linseycurrie/NHS-Speech-Recognition-App

This was a group project created remotely over 7 days using Java, Spring,...

24
Experimental
4441 Aprataksh/Python-Files

mic_py: Python 3 code for using a microphone on Windows....

24
Experimental
4442 vault-42/AIND_DNN_Speech_Recognizer

End-to-end speech to text recognition

24
Experimental
4443 Momotoculteur/Keyword-voice-recognition

Build keyword voice recognition using algorithms...

24
Experimental
4444 Neil-001/audio-to-subtitle-translate

Easily convert speech to timed SRT subtitles and translated captions (Colab-ready)

24
Experimental
4445 dcervantes/VoiceFlashcards

VoiceFlashcards is an innovative web app that helps users practice language...

24
Experimental
4446 dpid/openclaw-voice-bridge

Hands-free voice interface for OpenClaw (Clawdbot). VAD-based PWA with...

24
Experimental
4447 elizabethfuentes12/meta-ai-agent-sample-for-aws-agentcore

Voice AI agent for Ray-Ban Meta glasses using Amazon Bedrock AgentCore and...

24
Experimental
4448 lmk123/cvox

Get spoken alerts when Claude Code needs permission or finishes a task — so...

24
Experimental
4449 neurlang/whipstr

Whipstr ASR/STT System

24
Experimental
4450 Epistates/rosellas

Automatic speech recognition (ASR) for Apple Silicon

24
Experimental
4451 D34DC3N73R/ha-chatterbox-tts

Home Assistant TTS integration for Chatterbox-TTS-Server

24
Experimental
4452 jagerzhang/FastTTS

A simple speech synthesis service based on edge-tts that supports private deployment and integrates seamlessly with the 源阅读 app.

24
Experimental
4453 pstepanovum/Cadence

Open-source AI pronunciation coach with phoneme feedback, guided speaking...

24
Experimental
4454 proger/uk

Phonograms and syntagms: processing tools

24
Experimental
4455 umitkacar/transformer-asr-transcription

Real-time transformer-based ASR supporting 100+ languages - Google Cloud...

24
Experimental
4456 MAXBAF1/SpoonEat

A mobile application for maintaining a balance in nutrition, with the...

24
Experimental
4457 xi-Rick/captains-log

A voice transcription and logging web app built with TypeScript, Captain's...

24
Experimental
4458 IDEA-Emdoor-Lab/UniTTS

A TTS Trained on Universal Audio.

24
Experimental
4459 1999AZZAR/Telegram-Bot-Playground

This repository is a playground for experimenting with several simple...

24
Experimental
4460 LexicalStressDetection/lexical-stress-detection

Deep Learning model for lexical stress detection in spoken English

24
Experimental
4461 asheghi/text-to-speech

Text to Speech

24
Experimental
4462 atomiechen/funasr-client-ts

Really easy-to-use Typescript client for FunASR runtime server.

24
Experimental
4463 SVM0N/ttsweb

Convert PDFs/EPUBs to audiobooks with synchronized text highlighting using...

24
Experimental
4464 vijethph/violet-speech

Violet is a Speech Assistant made using Python

24
Experimental
4465 jinseok19/Intermediate_Level_Project_for_AI-X

🤖 AI+X leading talent development intermediate project with KT & Sangmyung University 🤖

24
Experimental
4466 florabtw/google-translate-tts

Node library for Google Translate TTS (Text-to-Speech) API

24
Experimental
4467 TassAI/TASS-Android-UI

TASS Android UI is an open source Android application for using a remote...

24
Experimental
4468 gheyret/uyghur-asr-transformer

Speech Recognition for Uyghur using Speech transformer

24
Experimental
4469 HelgeSverre/glados

A web interface for GLaDOS text-to-speech with AI conversation capabilities

24
Experimental
4470 jaypinho/transcript-accuracy

A Streamlit app to evaluate the accuracy of automatic speech recognition...

24
Experimental
4471 baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN,...

24
Experimental
4472 alorbach/open-video-transcribe

Open Video Transcribe - Open-source video transcription tool that emphasizes...

24
Experimental
4473 kemsta/macloop

https://pypi.org/project/macloop/

24
Experimental
4474 SudharsanSaravanan/JARVIS

JARVIS (Just A Rather Very Intelligent System) is a voice-controlled,...

24
Experimental
4475 bagustris/speech-recognition-course

Material for learning speech recognition, based on Microsoft teaching material on EdX

24
Experimental
4476 smswg/FreeSwitch-Mod_FunAsr

FreeSWITCH...

24
Experimental
4477 josharsh/terminal-voice

Voice input for the terminal. Speak, and it types. Local transcription,...

24
Experimental
4478 SentimentalK/Reliquary

The best voice input, a Zero-Friction Bridge to Your AI Exobrain

24
Experimental
4479 leminhnguyen/ai-speech-engineer-roadmap

A curated roadmap based on my 6 years of experience from zero to become a...

24
Experimental
4480 rudhreeshkumaar/Speech-to-Text

Speech recognition and text transcription from file or microphone

24
Experimental
4481 lane203m/SoundByte

U of R SSE Capstone Project; Recommending Music For Artists

24
Experimental
4482 sahilmishra0012/prescription-generator

This project aims at generating the prescription dictated by the doctor in a...

24
Experimental
4483 rahul6975/Helping-Voice

An Android application which completely works on voice input which helps...

24
Experimental
4484 rapidaai/rapida-python

Open-source Python SDK for real-time Voice AI, voice agents, streaming...

24
Experimental
4485 williamclavier/Multimodal-Classroom-Video-Recorder

A smart multimodal classroom video recording system that automatically...

24
Experimental
4486 lcukerd/Blink-to-Text

Application converts eye blinks to text and hence helps paralysed people communicate.

24
Experimental
4487 CrankZ/muyi

Local subtitle generation and translation with GPU acceleration support

24
Experimental
4488 Winnie-Fred/text-to-speech

Text-to-speech web-based application using Django and Google Translate...

24
Experimental
4489 xDoritox/Voice-Clone-Studio

🔊 Clone and design voices easily with Voice Clone Studio, a web UI powered...

24
Experimental
4490 PrathuashaKB/ASR-Using-Deep-Learning

Automatic Speech Recognition is a technique that processes human speech into...

24
Experimental
4491 kiy0ni/auto-video-editor

A Python (Tkinter) tool that automatically generates highlights and...

24
Experimental
4492 upstash/radio-hackernews

Audio Recap of Top Hackernews Stories

24
Experimental
4493 joaoalvarenga/voice-assistant

An open-source Alexa-like complete voice assistant system, from speech...

24
Experimental
4494 Mildemelwe/Japanese-Tacotron-2-notebook

Training notebook for Japanese TTS model with Tacotron 2

24
Experimental
4495 salehsargolzaee/Audio-Signal-Processing-and-Feature-Extraction

Feature extraction from audio signal (explained in Persian)

24
Experimental
4496 Sajith171111/whisper

🗣️ Transcribe your voice to text easily on macOS. Just hold **Fn**, speak,...

24
Experimental
4497 ilya16/isp-tts

A simple TTS model developed for the Speech Synthesis and Voice Cloning...

24
Experimental
4498 nsourlos/end-to-end_deepfake_colab

Create deepfake video by just uploading the original video and specifying...

24
Experimental
4499 Muhib-Mehdi/ASL-Recognition-System

The ASL Recognition System is a real‑time American Sign Language (ASL)...

24
Experimental
4500 sebinbenjamin/wav2vec_demo

A Python tool for transcribing speech from audio files using the Wav2Vec 2.0...

24
Experimental