All Voice AI Tools

8,165 tools ranked by quality score · Page 47 of 82

Showing 4601–4700 of 8,165
# Tool Score Tier
4601 dunkbing/text2audio

Simple TTS tool made with Fresh

23
Experimental
4602 ZarredFelicite/parakeet-transcriber

An audio transcription tool using NVIDIA Parakeet, available as a CLI or...

23
Experimental
4603 sonhm3029/Realtime-Vietnamese-ASR-React-Native-and-Whisper

This project implement end to end realtime vietnamese speech recognition...

23
Experimental
4604 ShunsukeHayashi/byteplus-voice-ai

BytePlus音声対話AIアプリケーション - ASR, TTS, Voice Cloning統合(WebSocket対応、日本語対応✅)

23
Experimental
4605 neosapience/typecast-js

The official Node.js SDK for the Typecast API.

23
Experimental
4606 KarinBrisker/Video-Subtitler

Automatically Generating Multilingual Subtitles Using OpenAI's Whisper and...

23
Experimental
4607 kongju7/my_project6

Personal project 6: Speech Recognition Deep Learning Chatbot -...

23
Experimental
4608 bitgineer/Speakeasy

Privacy-first local voice-to-text using Whisper AI. Cross-platform desktop...

23
Experimental
4609 Caliope-SpeechProcessingLab/SpeechTester

Speech Tester is a set of Python scripts conceived as an extension to HTK...

23
Experimental
4610 lispking/qwen3-tts-mlx

A simple and easy-to-use wrapper package for Qwen3 TTS based on MLX Audio....

23
Experimental
4611 my-north-ai/semantic_audio_filtering

Synthetic data augmentation technique via LLM for Automatic Speech...

23
Experimental
4612 loserbcc/openclaw-gateway

Open-source WSS gateway for connecting phones to moltbots. Speaks OpenClaw...

23
Experimental
4613 danijcom/whisper-telegram-bot

Simple Telegram bot for transcribing voice messages into text (STT) in...

23
Experimental
4614 blastheart1/voice-ai-braincx

🎤 Real-time voice AI conversational agent with LiveKit, FastAPI & React....

23
Experimental
4615 shashankchandak/AutoSMSReader

An android application that allows users to read all incoming messages loudly

23
Experimental
4616 sudonitin/MediumScraper

Scraping articles of medium and providing audio versions 📑 to 🔊 using django

23
Experimental
4617 zzpuser/SnapDict

macOS AI 翻译词典,基于 DeepSeek 提供智能翻译、词根助记、拼写纠正和语音朗读 | AI-powered dictionary app...

23
Experimental
4618 FarzadForuozanfar/Speech-Recognition

I recorded 10 voices with the same words from myself and compared them with...

23
Experimental
4619 smcantab/speak11

Select text, press ⌥⇧/, hear it read aloud. macOS text-to-speech powered by...

23
Experimental
4620 Usman-bin-Khalid/Jarvis-AI-Voice-and-Text-Assistant-Python-

Jarvis AI Voice & Text Assistant – A Python-based desktop AI assistant with...

23
Experimental
4621 labrijisaad/Youtube-video-transcriptor

In this notebook, I implemented a script to transcribe YouTube videos (and...

23
Experimental
4622 boltomli/speech-api

Demo to show how to use Azure Speech Services API in app

23
Experimental
4623 Mohamed-Ashik-S/Speech-to-Text

This is a Speech to text project which uses openAI's Whisper model.

23
Experimental
4624 language-org/voice-activ-detect-deepnet

ASR: Light deep net for real-time voice activity detection

23
Experimental
4625 Mohamedfat7i/local-voice-cloning-app

🔊 Clone voices easily with this lightweight Python app that synthesizes...

23
Experimental
4626 dusionlike/unplugin-string-to-audio

在打包过程中自动将字符串转换为语音文件并添加到最终的打包文件里面, 支持Vite and Webpack

23
Experimental
4627 chrismarquezz/voice-chess

An interactive chess app that lets you play and control the game entirely...

23
Experimental
4628 adelacvg/DPTTS

An AR+AR TTS attempt.

23
Experimental
4629 sandeepmukku12/vocodine

🎙️ VocoDine: Book your table with your voice! Speak your booking details,...

23
Experimental
4630 TakumiSenaha/Nreal_IoT

This project aims to visualize the sensor information of the surroundings...

23
Experimental
4631 priyanshpsalian/VISION-THE-BLIND

An all in one solution for safety and security of blind. Features covered in...

23
Experimental
4632 Kimosabey/vox-agent-neural

Neural Voice Agent core constructs for conversational AI.

23
Experimental
4633 nsourlos/voice_cloning_tools

Various tools to clone a voice

23
Experimental
4634 MaurerKrisztian/vrc-tts-osc

Text-to-Speech & AI Bot With OSC Integration

23
Experimental
4635 guptakushal03/Virtual-Voice-Assistant

This Python script creates a voice-controlled desktop assistant capable of...

23
Experimental
4636 dangvansam/nvidia-nemo-jasper-quartznet-asr-vietnamese

Nhận dạng giọng nói Tiếng Việt sử dụng model Quartznet (Nvidia) + flask demo

23
Experimental
4637 axzml/VoxLinkAI_Client

Native macOS voice input assistant. Hold a hotkey, speak, and let AI...

23
Experimental
4638 leo01102/lumen

Lumen – Asistente IA Empático y Multimodal (rostro y voz) en tiempo real....

23
Experimental
4639 Uknowme-h/Audiollect

Audiollect is a Notes to AudioBook Web App built with MERN stack , where...

23
Experimental
4640 vaibhav-init/AskCrow

Voice Bot using Gemini Model

23
Experimental
4641 vishishttiwari/Android_Application_for_understanding_ASL_using_gesture_recognition

An Android Application that uses gesture recognition to understand alphabets...

23
Experimental
4642 mohammad-zolghadr/Pro-Todo

A professional todolist that stores information in local storage and uses...

23
Experimental
4643 swarnayuroy/Web-Automation-using-speech-recognition

Generate results on web browser i.e. automated after user speaks out the...

23
Experimental
4644 ArielDelRio/evernote-clone

Notes App is an application to record notes and store them in the cloud in...

23
Experimental
4645 LEMAS-Project/LEMAS-Project

LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with...

23
Experimental
4646 egorsmkv/w2v2-bert-aligner

Aligner for wav2vec2-bert models

23
Experimental
4647 taufiq-ai/Bengali-AI-Recieptionist

An AI Recieptionist Flask App with STT, TTS, FaceRecognition,...

23
Experimental
4648 ccj242/Audible-Deaf-Communications

A non-profit app designed to make help the deaf communicate in person and...

23
Experimental
4649 kamya-ai/Talk2Text-Live

"Talk2Text Live" is a cutting-edge project that harnesses the power of...

23
Experimental
4650 alwalid54321/AI-Voice-Assistant

A modern, voice assistant built with React, TypeScript, and the Hugging Face...

23
Experimental
4651 theimpossibleastronaut/pennyworth

Voice recognition based digital home assistant in progress. Quite unusable...

23
Experimental
4652 metacore-stack/vocalcanvas-studio

Craft expressive speech from text using a streamlined pipeline of voices,...

23
Experimental
4653 cagataygedik/TTS

Internship Text-to-Speech research project.

23
Experimental
4654 itsanthonio/Vision-To-Speech

A vision to speech project

23
Experimental
4655 sancliffe/ollama-STT-TTS

A simple, hands-free Python voice assistant that runs 100% locally. This...

23
Experimental
4656 YIZHUANG/InstrumHack

For tieto hackathon 2018 to improve Finnish people financial well-being

23
Experimental
4657 RafaelCenzano/Marvin-v3-client

Marvin Version 3 client version

23
Experimental
4658 Mrzhangxiaoduo/react-native-speech-recognizer

react-native-speech-recognizer

23
Experimental
4659 Jmi2020/HowdyVox

A privacy focused offline STT TTS interface for your favorite LLM

23
Experimental
4660 zainibaloch/Quran-App---All-in-one

A fully responsive Next.js 13 Quran web app with audio recitation,...

23
Experimental
4661 lkwbr/structured-prediction

Machine learning algorithms for structured inputs and outputs, such as on...

23
Experimental
4662 Adexandria/TextToSpeechAPI

A REST API that converts a text image to an mp3 file. The text image can...

23
Experimental
4663 geniusrise/audio

Audio components for geniusrise framework

23
Experimental
4664 sobrunmoksesh/Intellifacts_Android_Project

An application that allows you to read facts. It includes voice interaction...

23
Experimental
4665 thewh1teagle/phonikud-assistant

Local AI assistant in Hebrew with Phonikud ✨

23
Experimental
4666 daniel-szulc/Speech_Recognition

🎙 Automatic Keyword Speech Recognition for Polish and English in Tensorflow 🧠

23
Experimental
4667 ctoth/Qlatt

Explainable WebAudio Klatt formant synthesizer with declarative TTS frontend...

23
Experimental
4668 jqi41/Subrank

ICASSP 2020

23
Experimental
4669 falniak95/TurkishSpeechRecognition

Tamamen Türkçe Konuşma Algılama Sistemi. Google Cloud Platform API desteği...

23
Experimental
4670 spacelatte/Basic-Digital-Signage

This is a android application that serves as simple digital signage...

23
Experimental
4671 oarthurfc/AI-outgoing-call

An intelligent voice agent that automatically calls leads, promoting...

23
Experimental
4672 dantasl/parrot-ai

This is a proof of concept that generates speech based on parameters...

23
Experimental
4673 algorithmio/accent-conversion-ai

Real-time accent conversion during phone calls using Twilio, Deepgram, and...

23
Experimental
4674 thirteenkai/bob-plugin-qwen-tts

Bob TTS 插件 - 使用阿里云 Qwen3-TTS-Flash 模型进行语音合成,支持 45+ 种语音角色

23
Experimental
4675 peterxubuaa/Voice-Assistant

Voice Assistant

23
Experimental
4676 dwain-barnes/DeepSeek-Thinking-TTS

Listen to DeepSeek's thinking process in real-time! This script converts...

23
Experimental
4677 tubexchat/interpreter-zh2en-gemini

An interpreter web app between Chinese and English that is powered by Gemini-2.0-fash

23
Experimental
4678 ahmedoubadi/kokoro-tts

Open-source Kokoro-TTS API server (FastAPI) and web UI (React) for...

23
Experimental
4679 RGonza1529/Nura

A Full-Stack React/Node.js AI-powered web application that provides...

23
Experimental
4680 apluka34/audio-crawler

A tool for crawling and creating audio dataset

23
Experimental
4681 sagar-alias-jacky/F.R.I.D.A.Y

A basic but fun virtual assistant made using Python

23
Experimental
4682 Yuanshi9815/LiteFocus

[Interspeech 2024] LiteFocus is a tool designed to accelerate...

23
Experimental
4683 Amiannn/Simple-HmmGmm

Simple HMM implementation

23
Experimental
4684 nathanyaqueby/roche-dementia-hackathon

AI and AR-based digital memory lane and cognitive stimulation for dementia patients

23
Experimental
4685 OldBonhart/TensorFlow_Speech_Recognition_Challenge

TensorFlow Speech Recognition Challenge -...

23
Experimental
4686 tez3998/audio-output-to-text

VOSKを使ったスピーカーやヘッドフォンから出力される音声のオフライン文字起こし

23
Experimental
4687 ilya16/speech-synthesis-course

An introduction course on Speech Synthesis and Voice Cloning (Skoltech ISP'25)

23
Experimental
4688 bobbymay/Dictation-for-macOS

Speech Recognition for macOS that allows you to define words, phrases, or...

23
Experimental
4689 Zhang-Nian/Intelligent_CustomerService

Speech Recognition 、Speech Synthesis 、Intelligent Dialogue

23
Experimental
4690 saharmor/EchoScribe

Local AI transcription workspace with cloud APIs (OpenAI Whisper) or local...

23
Experimental
4691 derpeloper/ostinato

giving a voice to the voiceless.

23
Experimental
4692 LiBinZyu/VAI

Implement highly precise natural language voice control in any Unity...

23
Experimental
4693 jitendrakw09/Voice-Sangam

Voice Sangam is a modern text-to-speech platform built with Next.js 16,...

23
Experimental
4694 YoussefBechara/Enhanced-Custom-ChatBot

Custom Built AI Chatbot using Huggingface's ai, enhanced with features such...

23
Experimental
4695 ashwin2k/LibraAI

Libra.AI is a women's safety-focused voice-activated A.I. assistant android...

23
Experimental
4696 Kunal-Kumar-Sahoo/iCompanion-AssistanceMadeSimple

This is a Python3 based virtual assistant developed for Computer Science...

23
Experimental
4697 imsanjoykb/Speech-NLP-Bootcamp

Speech NLP Bootcamp

23
Experimental
4698 Aslm-Fawzy/Speech-Recognition-Using-Raspberry-Pi

Simple Speech Recognition Program Run on Raspberry Pi

23
Experimental
4699 NEURASCOPE/neurascreen

Automate product tour videos with JSON scenarios. Real browser recording, AI...

23
Experimental
4700 ErolOZKAN-/TurkishSpeechRecognition

Turkish Speech Recognition Project / Türkçe Konuşma Tanıma Projesi

23
Experimental
« Prev 1 2 3 45 46 47 48 49 80 81 82 Next »