All Voice AI Tools

8,165 tools ranked by quality score · Page 54 of 82

Showing 5301–5400 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
5301	phil1px/voice-message-transcriber An iOS share-action extension that transcribes voice messages using Google...	21	Experimental	meeting-transcription-automation	2	Swift
5302	voothi/20251228104300-subtitles This repository is dedicated to preparing subtitles as part of working with...	21	Experimental	prompt-engineering-guides	2	SRecode Template
5303	malob/serverless-tts-podcast WIP rewrite of article-to-audio-cloud-function and...	21	Experimental	content-to-podcast-converters	9	TypeScript
5304	cheeweijie/qwen3-tts-lora-finetuning Qwen3‑TTS LoRA fine‑tuning tools (companion repo) for custom voice adaptation	21	Experimental	qwen3-tts-applications	2	Shell
5305	itsmemotivist/qwen-tts2api 🗣️ Enable text-to-speech with Qwen TTS, a simple API solution that...	21	Experimental	stable-diffusion-tools	—	Python
5306	arrdel/voice-assistant Python script that utilizes natural language processing (NLP) and machine...	21	Experimental	general-purpose-voice-assistants	1	Python
5307	mfirozahmed/iTranslator Project using OCR and TTS	21	Experimental	android-speech-apps	1	Java
5308	BertanDogancay/Multi-Functional-AI-Assistant An advanced AI assistant that can make object detections and uses dialogpt...	21	Experimental	voice-assistant-applications	7	Python
5309	safikhanSoofiyani/VoicePrescription An android application that uses speech to text functionality to produce...	21	Experimental	android-voice-assistants	1	Java
5310	naver/multilingual-distilwhisper This repository contains all the code necessary for running the multilingual...	21	Experimental	whisper-fine-tuning	33	Python
5311	lyle-mlengineer/timesnap A web service for extracting timestamps from youtube videos.	21	Experimental	youtube-video-summarization	—	CSS
5312	marklubin/kairix Voice-first AI agent with persistent memory, background reflection, and...	21	Experimental	voice-agent-applications	5	Python
5313	wyatt-avilla/discord-tiktok-tts-bot discord bot that can play tiktok tts in voice	21	Experimental	discord-tts-bots	1	Python
5314	chalotrasahil/AI-Lecture-Studio AI Lecture Studio is an NLP-driven system that transforms audio and video...	21	Experimental	whisper-speech-transcription	—	Python
5315	krishn1122/voice-agent-local Specially designed for AI Team	21	Experimental	voice-agent-applications	—	Python
5316	ryanfb/ancientgreekspeak Transliterate Ancient Greek to Apple phonemes for text-to-speech synthesis	21	Experimental	lightweight-tts-libraries	9	Ruby
5317	ADT109119/WhisperX-GUI 一個使用者友善的圖形介面，用於輕鬆調用 WhisperX，這是一個提供精確轉錄、強大語者分離和詞級時間戳對齊的自動語音辨識 (ASR) 工具。此 GUI...	21	Experimental	speech-to-text-converters	7	Python
5318	incubated-geek-cc/whisper-onnx A Vite-ReactJS setup to run Whisper OpenAI models locally to transcribe...	21	Experimental	speech-to-text-converters	7	TypeScript
5319	samuelebh/CNN-Spoken-Digit-Classifier Repository containing Python code of a classifier that recognizes spoken...	21	Experimental	keyword-speech-recognition	1	Python
5320	PhysisVerse/physis-vad-swift Modular Swift package for on-device voice activity detection on Apple...	21	Experimental	wake-word-detection	—	Swift
5321	SuJun-Hub/voiceId 借鉴CapsWriter修改的windows端语音输入工具	21	Experimental	dotnet-tts-libraries	9	Batchfile
5322	8G6/rtts rtts is an open source JavaScript package for text to speech conversion	21	Experimental	web-speech-api-tts	3	JavaScript
5323	fann1993814/whisper.cpy Python wrapper for Whisper.cpp	21	Experimental	speech-to-text-converters	6	Python
5324	terkelg/utters Small (257B) promise wrapper for SpeechSynthesisUtterance	21	Experimental	web-speech-api-tts	13	JavaScript
5325	MahtaFetrat/Mana-Forced-Aligner A robust forced alignment tool for low-resource languages using multiple ASR...	21	Experimental	asr-evaluation-metrics	6	Jupyter Notebook
5326	zhangmei126/TextToSpeech UE4 集成TTS文字转语音，使用SAPI5.3版本	21	Experimental	dotnet-tts-libraries	7	—
5327	1abhishekpandey/FastScribe Fast parallel video-to-text transcription powered by OpenAI's Whisper AI.	21	Experimental	audio-transcription-tools	2	Python
5328	aristech-de/tts-clients Clients to communicate with the Aristech TTS service	21	Experimental	lightweight-tts-libraries	3	Python
5329	leanhtech/TextToSpeech_EN_VN Đồ Án Text To Speech (Môn Hệ Điều Hành - PTITHCM)	21	Experimental	tts-model-finetuning	1	Python
5330	mym-br/gnuspeech_sa Articulatory speech synthesizer	21	Experimental	cross-platform-tts-frameworks	11	C++
5331	wenhuahuo/Cross-Device-Acoustic-Communication-Python-Implementation Digital acoustic communication tools using QFSK and Convolutional Encode. 跨设备声学通信。	21	Experimental	zero-shot-voice-synthesis	9	Python
5332	cowdude/flapi FLAPI is an offline, containerized speech recognition websocket API	21	Experimental	go-tts-libraries	7	Go
5333	1ytic/edit-distance-papers A curated list of papers dedicated to edit-distance as objective function	21	Experimental	end-to-end-asr-frameworks	53	—
5334	Wonbin-Jung/e3-vits Official GitHub page of E3-VITS	21	Experimental	zero-shot-voice-synthesis	9	HTML
5335	iamarunbrahma/smart-voice-assistant A simple voice assistant to get your queries in speech format and generate...	21	Experimental	voice-chatgpt-interfaces	1	Python
5336	marttirandma/tipi Tipi Web v2	21	Experimental	twitch-chat-tts	2	TypeScript
5337	cjbayron/audiate Ear training game using machine learning models in the browser	21	Experimental	audio-music-learning	11	JavaScript
5338	ChrisRobinT/realtime-translation Real-time WebRTC voice translation using Whisper STT, Azure Translate, and...	21	Experimental	speech-to-text-converters	2	TypeScript
5339	asrajeh/kaldi-arabic HHM-based Arabic ASR using Kaldi engine	21	Experimental	kaldi-asr-ecosystem	9	Shell
5340	kowaalczyk/reformer-tts An adaptation of Reformer: The Efficient Transformer for text-to-speech task.	21	Experimental	fastspeech-tts-models	10	Python
5341	IRSPlays/ProjectCortexV2 A $300 wearable that gives visually impaired users real-time scene...	21	Experimental	assistive-vision-ai	3	Python
5342	kevinjalbert/spellspoon Spellspoon is a macOS tool built using Hammerspoon that enables...	21	Experimental	audio-transcription-tools	9	Lua
5343	WaelShaikh/OmniVerse-Desktop OmniVerse-Desktop is your local LLM based AI assistant that integrates...	21	Experimental	local-voice-assistants	3	TypeScript
5344	anubhav-n-mishra/xtts-api Production-ready Text-to-Speech API with XTTS-v2, voice cloning,...	21	Experimental	voice-cloning-tools	2	Python
5345	jp1924/HF_builders 🤗 Datasets의 builder script를 모와둔 repo	21	Experimental	speech-corpora-datasets	3	Python
5346	marcogenna/epub2audiobook Convert EPUB books to M4B audiobooks with AI-powered TTS (Edge TTS, Kokoro, Piper)	21	Experimental	ebook-to-audiobook-conversion	—	Python
5347	fulviodenza/go-gladia-client Client Go for Gladia APIs	21	Experimental	go-tts-libraries	5	Go
5348	AryanVBW/AiVoiceClone Transform Your Voice: Replicate Your Unique Sound in a Pristine Pre-Trained...	21	Experimental	voice-cloning-synthesis	11	Python
5349	SyedHuzaifa007/Robbie-12.20-Personal-Virtual-Assistant It is a Speech Recognition Personal Virtual Assistant made with Python that...	21	Experimental	general-purpose-voice-assistants	1	Python
5350	sandeepswain54/Yukti-Care Yukti Care is a mobile app that enables pharmacies, medical distributors,...	21	Experimental	educational-voice-apps	2	Dart
5351	cydanix/voice-agent Real-time voice AI assistant	21	Experimental	voice-agent-applications	2	Rust
5352	Aketirani/audio-mnist Gender Recognition By Voice Analysis	21	Experimental	facial-attribute-classification	12	Python
5353	theablemo/Voice-Captcha-Verification This repository contains the code for the Captcha Verification by voice...	21	Experimental	agentic-ai-orchestration	1	Dart
5354	Nexdata-AI/100-Hours-Thai-Children-Spontaneous-Speech-Data Thai Child's Spontaneous Speech Data	21	Experimental	multilingual-speech-datasets	1	—
5355	Fdr3iZzz/YoutubeVideoTranslate Get a translated YouTube video with AI voiceover	21	Experimental	video-dubbing-tools	7	Java
5356	RumitPatel/android-continues-speech-recognition This project is a demonstration to continues recognition of speech using...	21	Experimental	android-speech-apps	7	Kotlin
5357	traderpedroso/xphoneBR XphoneBR is a Brazilian portuguese transformer base grapheme-to-phoneme and...	21	Experimental	grapheme-to-phoneme-conversion	12	Python
5358	CrispStrobe/CrispTTS (wip) python command-line Text-to-Speech (TTS) tool esp. for German,...	21	Experimental	coqui-tts-applications	7	Python
5359	chihakuro/attendance-check Face recognition for attendance checking system	21	Experimental	face-recognition-systems	1	Python
5360	NhanPhamThanh-IT/Vietnamese-Voice-Search-Engine 🔎 Vietnamese Voice Search Engine - Vietnamese news search app with voice...	21	Experimental	tts-model-finetuning	15	Python
5361	Davi20044/Chat-de-Voz-GPT-3.5 Este projeto consiste em um assistente de conversação que utiliza a...	21	Experimental	general-purpose-voice-assistants	1	HTML
5362	kundan-6646/Musica Musica is an online audio splitter. It works with the power of AI which...	21	Experimental	audio-music-learning	1	EJS
5363	WinsDominoes/sanskrit-tts Sanskrit Text-To-Speech Web-App - Made this for my Sanskrit Learning Journey	21	Experimental	web-speech-api-tts	1	JavaScript
5364	HKAB/vietnamese-rnnt-tutorial A tutorial on how to train RNN-T from scratch with Whisper encoder	21	Experimental	whisper-fine-tuning	12	Python
5365	shesuyo/isi alibaba 智能语音交互（Intelligent Speech Interaction） GO SDK	21	Experimental	go-tts-libraries	1	Go
5366	uigiporc/icon-sr Progetto di Ingegneria della conoscenza, autori: Porcelli Luigi, Nicolo Cucinotta.	21	Experimental	keyword-speech-recognition	1	PureBasic
5367	rgychiu/docbot Personal doctor bot for all your common medical needs.	21	Experimental	multimodal-medical-assistants	1	Java
5368	IHKYoung/AhaTTS TTS Fast Web，一个简单优雅的本地文字转语音的前端与API接口。A localized, cross-platform,...	21	Experimental	gradio-tts-webuis	2	Python
5369	khakhasshi/myOwnTTS A lightweight, high-performance voice cloning TTS system based on Coqui TTS...	21	Experimental	voice-cloning-tools	2	Python
5370	ayutaz/uZipVoice Unity implementation of ZipVoice - lightweight zero-shot text-to-speech...	21	Experimental	dotnet-tts-libraries	—	C#
5371	andreehrlich/Daily-Briefing-Voice-Assistant Conversational voice agent to brief you on your schedule for the day....	21	Experimental	general-purpose-voice-assistants	1	Python
5372	corbinr40/RTCC A piece of software that converts voice to text in a visual output, as an...	20	Experimental	assistive-vision-ai	6	Python
5373	vislupus/Bulgarian-TTS-dataset LibriVox dataset for Bulgarian language TTS	20	Experimental	speech-corpora-datasets	8	—
5374	AppleHolic/2020AIChallengeSpeechRecognition 2020 AI Challenge 음성 인식 코드	20	Experimental	end-to-end-asr-frameworks	8	Python
5375	pika-online/Foreign_Pronunciation_Generator_for_Code-Switch_ASR a socket script to obtain chinese phones-sequence for any english word	20	Experimental	automatic-speech-recognition	5	Python
5376	atharva9167j/Sign-Language-Translator Sign Language Recognition Platform - A real-time American Sign Language...	20	Experimental	sign-language-recognition	2	TypeScript
5377	Kavindu-Rankothge/tiktok-bot TikTok video generation from scraping Reddit community posts	20	Experimental	ai-video-generation	8	Python
5378	shahad-mahmud/incremental_learning_for_asr Incremental learning for automatic speech recognition (ASR)	20	Experimental	end-to-end-asr-frameworks	8	Python
5379	voidful/whisper-live-asr-demo run whisper on CPU/GPU server	20	Experimental	speech-to-text-converters	8	JavaScript
5380	4over7/SpeakOut Offline-first AI voice input for macOS. Hold-to-speak or tap-to-toggle,...	20	Experimental	local-voice-dictation	2	Dart
5381	timothypesi/Speech-to-Text-Converter This GitHub repository contains a Python Streamlit app that utilizes machine...	20	Experimental	streamlit-tts-apps	8	Python
5382	bfackland/replica_dialog_generator 🗣 Auto-generate dialog audio files using the Replica Studios 'AI Voices' API...	20	Experimental	openai-tts-applications	8	Python
5383	oscurprof/Realtime-Subtitles-Generator-using-Python LiveScript: Real-time Live Captioning Software, generates subtitles in...	20	Experimental	live-caption-generation	3	Python
5384	maziac/currah_uspeech_tests Tests for the ZX Spectrums speech synthesizer peripheral: Currah uSpeech...	20	Experimental	embedded-tts-systems	5	Assembly
5385	gerlaxrex/parrot PARRoT: Precise Audio Recognition and Recap over Transcription	20	Experimental	parakeet-asr-implementations	6	Python
5386	SSobol77/Say-Salomon-AI Asynchronous text-to-speech conversion, asynchronous speech-to-text...	20	Experimental	vosk-asr-implementations	5	C++
5387	xingchensong/ASR-Wavnet some ASR-system implementations （via tensorflow 1.x）	20	Experimental	end-to-end-asr-frameworks	5	Python
5388	morikeli/Xcalibur A speech recognition and translation website built with Django in addition...	20	Experimental	web-based-tts-apps	2	JavaScript
5389	MorrisXu-Driving/Improving_DeepSpeech_2_by_RNN_Transducer_Pytorch_Implementation In this repository, based on Deep Speech 2, two losses, CTC and RNN-T are compared.	20	Experimental	end-to-end-asr-frameworks	8	Python
5390	Androz2091/Cicero Great speaker, Cicero is a text-to-speech Discord Bot!	20	Experimental	discord-tts-bots	8	TypeScript
5391	rossriserose/Real-time-Voice-cloning Clone a voice to generate arbitrary speech in real-time	20	Experimental	voice-cloning-tools	1	Python
5392	marcosfelt/latex2speech Convert Latex to speech	20	Experimental	lightweight-tts-libraries	5	Jupyter Notebook
5393	shreyashghag/OfflineSpeechRecognition Offline Speech Recognition For Android Library	20	Experimental	android-speech-apps	5	Kotlin
5394	eray-yuztyurk/python-ai-voice-chatbot AI-powered voice chatbot with Gradio web interface. Talk or type your...	20	Experimental	conversational-chatbot-applications	1	Python
5395	Sec-ant/etts edge-tts in Bun.	20	Experimental	edge-tts-implementations	1	TypeScript
5396	HarunoriKawano/Conformer Implementation of the paper "Conformer: Convolution-augmented Transformer...	20	Experimental	conformer-asr-implementations	6	Python
5397	dibbed/TTSKit-multi-engine-tts Python Text-to-Speech toolkit (multi-engine) with FastAPI, CLI and Telegram...	20	Experimental	lightweight-tts-libraries	4	Python
5398	technout/tts_gtk Graphical interface for Coqui TTS (Text to Speech) command line. Made in...	20	Experimental	lightweight-tts-libraries	5	Python
5399	Tombarr/TranscriberApp Local-first macOS Tahoe Transcription App & CLI Tool	20	Experimental	local-voice-dictation	1	Swift
5400	Dalia-Sher/Speech-Emotion-Recognition-using-BLSTM-with-Attention We present a study of a neural network based method for speech emotion...	20	Experimental	speech-emotion-recognition	11	Python

« Prev 1 2 3 … 52 53 54 55 56 … 80 81 82 Next »