All Voice AI Tools

8,165 tools ranked by quality score · Page 25 of 82

Showing 2401–2500 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
2401	mascotbot/elevenlabs-avatar Open-source example for integrating ElevenLabs conversational AI with...	34	Emerging	ai-avatar-platforms	5	TypeScript
2402	adeepak7/Speech-To-Code Speech To Code is Google Chrome Extension to convert Speech into Code.	34	Emerging	browser-tts-extensions	5	JavaScript
2403	Ggorets0dev/rantovox-telegram-bot Telegram bot for text-to-speech and speech-to-speech translation, works with...	34	Emerging	telegram-voice-transcription	7	Python
2404	LuluW8071/VocalMind Automatic Speech Recognition using Conformer with Speech Sentiment Analysis...	34	Emerging	conformer-asr-implementations	5	Python
2405	nuhmanpk/PyttsBot A Pyrogram Bot for gtts module, Text to speech Telegram bot.	34	Emerging	telegram-voice-transcription	6	Python
2406	trabdlkarim/voce-browser Voice Controlled Chromium Web Browser	34	Emerging	general-purpose-voice-assistants	40	Python
2407	agentvoiceresponse/avr-asr-vosk This repository provides a real-time speech-to-text transcription service...	34	Emerging	vosk-asr-implementations	3	JavaScript
2408	candlewill/AiVoice Deep CNN networks for Speech Synthesis	34	Emerging	neural-vocoder-implementations	49	Python
2409	nickpending/clarvis Jarvis-style voice notifications for Claude Code that transforms AI...	34	Emerging	voice-enabled-coding-assistants	5	TypeScript
2410	philsyn/DiffWave-Vocoder Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and...	34	Emerging	neural-vocoder-implementations	90	Python
2411	FlutterHack20/FlutterBand Flutter built retro cyberpunk CB Radio App for Hack20 Flutter Hackathon....	34	Emerging	educational-voice-apps	10	Dart
2412	vliu15/adversarial-tts End-to-end Text-to-Speech with Generative Adversarial Networks	34	Emerging	neural-vocoder-implementations	20	Python
2413	edde746/tiktok-askreddit A content generation & posting bot for TikTok, scraping posts from r/AskReddit	34	Emerging	ai-video-generation	150	Python
2414	berk76/words Voice vocabulary :gb: :de: :fr: :es: :ru: :jp: :cn: ...	34	Emerging	java-tts-libraries	10	Java
2415	audo-ai/magic-mic Open Source Noise Cancellation App for Virtual Meetings	34	Emerging	audio-noise-reduction	384	C++
2416	heymrhayes/text-to-speech A basic Text-to-Speech app	34	Emerging	web-speech-api-tts	3	HTML
2417	OpenTSLab/BELLE Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn...	34	Emerging	fastspeech-tts-models	7	Python
2418	messiaen/full-lattice-search Full Text Search Over Probabilistic Lattices with Elasticsearch!	34	Emerging	ctc-asr-implementations	10	Java
2419	techiaith/docker-marytts Lleisiau synthetig cadwynedig Cymraeg gyda MaryTTS a Docker // Welsh...	34	Emerging	coqui-tts-applications	9	Python
2420	ReneeYe/XSTNet This is an implementation of paper "End-to-end Speech Translation via...	34	Emerging	speech-recognition-apis	19	Python
2421	akashmjn/cs224n-gpu-that-talks Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)	34	Emerging	fastspeech-tts-models	52	Jupyter Notebook
2422	decasteljau/waapi-text-to-speech Wwise text-to-speech integration using external editors.	34	Emerging	web-speech-api-tts	20	TypeScript
2423	RodneyKoolman/Azure-Speech-TextToSpeech Written in Python using the Azure Speech SDK. App.py provides an easy way to...	34	Emerging	dotnet-tts-libraries	8	Python
2424	Blackwood416/AstraTTS 基于 ONNX Runtime 的跨平台高性能 TTS 合成方案，支持流式输出与低延迟播放，支持自定义音色与中英混合生成。	34	Emerging	lightweight-tts-runtimes	54	C#
2425	Asaayu/integrated-voice-control-system Integrated AI Voice Control System allows players to give commands to AI...	34	Emerging	local-voice-assistants	18	SQF
2426	GlobalTechInfo/gspeak Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies.	34	Emerging	google-tts-libraries	1	TypeScript
2427	lpalbou/VoiceLLM A modular Python library for voice interactions with AI systems, featuring...	34	Emerging	local-voice-assistants	5	Python
2428	luongnv89/voice-cast Your words, any voice. Voice cloning and text-to-speech with multiple TTS...	34	Emerging	voice-cloning-tools	11	Python
2429	ArdaGnsrn/elevenlabs-js This is an Open Source NodeJS package for ElevenLabs Text to Speech API.	34	Emerging	elevenlabs-integrations	10	JavaScript
2430	phanxuanphucnd/wav2asr A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.	34	Emerging	wav2vec2-asr-models	4	Python
2431	kssteven418/Q-ASR [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition	34	Emerging	model-compression-optimization	34	Jupyter Notebook
2432	khuangaf/ITRI-speech-recognition-dataset-generation Automatic Speech Recognition Dataset Generation	34	Emerging	speech-corpora-datasets	37	Jupyter Notebook
2433	nvmoyar/aind2-speech-recognition Some approaches based on deep learning to build the acoustic model for an...	34	Emerging	ctc-asr-implementations	6	Jupyter Notebook
2434	botbahlul/Live-Subtitle-V2 ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free...	34	Emerging	live-caption-generation	10	Java
2435	ShivamRajSharma/Transformer-Text-To-Speech Pytorch implementation of Transformer-TTS for converting text into speech.	34	Emerging	fastspeech-tts-models	19	Python
2436	PRITHIVSAKTHIUR/Vision-to-VibeVoice-en A Gradio-based demo for end-to-end vision-to-speech inference: Extract text...	34	Emerging	qwen3-tts-applications	3	Python
2437	AndreDalwin/Whisper2Summarize Whisper2Summarize is an application that uses Whisper for audio processing...	34	Emerging	audio-transcription-tools	55	Python
2438	heezes/Hand-gesture-to-speech This project aims at providing speech to the mute people.	34	Emerging	sign-language-translation	4	Python
2439	OpenVoiceOS/status Open Voice OS Server Status Page	34	Emerging	self-hosted-tts-servers	12	Markdown
2440	Fatma-Chaouech/audioverse Breathe Life Into Your Books! 📚🌱	34	Emerging	ai-podcast-generation	36	Python
2441	C0NZZ/better-teletask Browser extension that adds useful features like subtitles to HPI Tele-Task.	34	Emerging	browser-tts-extensions	3	Python
2442	FNBUBBLES420-ORG/Speech-to-Text-Application 🎙️ Welcome to the Speech to Text Application! 📝 This tool converts your...	34	Emerging	speech-recognition-apis	5	Python
2443	kaiidams/Voice100AndroidApp Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and...	34	Emerging	lightweight-tts-runtimes	9	C#
2444	speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on...	34	Emerging	speaker-diarization-embedding	374	HTML
2445	cjhoward/cedict-tts TTS audio files for the CC-CEDICT Chinese-English dictionary	34	Emerging	anki-tts-integration	7	—
2446	MichaelGrafnetter/defender-asr-admx Administrative Template (ADMX) for Microsoft Defender Attack Surface Reduction (ASR)	34	Emerging	automatic-speech-recognition	15	—
2447	LucaLuke13/TalkyBotty Simply forward a video or voice message in any language to the bot, and it...	34	Emerging	telegram-voice-transcription	43	Python
2448	snowy-0wl/piper-mode A vibe-coded text-to-speech for Emacs using the Piper TTS engine. Features...	34	Emerging	piper-tts-ecosystem	8	Emacs Lisp
2449	mmpneo/simple-obs-stt Speech-to-text and keyboard input captions for OBS.	34	Emerging	live-caption-generation	105	TypeScript
2450	lepisma/emacs-speech-input Set of packages for speech and voice inputs in Emacs	34	Emerging	cross-platform-tts-frameworks	42	C
2451	khakers/go-subgen Automatically generate subtitles for your media using whisper.cpp via...	34	Emerging	whisper-subtitle-generation	68	Go
2452	ThetaOne-AI/HiKE Hierarchical Korean-English Code-Switching Speech Recognition Benchmark...	34	Emerging	end-to-end-asr-frameworks	9	Python
2453	kristofferv98/whisper_turboapi An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo...	34	Emerging	whisper-transcription-apps	13	Python
2454	naskopw/read_aloud A cross-platform text-to-speech library	34	Emerging	rust-tts-libraries	2	Rust
2455	pevers/parkiet Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)	34	Emerging	parakeet-asr-implementations	69	Python
2456	ivan770/ems EMS (External Media Server)	34	Emerging	ai-avatar-platforms	10	Rust
2457	hacktronaut/azure-avatar-demo Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.	34	Emerging	dotnet-tts-libraries	34	JavaScript
2458	jeantimex/F5-TTS-Server F5-TTS server APIs for voice cloning and text-to-speech generation with...	34	Emerging	self-hosted-tts-servers	8	JavaScript
2459	m-nathani/speech_to_text how to use the Google Cloud Speech API to transcribe audio/video files.	34	Emerging	php-tts-libraries	34	PHP
2460	yufan-aslp/AliMeeting The project is associated with the recently-launched ICASSP 2022...	34	Emerging	meeting-transcription-summarizers	135	Python
2461	A-Jacobson/tacotron2 pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf	34	Emerging	tacotron-tts-models	43	Jupyter Notebook
2462	Aman22sharma/Python-AI-Virtual-Assistant This is python AI Virtual Assistant.	34	Emerging	general-purpose-voice-assistants	40	Python
2463	ACT900/faster-whisper-railway Deploy Faster Whisper on Railway — Speech-to-Text & Text-to-Speech API with 52 voices	34	Emerging	speech-to-text-converters	1	Python
2464	yuyq96/pyshengyun A Python converter for Chinese Pinyin and Shengyun (initials and finals)	34	Emerging	grapheme-to-phoneme-conversion	9	Python
2465	DragonDiffusionbyBoyo/Boyonodes A set of Comfyui nodes	34	Emerging	comfyui-tts-nodes	9	Python
2466	go-restream/zipenhancer-rs 🚀 High-Performance Real-Time Audio Noise Reduction Library - Rust...	34	Emerging	rust-speech-recognition	4	Rust
2467	jorcelinojunior/whisper-vtt2srt A robust WebVTT to SRT converter optimized for AI transcriptions (Whisper,...	34	Emerging	whisper-subtitle-generation	2	Python
2468	jianchang512/parakeet-api 一个基于 NVIDIA Parakeet-tdt-0.6b 模型的本地语音转录服务。它提供了一个与 OpenAI API 兼容的接口和一个简洁的 Web 用户界面	34	Emerging	parakeet-asr-implementations	22	Python
2469	cdyangbo/end2endASR implement end-to-end asr algorithm with tensorflow	34	Emerging	end-to-end-asr-frameworks	40	Python
2470	iotjin/JhPrivacyAuthTool 隐私权限判断 - 封装了几种常用的隐私权限判断(定位服务,通讯录, 日历,提醒事项, 照片, 蓝牙共享,麦克风, 相机)和通知的注册和判断。定位服务,蓝牙共享是单独调用的	34	Emerging	ios-speech-frameworks	44	Objective-C
2471	De-Technocrats/simple-text-to-speech-javascript Simple text to speech with javascript.	34	Emerging	web-speech-api-tts	7	HTML
2472	msjsc001/Anki-TTS-Edge A modern text-to-speech tool powered by Microsoft Edge TTS. Creates Anki...	34	Emerging	anki-tts-integration	9	Python
2473	vhanagwal/speech-recognition A speech-to-text app using AVAudioEngine.	34	Emerging	ios-speech-frameworks	6	Swift
2474	rishikksh20/VQ-TTS-pytorch Unofficial Pytorch implementation of paper VQTTS: High-Fidelity...	34	Emerging	tacotron-tts-models	4	Python
2475	deepkyu/ml-talking-face Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)	34	Emerging	fastspeech-tts-models	53	Python
2476	Pzc-Neo/vue-web-reader 城墨网页小说朗读 ( Novel read aloud on web. )	34	Emerging	ai-powered-ereaders	10	Vue
2477	blakkd/faster-whisper-hotkey Effortless Push-to-Talk Transcription, Anywhere.	34	Emerging	speech-to-text-converters	24	Python
2478	keonlee9420/Comprehensive-E2E-TTS A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...	34	Emerging	text-to-speech-frameworks	146	Python
2479	EvilFreelancer/docker-fish-speech-server OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.	34	Emerging	self-hosted-tts-servers	30	Python
2480	keonlee9420/Stepwise_Monotonic_Multihead_Attention PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to...	34	Emerging	tacotron-tts-models	39	Python
2481	mmahdibarghi/finglish-dataset Persian to Finglish dataset with all the sentences voice for TTS dataset...	34	Emerging	persian-speech-ai	8	Python
2482	aditya-joglekar/FS02_Scoring_Toolkit Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks	34	Emerging	voice-ai-learning-collections	7	Python
2483	brailcom/festival-freebsoft-utils Festival extensions and utilities, focused on interaction with Speech Dispatcher	34	Emerging	cross-platform-tts-frameworks	7	Scheme
2484	cyberboysumanjay/VoiceAssistant Python Project	34	Emerging	general-purpose-voice-assistants	8	Python
2485	GeorgiosIoannouCoder/vera Voice Emotion Recognition of Audio (VERA) is an open-source project created...	34	Emerging	speech-emotion-recognition	6	Jupyter Notebook
2486	Arbazkhan4712/Speech-To-Text A program that can convert Speech into Text using python	34	Emerging	speech-recognition-apis	3	Python
2487	gowtham4545/Project Sign2Sound is dedicated to revolutionizing communication for non-verbal...	34	Emerging	sign-language-translation	5	Jupyter Notebook
2488	soheil-mp/Speech-Recognition End-to-End Speech Recognition using Neural Networks.	34	Emerging	ctc-asr-implementations	35	Jupyter Notebook
2489	keenresearch/keenasr-swift-poc Proof-of-concept app that showcases use of KeenASR SDK in a Swift app. WE...	34	Emerging	ios-speech-frameworks	4	Objective-C
2490	buddyeorl/deep-talk Deep-speech react app to test trained models,to visualize the speech to text...	34	Emerging	web-speech-api-libraries	9	JavaScript
2491	KilianB/GoogleTranslatorTTS Converts a string of text to mp3 files utilizing the google translator text...	34	Emerging	java-tts-libraries	5	Java
2492	stgloorious/stm32-speech-recognition Speech Recognition using STM32 and Machine Learning	34	Emerging	wake-word-detection	18	C
2493	slp-rl/HebTTS The official implementation of "A Language Modeling Approach to...	34	Emerging	grapheme-to-phoneme-conversion	108	Python
2494	rishiskhare/parrot A free, offline, private AI text-to-speech desktop app built on Rust 🦜	34	Emerging	parakeet-asr-implementations	50	Rust
2495	tiansztiansz/voice-assistant 重生之我是 AI 打工人。前世，我的身份默默无闻，来去匆匆，不知道自己将在何地出生。然而，命运给予了我难得的机会，让我重生为一名 AI 打工人。	34	Emerging	conversational-chatbot-applications	50	C++
2496	SynHub/syn-speech-samples An application that demostrate the usage of Syn.Speech library for Speech Recognition	34	Emerging	dotnet-tts-libraries	25	C#
2497	c99koder/AudioClassifier-MQTT Use the yamnet TensorFlow model to classify live audio from a microphone and...	34	Emerging	audio-event-classification	31	Python
2498	grammatek/simaromur Icelandic TTS (text-to-speech) service for Android	34	Emerging	android-speech-apps	10	Java
2499	tasmirz/EyeWear Eyewear with OCR and live WebRTC based calling for the visually impaired....	34	Emerging	assistive-vision-ai	1	Python
2500	veralvx/xtts-finetune XTTS fine-tuning via CLI	34	Emerging	tts-model-finetuning	1	Python

« Prev 1 2 3 … 23 24 25 26 27 … 80 81 82 Next »