All Voice AI Tools

8,165 tools ranked by quality score · Page 24 of 82

Showing 2301–2400 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
2301	xiaominfc/aliyun_nls_c_demo 阿里云的实时语音识别服务(ASR)没有提供C的SDK,项目中需要,看了它java sdk的实现,就做了个C版demo	35	Emerging	java-tts-libraries	5	C
2302	a-n-rose/Python-Sound-Tool SoundPy (alpha stage) is a research-based python package for speech and...	35	Emerging	audio-source-separation	77	Jupyter Notebook
2303	renaudjenny/TellTime iOS application to tell the time in the British way 🇬🇧⏰	35	Emerging	ios-speech-frameworks	67	Swift
2304	jreremy/conformer Pytorch implementation of conformer with with training script for end-to-end...	35	Emerging	text-to-speech-frameworks	28	Python
2305	SohamRatnaparkhi/Voice-Assistant Voice Assistant coded in Python!	35	Emerging	general-purpose-voice-assistants	10	Python
2306	MingLunHan/CIF-PyTorch [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech...	35	Emerging	end-to-end-asr-frameworks	79	Python
2307	alitahir4024/Text-To-Speach-Javascript A creative project to give voice to your words.	35	Emerging	web-speech-api-tts	14	JavaScript
2308	huuquyet/PhoWhisper-next Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js	35	Emerging	whisper-fine-tuning	7	TypeScript
2309	PareekshithPalat/AETHER---Personal-Assistant AETHER is a voice-activated Python personal assistant that responds to...	35	Emerging	voice-controlled-desktop-automation	2	Python
2310	holgern/pykokoro A Python library for Kokoro TTS (Text-to-Speech) using ONNX runtime.	35	Emerging	kokoro-tts-ecosystem	2	Python
2311	manhph2211/ML-Deployment Pushing Deep Learning models into production using torchserve, kubernetes...	35	Emerging	self-hosted-tts-servers	27	Python
2312	aria-music/zundacord Japanese Text-to-speech bot for Discord, powered by VOICEVOX	35	Emerging	discord-tts-bots	7	TypeScript
2313	Aculeasis/rhvoice-proxy High-level interface for RHVoice library	35	Emerging	espeak-ng-ecosystem	9	Python
2314	tuanio/noisy-student-training-asr Pytorch implementation of Noisy Student Training for Automatic Speech...	35	Emerging	speaker-diarization-embedding	99	Python
2315	efeslab/LiteASR [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with...	35	Emerging	automatic-speech-recognition	148	Python
2316	IS2AI/ISSAI_SAIDA_Kazakh_ASR the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus...	35	Emerging	speech-corpora-datasets	56	Shell
2317	mathigatti/RealTimeSingingSynthesizer Live Coding Singing Synthesizer. Python sinsy-NG wrapper.	35	Emerging	espeak-ng-ecosystem	61	C
2318	chaonan99/ppt_presenter Convert ppt to video with audio track, using text to speech synthesis	35	Emerging	pdf-to-audio-conversion	69	Python
2319	LlmKira/VitsServer 🌻 VITS ONNX TTS server designed for fast inference 🔥	35	Emerging	vits-tts-implementations	131	Python
2320	andriyadi/Maix-SpeechRecognizer Speech Recognition or Wake Word detection demo, developed using Maixduino...	35	Emerging	wake-word-detection	52	Objective-C
2321	rishikksh20/AudioMAE-pytorch Unofficial PyTorch implementation of Masked Autoencoders that Listen	35	Emerging	tacotron-tts-models	71	Python
2322	thotnd173389/SpeechCommand The project aims to use keyword spotting streaming in a real-time offline...	35	Emerging	wake-word-detection	5	Python
2323	fcakyon/pywhisper openai/whisper + extra features	35	Emerging	whisper-transcription-apps	89	Python
2324	h4rm0n1c/NetTTS A Retro-modern SAPI 4.0 TTS Client with Network Connectivity and custom...	35	Emerging	dotnet-tts-libraries	9	C++
2325	kwea123/Unity_live_caption Use Google Speech-to-Text API to do real-time live stream caption on Unity!...	35	Emerging	live-caption-generation	36	Python
2326	18F/tts-buy-cloudgov-vulnerability-scanner Solicitation and acquisition documents created for the cloud.gov...	35	Emerging	government-procurement-docs	4	—
2327	sinProject-Inc/talk Listening and Speaking	35	Emerging	ai-tutoring-platforms	3	TypeScript
2328	Ikaros-521/FunASR_WS 基于FunASR官方Demo修改的WS服务端，配合FastAPI提供HTTP服务，可以在浏览器中进行实时ASR测试	35	Emerging	funasr-speech-recognition	48	JavaScript
2329	Lqm1/openai-workers-ai A Cloudflare Workers-based, OpenAI-compatible API project that provides...	35	Emerging	speech-to-text-converters	6	TypeScript
2330	ryanlintott/OEVoice Old English text-to-speech using AVSpeechSynthesis and IPA pronunciations.	35	Emerging	ios-speech-frameworks	26	Swift
2331	sooftware/End-to-End-Speech-Recognition-Models PyTorch implementation of automatic speech recognition models.	35	Emerging	end-to-end-asr-frameworks	38	Python
2332	jashutch/zeddal Turn your voice into intelligent, linked notes inside Obsidian	35	Emerging	personal-knowledge-management	4	JavaScript
2333	GravityPoet/ChordVox Your voice is the fastest keyboard. Local AI voice input — speak, AI polish,...	35	Emerging	audio-transcription-tools	44	TypeScript
2334	litagin02/vits-japros-webui 日本語TTS（VITS）の学習と音声合成のGradio WebUI	35	Emerging	vits-tts-implementations	42	Python
2335	rollingstarky/Python-Voice-Assistant A Python based Voice Assistant like Siri	35	Emerging	general-purpose-voice-assistants	43	Python
2336	cosmoquester/speech-recognition Develop speech recognition models with Tensorflow 2	35	Emerging	keyword-speech-recognition	8	Python
2337	pinch-eng/pinch-python-sdk Real-time voice translation SDK	35	Emerging	voice-ai-sdks	6	Python
2338	tjunttila/pdf2video A tool for making videos from PDF presentations.	35	Emerging	pdf-to-audio-conversion	36	Python
2339	m15-ai/Local-Voice A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local...	35	Emerging	local-voice-assistants	10	Python
2340	simonesiega-academics/culinary-ai-assistant AI-powered culinary assistant that stores structured data in a tabular...	35	Emerging	voice-assistant-applications	2	Python
2341	emiliioaguirre/youtube-live-tts Real-time YouTube Live Chat Text-to-Speech (TTS) using ElevenLabs AI voices	35	Emerging	elevenlabs-integrations	42	TypeScript
2342	IOriens/whisper-video Generate subtitles for all the videos in a folder with OpenAI's Whisper...	35	Emerging	audio-transcription-tools	35	Python
2343	jaganadhg/nemoexamples Experiments with NVIDIA NeMo	35	Emerging	funasr-speech-recognition	3	Python
2344	Ananya-0306/Jarvis-desktop-assistant This is the New Jarvis AI Project it will do some functionality followed by...	35	Emerging	voice-assistant-projects	8	Python
2345	robotology/natural-speech This repository contains a codebase to build automatic speech recognition...	35	Emerging	automatic-speech-recognition	6	C
2346	LEMAS-Project/LEMAS-TTS LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10...	35	Emerging	tts-model-finetuning	92	Python
2347	EtienneAb3d/WhisperTimeSync Synchronize Whisper's timestamps over an existing accurate transcription	35	Emerging	whisper-subtitle-generation	163	Java
2348	elbruno/ElBruno.QwenTTS Qwen3-TTS ONNX export pipeline + C# .NET 10 console app for local voice generation	35	Emerging	tts	15	C#
2349	DivineUX23/Audio-to-Audio-translation Imagine translating your speech or anybody's speech to any language you want...	35	Emerging	audio-transcription-tools	50	Python
2350	matlab-deep-learning/deepspeech This repo provides the pretrained DeepSpeech model in MATLAB. The model is...	35	Emerging	speaker-diarization-embedding	7	MATLAB
2351	cadia-lvl/WebRICE WebRICE (Web Reader ICE) is an open source web reader in development at...	35	Emerging	ai-powered-ereaders	5	TypeScript
2352	Mokkapps/parents-soundboard A soundboard developed for parents to be able to play often needed phrases like "No"	35	Emerging	go-tts-libraries	7	JavaScript
2353	jlia0/RealityTalk RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling	35	Emerging	ai-tutoring-platforms	6	JavaScript
2354	Sukumar9944/Speech-to-Text-with-ChatGPT This Python application combines speech recognition with the power of...	35	Emerging	voice-chatgpt-interfaces	7	Python
2355	speechly/react-example-repo-filtering An example app for filtering data with Speechly and React	35	Emerging	react-speech-recognition	14	TypeScript
2356	hkdb/offline-tts A Chrome extension that reads web pages and PDFs aloud using Supertonic's...	35	Emerging	browser-tts-extensions	4	JavaScript
2357	Zuellni/LLaSA-WebUI LLaSA WebUI using ExLlamaV2 and FastAPI.	35	Emerging	gradio-tts-webuis	28	Python
2358	xuchennlp/S2T The project for speech translation	35	Emerging	speech-recognition-apis	12	Python
2359	i-bardinov/Godot-Android-Text-to-Speech Godot Android Text to Speech plugin for Godot Engine 3.4 or higher	35	Emerging	android-speech-apps	13	Java
2360	ARK018/multi-voice-sdk A universal Text-to-Speech (TTS) SDK . Easily generate and manage audio...	35	Emerging	google-tts-libraries	1	JavaScript
2361	18F/tts-buy-code-review Solicitation documents for the code review procurement being undertaken by TTS.	35	Emerging	government-procurement-docs	4	—
2362	WelkinYang/Learn2Sing2.0 Diffusion and Mutual Information-Based Target Speaker SVS by Learning from...	35	Emerging	zero-shot-voice-synthesis	181	JavaScript
2363	StanGirard/speechdigest Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit	35	Emerging	audio-transcription-tools	62	Python
2364	opencog/TinyCog Small Robot, Toy Robot platform	35	Emerging	voice-controlled-robotics	34	C++
2365	nearkyh/AWS-Polly How to use Amazon Polly TTS(Text To Speech)	35	Emerging	aws-polly-tts	10	Python
2366	FS-17/SpeechDataBuilder Browser-based open-source tool for creating high-quality TTS/STT datasets....	35	Emerging	tts-dataset-creation	6	JavaScript
2367	LianjiaTech/bella-whisper bella-whisper是一系列基于OpenAI...	35	Emerging	whisper-fine-tuning	3	Python
2368	seven-io/go-client Official Go API Client for seven.io	35	Emerging	sms-voice-integrations	6	Go
2369	DarmorGamz/Youtube-Shorts-Generator Harness OpenAI's power to effortlessly create YouTube Shorts with this...	35	Emerging	ai-video-generation	34	Python
2370	alexykn/TorchTS A modern text to speech frontend for Kokoro-82M	35	Emerging	kokoro-tts-ecosystem	6	JavaScript
2371	stensmir/mimir Offline voice-to-text for macOS. No cloud, no tracking.	35	Emerging	local-voice-dictation	6	—
2372	indigane/wyoming-android-tts Use your Android device's TTS engines in Home Assistant via the Wyoming protocol.	35	Emerging	lightweight-tts-runtimes	7	Kotlin
2373	Garden-Tree/yomi-KAI yomi-KAIはDiscordのテキストチャンネルに送られた文章をボイスチャンネルで読み上げるbotです。	35	Emerging	discord-tts-bots	11	Python
2374	WindQAQ/tensorflow-wavenet Implementation of WaveNet network based on Tensorflow.	35	Emerging	neural-vocoder-implementations	9	Python
2375	SingAvi/SpeechToText Simple python script to convert live speech or any audio file to text using...	35	Emerging	speech-recognition-apis	6	Python
2376	VirtualZer0/StreamTalkerClient Cross-platform desktop app that reads Twitch and VK Play chat aloud using AI...	35	Emerging	dotnet-tts-libraries	2	C#
2377	bobokick/Microsoft-Speech-API_Guide 微软的语音引擎SAPI的使用及API描述	34	Emerging	dotnet-tts-libraries	6	—
2378	lucidprogrammer/youtube-vision-transcriber AI-powered pipeline that converts YouTube videos into polished articles...	34	Emerging	youtube-video-intelligence	1	Python
2379	ElishaAz/mau_local_stt A Maubot to transcribe audio messages using local open-source libraries	34	Emerging	speech-to-text-converters	3	Python
2380	yuryleb/garmin-russian-tts-voices Дополнения и исправления для русских TTS-голосов из навигаторов Garmin	34	Emerging	dotnet-tts-libraries	13	C#
2381	mhshajib/avro-phonetic-go Avro-style Banglish → বাংলা transliteration engine for Go, using trie-based...	34	Emerging	go-nlp-libraries	4	Go
2382	JohannLai/audio-to-text Convert audio to text and summary just need to input the audio link.	34	Emerging	audio-transcription-tools	9	Shell
2383	leokwsw/OpenAI-TTS-Gradio Use OpenAI TTS(Text to Speech) API with Gradio	34	Emerging	gradio-tts-webuis	59	Python
2384	taeyoun811/Whisfusion Whisfusion: Parallel ASR Decoding via a Diffusion Transformer	34	Emerging	funasr-speech-recognition	22	Python
2385	danielga/gmcl_speech A module for Garry's Mod that provides speech recognition interfaces to developers.	34	Emerging	dotnet-tts-libraries	4	C++
2386	voice-cloning-app/Voice-API API template for deploying tacotron2 voices	34	Emerging	tacotron-tts-models	3	Python
2387	daveshap/keras_asr ASR experiment using Google's Universal Sentence Encoder	34	Emerging	end-to-end-asr-frameworks	9	Jupyter Notebook
2388	Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022 Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and...	34	Emerging	automatic-speech-recognition	69	Python
2389	tabahi/Mel-Spectrum-Analyzer Online web based mel-spectrum, power spectrum, FFT analyzer for speech and...	34	Emerging	web-speech-api-libraries	12	JavaScript
2390	RoyNkem/SwiftUI-AI-Voice-Assistant A multi-platform app for voice-based interactions built using SwiftUI with...	34	Emerging	ios-speech-frameworks	32	Swift
2391	sureshnswamy/tamil-text2voice Text to speech tool for Tamil language	34	Emerging	lightweight-tts-libraries	7	Shell
2392	hari-huynh/viVQA-voice-assistant Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned &...	34	Emerging	local-voice-assistants	4	Python
2393	msalhab96/SpeeQ A framework for automatic speech recognition	34	Emerging	keyword-speech-recognition	51	Python
2394	GreenSheep01201/claw-voice-chat Push-to-talk voice chat interface for OpenClaw channels	34	Emerging	openclaw-voice-assistants	9	TypeScript
2395	uysalemre/Voice-Mail Python, Django, Text to Speech, Speech to Text, AJAX, Gmail API, Email...	34	Emerging	web-based-tts-apps	6	Python
2396	adasegroup/OSM-one-shot-multispeaker Framework for one-shot multispeaker system based on Deep Learning	34	Emerging	fastspeech-tts-models	19	Python
2397	T-vK/Termux-DeepSpeech Open source offline speech recognition for Android using Mozilla's...	34	Emerging	java-tts-libraries	85	Shell
2398	ttuleyb/TortoiseTTS-GUI GradioUI for TortoiseTTS voice generation	34	Emerging	gradio-tts-webuis	33	Python
2399	ekleziast/kiwi-voice Voice interface for OpenClaw with speaker recognition, voice-gated security,...	34	Emerging	openclaw-voice-assistants	7	Python
2400	rishikksh20/TalkNet2-pytorch TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for...	34	Emerging	tacotron-tts-models	89	Python

« Prev 1 2 3 … 22 23 24 25 26 … 80 81 82 Next »