All Voice AI Tools

8,165 tools ranked by quality score · Page 6 of 82

Showing 501–600 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
501	AlexandreSajus/JARVIS Your own personal voice assistant: Voice to Text to LLM to Speech, displayed...	50	Established	tts	515	Python
502	keshavbhatt/glate Open Source Google Translator and TTS App for Linux Desktop	50	Established	speech-translation-apps	82	C++
503	sveinbjornt/hear Command line interface for the built-in speech recognition and transcription...	50	Established	local-voice-dictation	645	Objective-C
504	yl4579/StarGANv2-VC StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for...	50	Established	text-to-speech-frameworks	518	Python
505	goodatlas/zeroth Kaldi-based Korean ASR (한국어 음성인식) open-source project	50	Established	kaldi-asr-ecosystem	358	Shell
506	amanvirparhar/chaplin A real-time silent speech recognition tool.	50	Established	audio-transcription-tools	714	Python
507	zzw922cn/Automatic_Speech_Recognition End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow	50	Established	speaker-diarization-embedding	2,839	Python
508	VRCWizard/TTS-Voice-Wizard Speech to Text to Speech. Song now playing. Sends text as OSC messages to...	50	Established	dotnet-tts-libraries	778	C#
509	Finrandojin/alexandria-audiobook AI-powered multi-voice audiobook generator — LLM script annotation, voice...	50	Established	ebook-to-audiobook-conversion	371	Python
510	Azure-Samples/Cognitive-Services-Voice-Assistant Welcome to the Microsoft Voice Assistant samples repository! Here you will...	50	Established	dotnet-tts-libraries	123	C++
511	moeru-ai/unspeech 🗣️🔊 Your Text-to-Speech Services, All-in-One.	50	Established	elevenlabs-integrations	85	Go
512	svc-develop-team/so-vits-svc SoftVC VITS Singing Voice Conversion	50	Established	text-to-speech-frameworks	28,008	Python
513	gustavostz/whisper-clip WhisperClip simplifies your life by automatically transcribing audio...	50	Established	speech-to-text-converters	137	Python
514	deepgram-starters/flask-transcription Get started using Deepgram's Pre-Recorded Transcription with this Flask demo app	50	Established	deepgram-starter-projects	17	Python
515	NaomiProject/Naomi The Naomi Project is an open source, technology agnostic platform for...	50	Established	python-voice-assistants	292	Python
516	SamirPaulb/real-time-voice-translator A desktop application that uses AI to translate voice between languages in...	50	Established	audio-transcription-apps	396	Tcl
517	travisvn/openai-edge-tts Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or...	50	Established	voice-assistant-devices	1,677	Python
518	XnneHangLab/XnneHangLab 不会聊天的字幕提取器不是一个好 B 站下载器~	50	Established	meeting-transcription-summarizers	92	Python
519	davidmartinrius/speech-dataset-generator 🔊 Create labeled datasets, enhance audio quality, identify speakers, support...	50	Established	speech-corpora-datasets	257	Python
520	ekwek1/soprano-factory Soprano-Factory: Train your own 2000x realtime text-to-speech model	50	Established	tts-model-finetuning	212	Python
521	FunAudioLLM/Fun-ASR Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.	50	Established	automatic-speech-recognition	946	Python
522	sergenes/runandread-audiobook 🚀 Open-source project for creating high-quality AI TTS-narrated audiobooks...	49	Emerging	ebook-to-audiobook-conversion	57	Python
523	Lex-au/Vocalis Speech-to-speech AI assistant with natural conversation flow, mid-speech...	49	Emerging	voice-assistant-applications	290	TypeScript
524	junzew/HanTTS Chinese Text-to-Speech web service	49	Emerging	lightweight-tts-runtimes	313	Python
525	PriesiaMioShirakana/DragonianVoice 多个SVC/TTS的C++推理库	49	Emerging	lightweight-tts-runtimes	1,121	C
526	tugstugi/pytorch-dc-tts Text to Speech with PyTorch (English and Mongolian)	49	Emerging	text-to-speech-frameworks	187	Jupyter Notebook
527	NevilPatel01/RVC-WebUI-MacOS Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...	49	Emerging	text-to-speech-frameworks	31	Python
528	DragonComputer/Dragonfire the open-source virtual assistant for Ubuntu based Linux distributions	49	Emerging	voice-assistant-applications	1,404	Python
529	dessa-oss/fake-voice-detection Using temporal convolution to detect Audio Deepfakes	49	Emerging	deepfake-detection-systems	383	Python
530	dhruvapte26/B.E.N.J.I. B.E.N.J.I.- The Impossible Missions Force's digital assistant	49	Emerging	python-voice-assistants	89	Python
531	techiaith/pyfestival Amlapiwr Python C ar gyfer hwyluso rhaglennu gyda Festival \| A Python C...	49	Emerging	lightweight-tts-runtimes	10	Python
532	p0p4k/vits2_pytorch unofficial vits2-TTS implementation in pytorch	49	Emerging	text-to-speech-frameworks	547	Python
533	OpenVoiceOS/ovos-buildroot Open Voice Operating System - Buildroot edition is a minimalistic linux OS...	49	Emerging	multi-agent-orchestration	279	Python
534	gionanide/Speech_Signal_Processing_and_Classification Front-end speech processing aims at extracting proper features from short-...	49	Emerging	text-emotion-recognition	257	Python
535	botbahlul/PyAutoSRT PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free...	49	Emerging	whisper-subtitle-generation	188	Python
536	arghyasur1991/Spark-TTS-Unity Unity package for using Spark-TTS on-device models. This is a C# port of...	49	Emerging	unity-ml-inference	30	C#
537	juntaosun/ComeCut 「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用，功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor....	49	Emerging	video-dubbing-tools	485	Batchfile
538	createcandle/voco Privacy friendly voice control for the Candle Controller / WebThings...	49	Emerging	voice-assistant-frameworks	29	Python
539	nitaiaharoni1/whisper-speech-to-text Whisper Speech-to-Text is a JavaScript library for recording and...	49	Emerging	speech-to-text-converters	33	TypeScript
540	Poeschl/Hassio-Addons The repository for my Home Assistant Supervisor Add-ons.	49	Emerging	home-assistant-tts	326	Dockerfile
541	Artrajz/vits-simple-api A simple VITS HTTP API, developed by extending Moegoe with additional features.	49	Emerging	vits-tts-implementations	1,045	Python
542	myshell-ai/OpenVoice Instant voice cloning by MIT and MyShell. Audio foundation model.	49	Emerging	voice-cloning-tools	36,111	Python
543	CodersCreative/natural-tts A rust crate for easily implementing Text-To-Speech into your rust programs.	49	Emerging	rust-tts-libraries	24	Rust
544	vasistalodagala/whisper-finetune Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR)...	49	Emerging	whisper-speech-transcription	361	Python
545	speechmatics/speechmatics-python-sdk Python SDKs for Speechmatics APIs	49	Emerging	voice-ai-sdks	17	Python
546	rishikksh20/iSTFTNet-pytorch iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating...	49	Emerging	neural-vocoder-implementations	274	Python
547	metavoiceio/metavoice-src Foundational model for human-like, expressive TTS	49	Emerging	text-to-speech-frameworks	4,201	Python
548	lixiangyu890601/EasyAICC-Easy-AI-Call-Center 外呼系统，智能外呼，自动外呼系统，人工外呼，呼叫中心	49	Emerging	voice-agent-applications	11	Java
549	OpenBMB/UltraEval-Audio Your faithful, impartial partner for audio evaluation — know yourself, know...	49	Emerging	asr-evaluation-metrics	281	Python
550	C-Loftus/QuickPiperAudiobook With one command, create a natural-sounding audiobook from a variety of...	49	Emerging	ebook-to-audiobook-conversion	1,038	Go
551	thuhcsi/Crystal Crystal - C++ implementation of a unified framework for multilingual TTS...	49	Emerging	cross-platform-tts-frameworks	229	C++
552	snakers4/silero-stress Silero Stress — pre-trained enterprise-grade automated stress and homograph...	49	Emerging	gradio-tts-webuis	125	Python
553	JJWRoeloffs/transcribe_align_textgrid A small wrapper package around whisper-timestamped. Create force-aligned...	49	Emerging	video-transcription-extraction	18	Python
554	ARBML/klaam Arabic speech recognition, classification and text-to-speech.	49	Emerging	kaldi-asr-ecosystem	424	Jupyter Notebook
555	artibex/piper-http Creates a docker image that runs the piper http service	49	Emerging	piper-tts-ecosystem	18	Python
556	nullabork/talkbot Text-to-speech and translation bot for Discord	49	Emerging	discord-tts-bots	31	JavaScript
557	robmsmt/KerasDeepSpeech A Keras CTC implementation of Baidu's DeepSpeech for model experimentation	49	Emerging	ctc-asr-implementations	243	Python
558	drankush/VoxRad VOXRAD is a voice transcription application for radiologists leveraging...	49	Emerging	audio-transcription-tools	27	Python
559	zzw922cn/awesome-speech-recognition-speech-synthesis-papers Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis,...	49	Emerging	speech-synthesis-diffusion	3,119	—
560	Steve0929/tiktok-tts Provides a simple way to generate text-to-speech audio files using TikTok's...	49	Emerging	telegram-voice-transcription	105	JavaScript
561	Audio-WestlakeU/VINP Official PyTorch implementation of 'VINP: Variational Bayesian Inference...	49	Emerging	end-to-end-asr-frameworks	31	Python
562	deepgram/deepgram-go-sdk Official Go SDK for Deepgram.	49	Emerging	go-tts-libraries	78	Go
563	rakeshvar/rnn_ctc Recurrent Neural Network and Long Short Term Memory (LSTM) with...	49	Emerging	ctc-asr-implementations	221	Python
564	google/tacotron Audio samples accompanying publications related to Tacotron, an end-to-end...	49	Emerging	text-to-speech-frameworks	539	HTML
565	litagin02/rvc-tts-webui Text-to-Speech Gradio webui using RVC and edge-tts	49	Emerging	self-hosted-tts-servers	336	Python
566	SlapBot/stephanie-va Stephanie is an open-source platform built specifically for voice-controlled...	49	Emerging	general-purpose-voice-assistants	798	Python
567	nvidia-riva/common Protocol buffers and other common resources.	49	Emerging	voice-ai-sdks	13	Starlark
568	ceuk/speech-recognition-aws-polyfill Polyfill for the SpeechRecognition browser API using AWS Transcribe as a fallback	49	Emerging	web-speech-api-libraries	13	TypeScript
569	iMicknl/azure-podcast-generator Generate an engaging podcast based on your document using Azure OpenAI and...	49	Emerging	content-to-podcast-converters	42	Python
570	santi-pdp/pase Problem Agnostic Speech Encoder	49	Emerging	speaker-diarization-embedding	447	Python
571	NeonGeckoCom/neon-tts-plugin-coqui Coqui AI TTS plugin	49	Emerging	coqui-tts-applications	85	Python
572	Picovoice/leopard On-device speech-to-text engine powered by deep learning	49	Emerging	funasr-speech-recognition	474	Python
573	woheller69/whisperIME Android Input Method Editor (IME) based on Whisper	49	Emerging	whisper-framework-ports	543	Java
574	seungwonpark/melgan MelGAN vocoder (compatible with NVIDIA/tacotron2)	49	Emerging	neural-vocoder-implementations	650	Python
575	stimm-ai/stimm The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI...	49	Emerging	voice-assistant-frameworks	40	Python
576	belambert/asr-evaluation Python module for evaluating ASR hypotheses (e.g. word error rate, word...	49	Emerging	asr-evaluation-metrics	283	Python
577	modal-labs/quillman A voice chat app	49	Emerging	voice-agent-applications	1,198	Python
578	mozilla/DeepSpeech DeepSpeech is an open source embedded (offline, on-device) speech-to-text...	49	Emerging	wake-word-detection	26,741	C++
579	pedroetb/tts-api Text to speech REST API for multiple TTS engines	49	Emerging	self-hosted-tts-servers	34	JavaScript
580	hetpandya/youtube_tts_data_generator A python library to generate speech dataset from Youtube videos	49	Emerging	tts-dataset-creation	37	Python
581	eheikes/tts Tools to convert text to speech :books::speech_balloon:	49	Emerging	aws-polly-tts	93	JavaScript
582	voice-cloning-app/Voice-Cloning-App A Python/Pytorch app for easily synthesising human voices	49	Emerging	voice-cloning-synthesis	1,443	Python
583	thevickypedia/py3-tts Offline Text To Speech library for python	49	Emerging	lightweight-tts-libraries	30	Python
584	davidamacey/OpenTranscribe Self-hosted AI-powered transcription platform with speaker diarization,...	49	Emerging	audio-transcription-apps	32	Python
585	jim-schwoebel/voicebook 🗣️ A book and repo to get you started programming voice computing...	49	Emerging	audio-transcription-apps	388	Python
586	savbell/whisper-writer 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.	49	Emerging	speech-to-text-converters	1,021	Python
587	ddPn08/rvc-webui liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project	49	Emerging	voice-cloning-tools	519	Python
588	opendilab/CleanS2S High-quality and streaming Speech-to-Speech interactive agent in a single...	49	Emerging	text-to-speech-conversion	499	Python
589	ActiveNick/HoloBot HoloBot is a reusable 3D interface that allows HoloLens & VR users to...	48	Emerging	voice-command-assistants	124	C#
590	keonlee9420/STYLER Official repository of STYLER: Style Factor Modeling with Rapidity and...	48	Emerging	fastspeech-tts-models	160	Python
591	lucoiso/UEAzSpeech This plugin integrates Azure Speech Cognitive Services in Unreal Engine.	48	Emerging	dotnet-tts-libraries	215	C++
592	liangstein/Chinese-speech-to-text Chinese Speech To Text Using Wavenet	48	Emerging	wav2vec2-asr-models	163	Python
593	avinashvarna/sanskrit_tts Sanskrit text to speech	48	Emerging	lightweight-tts-libraries	33	Python
594	advanced-media-inc/amivoice-api-client-library AmiVoice API Client Library and the sample programs	48	Emerging	web-speech-api-libraries	15	JavaScript
595	travisvn/edge-tts-client Client-side (web browser) implementation of Edge TTS package — Microsoft...	48	Emerging	edge-tts-implementations	22	TypeScript
596	albirrkarim/react-speech-highlight-demo React / Vanilla JS Text to Speech with highlighting the words and sentences...	48	Emerging	gemini-prompt-workbenches	186	JavaScript
597	ModelTC/LightTTS LightTTS is a lightweight TTS inference framework optimized for CosyVoice2...	48	Emerging	coqui-tts-applications	31	Python
598	zlargon/google-tts Google TTS (Text-To-Speech) for node.js	48	Emerging	google-tts-libraries	286	JavaScript
599	enhuiz/vall-e An unofficial PyTorch implementation of the audio LM VALL-E	48	Emerging	tacotron-tts-models	2,992	Python
600	Aivis-Project/AIVM-Generator Aivis Voice Model File (.aivm/.aivmx) Generator / Editor	48	Emerging	openai-tts-applications	15	Vue

« Prev 1 2 3 4 5 6 7 8 … 80 81 82 Next »