All Voice AI Tools

8,165 tools ranked by quality score · Page 3 of 82

Showing 201–300 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
201	jianchang512/stt Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具，输出json、srt字幕、纯文字格式	56	Established	real-time-voice-translation	4,331	Python
202	Migushthe2nd/MsEdgeTTS A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud...	56	Established	edge-tts-implementations	325	TypeScript
203	MatteoFasulo/Whisper-TikTok From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read...	56	Established	video-transcription-extraction	318	Python
204	vox-serve/vox-serve A Streaming-Native Serving Engine for TTS/STS Models	56	Established	text-to-speech-conversion	59	Python
205	aahl/zai-tts 🗣️ ZAI/GLM TTS to OpenAI Speech API, 免费的语音合成API，支持克隆音色，基于智谱TTS	56	Established	openai-tts-applications	158	Python
206	Femoon/tts-azure-web TTS Azure Web 是一个 Azure 文本转语音（TTS）网页应用，可以在本地或者云端使用你的 Azure Key 一键部署。TTS...	56	Established	dotnet-tts-libraries	479	TypeScript
207	RVC-Boss/GPT-SoVITS 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)	56	Established	vits-tts-implementations	55,896	Python
208	ahmetoner/whisper-asr-webservice OpenAI Whisper ASR Webservice API	56	Established	speech-to-text-converters	3,202	Python
209	rwth-i6/rasr The RWTH ASR Toolkit.	56	Established	automatic-speech-recognition	58	C++
210	MahmoudAshraf97/whisper-diarization Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper	56	Established	whisper-diarization	5,437	Jupyter Notebook
211	AbdullahHendy/live-translation Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM...	56	Established	speech-to-text-transcription	13	Python
212	ThioJoe/Auto-Synced-Translated-Dubs Automatically translates the text of a video based on a subtitle file, and...	56	Established	video-dubbing-tools	1,715	Python
213	yuga-hashimoto/openclaw-assistant OpenClaw voice assistant app for Android - Wake word activation & system...	56	Established	openclaw-voice-assistants	196	Kotlin
214	namastexlabs/murmurai 🎙️ Drop-in replacement for paid transcription APIs. Self-hosted,...	56	Established	speech-to-text-converters	39	Python
215	lobehub/lobe-tts 🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser	56	Established	edge-tts-implementations	779	TypeScript
216	GitYCC/g2pW Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)	56	Established	grapheme-to-phoneme-conversion	382	Python
217	xinjli/allosaurus Allosaurus is a pretrained universal phone recognizer for more than 2000 languages	56	Established	end-to-end-asr-frameworks	715	Python
218	marty1885/paroli Streaming TTS based on Piper with optional RK3588 NPU support	56	Established	piper-tts-ecosystem	123	C++
219	alphacep/vosk-unity-asr Automatic Speech Recognition in Unity using Vosk library	56	Established	dotnet-tts-libraries	118	C#
220	haoheliu/voicefixer General Speech Restoration	56	Established	automatic-speech-recognition	1,302	Python
221	Stypox/dicio-android Dicio assistant app for Android	56	Established	android-voice-assistants	1,295	Kotlin
222	justinsalamon/scaper A library for soundscape synthesis and augmentation	56	Established	audio-source-separation	414	Python
223	SahilAggarwal2004/react-text-to-speech An easy-to-use React.js library that leverages the Web Speech API to convert...	56	Established	vue-speech-recognition	81	TypeScript
224	bshall/Tacotron A PyTorch implementation of Location-Relative Attention Mechanisms For...	55	Established	tacotron-tts-models	115	Python
225	sooftware/conformer [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented...	55	Established	conformer-asr-implementations	1,109	Python
226	RageAgainstThePixel/ElevenLabs-DotNet A Non-Official ElevenLabs RESTful API Client for dotnet	55	Established	elevenlabs-integrations	89	C#
227	dimonier/tg2obsidian This bot pulls new messages from a Telegram chat or group and puts them into...	55	Established	telegram-voice-transcription	144	Python
228	antirek/voicer AGI-server voice recognizer for #Asterisk	55	Established	web-speech-api-libraries	101	JavaScript
229	peteonrails/voxtype Voice-to-text with push-to-talk for Wayland compositors	55	Established	voice-dictation-typing	510	Rust
230	sccn/eegprep EEGPrep is an automated preprocessing tool for human EEG data built on a...	55	Established	automatic-speech-recognition	19	Jupyter Notebook
231	astorfi/speechpy :speech_balloon: SpeechPy - A Library for Speech Processing and Recognition:...	55	Established	automatic-speech-recognition	886	Python
232	dputhier/pygtftk A python package and a set of shell commands to handle GTF files	55	Established	lightweight-tts-libraries	51	Python
233	deepgram/deepgram-dotnet-sdk Official .NET SDK for Deepgram.	55	Established	deepgram-starter-projects	51	C#
234	arcosoph/nanowakeword A lightweight, open-source, and intelligent wake word detection engine....	55	Established	wake-word-detection	48	Python
235	karashiiro/TextToTalk Chat TTS plugin for Dalamud. Has support for triggers/exclusions, several...	55	Established	dotnet-tts-libraries	68	C#
236	readbeyond/aeneas aeneas is a Python/C library and a set of tools to automagically synchronize...	55	Established	asr-evaluation-metrics	2,811	Python
237	innovatorved/whisper.api This project provides an API with user level access support to transcribe...	55	Established	speech-to-text-converters	914	Python
238	deepgram/deepgram-rust-sdk Community Rust SDK for Deepgram.	55	Established	deepgram-starter-projects	65	Rust
239	AlexxIT/YandexStation Управление Яндекс.Станцией и другими устройствами умного дома с Алисой из...	55	Established	yandex-speechkit-tools	1,807	Python
240	JackismyShephard/ultimate-rvc An app for creating audio-based content such as song covers and speech using...	55	Established	voice-cloning-tools	264	Python
241	High-Logic/Genie-TTS GPT-SoVITS ONNX Inference Engine & Model Converter	55	Established	vits-tts-implementations	1,433	Python
242	krillinai/KrillinAI Video translation and dubbing tool powered by LLMs. The video translator...	55	Established	video-dubbing-tools	9,724	Go
243	flashlight/wav2letter Facebook AI Research's Automatic Speech Recognition Toolkit	55	Established	speaker-diarization-embedding	6,446	C++
244	FireRedTeam/FireRedASR Open-source industrial-grade ASR models supporting Mandarin, Chinese...	55	Established	audio-transcription-tools	1,796	Python
245	machinelearningZH/audio-transcription Transcribe any audio or video file. Edit and view your transcripts in a...	55	Established	whisper-transcription-apps	94	Python
246	OpenMOSS/MOSS-TTS MOSS‑TTS Family is an open‑source speech and sound generation model family...	55	Established	voice-assistant-devices	922	Python
247	remsky/Kokoro-FastAPI Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX...	55	Established	kokoro-tts-ecosystem	4,585	Python
248	Saurav-Paul/AI-virtual-assistant-python Command line virtual assistant for competitive programming	55	Established	general-purpose-voice-assistants	118	Python
249	Lyrcaxis/KokoroSharp Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker,...	55	Established	kokoro-tts-ecosystem	207	C#
250	wannaphong/ttsmms TTS with The Massively Multilingual Speech (MMS) project	54	Established	lightweight-tts-libraries	235	Python
251	hugobloem/wyoming-microsoft-tts Wyoming protocol server for Microsoft Azure text-to-speech	54	Established	lightweight-tts-runtimes	25	Python
252	Aivis-Project/AivisSpeech-Engine AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine	54	Established	self-hosted-tts-servers	150	Python
253	TrevorS/voxtral-mini-realtime-rs Streaming speech recognition running natively and in the browser. A pure...	54	Established	rust-speech-recognition	710	Rust
254	linto-ai/linto-stt An automatic speech recognition API	54	Established	whisper-diarization	81	Python
255	swlegion/tts Table Top Simulator Mod for Star Wars: Legion	54	Established	dotnet-tts-libraries	48	Lua
256	mbsantiago/whombat Audio Annotation Tool for ML development	54	Established	data-annotation-tools	86	TypeScript
257	codename0og/codename-rvc-fork-4 Codename's rvc fork version 4, based on Applio.	54	Established	voice-cloning-tools	41	Python
258	double22a/speech_dataset The dataset of Speech Recognition	54	Established	speech-recognition-datasets	453	—
259	ttop32/MouseTooltipTranslator Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator,...	54	Established	live-meeting-translation	1,140	JavaScript
260	mlalma/kokoro-ios Kokoro TTS for iOS and macOSX	54	Established	text-to-speech-tts	209	Swift
261	MattyB95/Jabberjay 🦜 Synthetic Voice Detection	54	Established	wav2vec2-speech-recognition	5	Python
262	Aivis-Project/aivmlib Aivis Voice Model File (.aivm/.aivmx) Utility Library	54	Established	openai-tts-applications	25	Python
263	DevEmperor/Dictate A powerful Whisper AI keyboard for reliable speech transcription	54	Established	audio-transcription-tools	183	Java
264	hs-CN/msedge-tts This library is a wrapper of MSEdge Read aloud function API. You can use it...	54	Established	edge-tts-implementations	19	Rust
265	VolcanicArts/VRCOSC A modular node-programming language, program creator, animation system,...	54	Established	dotnet-tts-libraries	502	C#
266	evancohen/sonus :speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword...	54	Established	web-speech-api-libraries	636	JavaScript
267	stepfun-ai/Step-Audio-EditX A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model...	54	Established	zero-shot-voice-synthesis	884	Python
268	shivammehta25/Neural-HMM Neural HMMs are all you need (for high-quality attention-free TTS)	54	Established	text-to-speech-frameworks	164	Jupyter Notebook
269	jtCodes/lyrictor Browser-based lyric video editor built for complex timelines with hundreds...	54	Established	text-to-video-generation	52	TypeScript
270	Blaizzy/mlx-audio-swift A modular Swift SDK for audio processing with MLX on Apple Silicon	54	Established	ios-speech-frameworks	446	Swift
271	mgonzs13/whisper_ros Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2	54	Established	whisper-framework-ports	91	C++
272	ArkanDash/Advanced-RVC-Inference Advanced RVC Inference for quicker and effortless model downloads	54	Established	voice-cloning-tools	68	Python
273	stemrollerapp/stemroller Isolate vocals, drums, bass, and other instrumental stems from any song	54	Established	audio-source-separation	3,052	Svelte
274	lucasnewman/f5-tts-mlx Implementation of F5-TTS in MLX	54	Established	zero-shot-voice-synthesis	611	Python
275	ynop/audiomate Python library for handling audio datasets.	54	Established	speech-corpora-datasets	138	Python
276	HumeAI/hume-typescript-sdk Add Hume AI to any TypeScript project	54	Established	web-speech-api-libraries	75	TypeScript
277	Oaklight/asr2clip handy cli tool to convert your speech to clipboard text	54	Established	speech-to-text-converters	15	Python
278	mateogon/pdf-narrator Convert your PDFs and EPUBs into audiobooks effortlessly. Features...	54	Established	ebook-to-audiobook-conversion	167	Python
279	met4citizen/HeadTTS HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for...	54	Established	kokoro-tts-ecosystem	112	JavaScript
280	jpreprocess/jpreprocess Japanese text preprocessor for Text-to-Speech applications (OpenJTalk...	54	Established	rust-tts-libraries	52	Rust
281	funnyzak/tts-now 跨平台基于云平台(阿里云、讯飞等)语音合成 API 的文字转语音助手。支持单文本快速合成和批量合成。支持windows、macOS、Linux。	54	Established	google-tts-libraries	317	TypeScript
282	netease-youdao/EmotiVoice EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine	54	Established	text-to-speech-frameworks	8,455	Python
283	Softcatala/open-dubbing Open dubbing is an AI dubbing system which uses machine learning models to...	54	Established	voice-cloning-synthesis	373	Python
284	LokerL/tts-vue 🎤 微软语音合成工具，使用 Electron + Vue + ElementPlus + Vite 构建。	54	Established	google-tts-libraries	6,099	TypeScript
285	EddyVerbruggen/nativescript-speech-recognition :speech_balloon: Speech to text, using the awesome engines readily available...	54	Established	web-speech-api-libraries	91	TypeScript
286	chinokikiss/GSV-TTS-Lite GSV-TTS-Lite A high-performance inference engine specifically designed for...	54	Established	vits-tts-implementations	57	Python
287	emnikhil/Sign-Language-To-Text-Conversion Sign Language to Text Conversion is a real-time system that uses a camera to...	53	Established	sign-language-recognition	348	Python
288	jpreprocess/jbonsai Voice synthesis library for Text-to-Speech applications (Currently HTS...	53	Established	rust-tts-libraries	13	Rust
289	Lex-au/Orpheus-FastAPI High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,...	53	Established	text-to-speech-conversion	673	Python
290	alphacep/vosk-server WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi...	53	Established	vosk-asr-implementations	1,240	Python
291	hgneng/ekho Chinese text-to-speech engine	53	Established	lightweight-tts-runtimes	1,202	Lex
292	thewh1teagle/pyannote-rs pyannote audio diarization in rust	53	Established	parakeet-asr-implementations	108	Rust
293	jianchang512/ChatTTS-ui 一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface...	53	Established	self-hosted-tts-servers	7,521	Python
294	Henry-23/VideoChat 实时交互数字人，可自定义形象与音色，支持音色克隆，对话延迟低至3s。Real-time voice interactive digital human,...	53	Established	ai-avatar-platforms	1,223	Python
295	drmfinlay/tts-util-app TTS Util — Text-to-speech utility Android app for synthesising text into...	53	Established	android-speech-apps	176	Kotlin
296	IhorShevchuk/piper-app The original Piper, now on iOS and macOS	53	Established	piper-tts-ecosystem	35	Swift
297	LibreSpark/LibreTTS TTS-文本转语音/文本转语音前端，兼容OpenAI、EdgeTTS等接口	53	Established	edge-tts-implementations	350	JavaScript
298	wxxxcxx/ms-ra-forwarder 免费的在线文本转语音API	53	Established	google-tts-libraries	1,030	TypeScript
299	Notely-Voice/NotelyVoice A 100% private AI voice transcription app that converts speech to text in...	53	Established	local-voice-dictation	629	C++
300	rzru/nightingale Machine learning powered Karaoke app (with scores!)	53	Established	audio-music-learning	548	Rust

« Prev 1 2 3 4 5 … 80 81 82 Next »