All Voice AI Tools

8,165 tools ranked by quality score · Page 12 of 82

Showing 1101–1200 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1101	fedden/RenderMan Command line C++ and Python VSTi Host library with MFCC, FFT, RMS and audio...	43	Emerging	audio-source-separation	395	C++
1102	stefantaubert/en-tts Command-line interface and Python library for synthesizing English texts into speech.	43	Emerging	lightweight-tts-libraries	5	Python
1103	alexpinel/Dot Text-To-Speech, RAG, and LLMs. All local!	43	Emerging	document-qa-chatbots	1,896	JavaScript
1104	tema6120/ForgetMeNot A flashcard app for Android.	43	Emerging	android-speech-apps	429	Kotlin
1105	OpenCOVID19CoughCheck/CoughCheckApp Development of AI audio app to compare the cough of a Coronavirus (COVID-19)...	43	Emerging	respiratory-disease-detection	67	JavaScript
1106	bold-ronin/lira A Voice-First AI Companion	43	Emerging	local-voice-assistants	15	Dart
1107	superstarryeyes/lue Terminal eBook Reader with Audiobook-Quality Text-to-Speech — Supports EPUB,...	43	Emerging	ebook-to-audiobook-conversion	705	Python
1108	stefantaubert/mel-cepstral-distance A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral...	43	Emerging	keyword-speech-recognition	65	Python
1109	pnlpal/pnl-reader PNL Reader: read quietly or read aloud	43	Emerging	browser-tts-extensions	11	JavaScript
1110	nobody132/masr 中文语音识别; Mandarin Automatic Speech Recognition;	43	Emerging	end-to-end-asr-frameworks	1,964	Python
1111	kurianbenoy/Indic-Subtitler Open source subtitling platform 💻 for transcribing and translating...	43	Emerging	whisper-speech-transcription	93	Jupyter Notebook
1112	keonlee9420/PortaSpeech PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative...	43	Emerging	neural-vocoder-implementations	341	Python
1113	Rongjiehuang/GenerSpeech PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model...	43	Emerging	fastspeech-tts-models	330	Python
1114	AASHISHAG/deepspeech-german Automatic Speech Recognition (ASR) - German	43	Emerging	automatic-speech-recognition	321	Python
1115	benmaster82/writher Voice-powered productivity for Windows	43	Emerging	audio-transcription-tools	11	Python
1116	TimoBolkart/voca This codebase demonstrates how to synthesize realistic 3D character...	43	Emerging	image-caption-generation	1,256	Python
1117	deepgram-starters/django-voice-agent Get started using Deepgram's Voice Agent with this Django demo app	43	Emerging	deepgram-starter-projects	7	Python
1118	DmitryRyumin/OpenAV An open-source library for recognition of speech commands in the user...	43	Emerging	wake-word-detection	7	Python
1119	sai9640nayak/StreamingKokoroJS Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100%...	43	Emerging	kokoro-tts-ecosystem	3	JavaScript
1120	goodmike31/pl-asr-bigos-tools Extendable toolkit for comprehensive evaluation of ASR systems. Currently...	43	Emerging	automatic-speech-recognition	11	Python
1121	mikopbx/ModuleSmartIVR Модуль умной маршрутизации для 1C:Предприятия	43	Emerging	ai-tutoring-platforms	4	PHP
1122	huawei-noah/Speech-Backbones This is the main repository of open-sourced speech technology by Huawei...	43	Emerging	automatic-speech-recognition	602	Jupyter Notebook
1123	t0mer/tts-stt Small pyhon flask container allowing us to convert Text to Speech and Speech to Text	43	Emerging	self-hosted-tts-servers	11	Python
1124	sp-nitech/DNN-HSMM pytorch implementation of DNN-HSMM for TTS	43	Emerging	tacotron-tts-models	70	Python
1125	sovaai/sova-asr SOVA ASR (Automatic Speech Recognition)	43	Emerging	automatic-speech-recognition	172	Python
1126	rhulha/StreamingKokoroJS Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100%...	43	Emerging	kokoro-tts-ecosystem	330	JavaScript
1127	ponlponl123/-Prototype-AIVTuber a open-source Artificial Intelligence Virtual Youtuber (AI VTuber), (this...	43	Emerging	interactive-ai-avatars	439	JavaScript
1128	novoic/surfboard Novoic's audio feature extraction library	43	Emerging	audio-source-separation	440	Python
1129	EricBatlle/UnityAndroidSpeechRecognizer 🗣️ Speech recognition on Unity and Android without the annoying google popup!	43	Emerging	dotnet-tts-libraries	71	ShaderLab
1130	timmo001/home-assistant-assist-desktop Use Home Assistant Assist on the desktop. Compatible with Windows, MacOS, and Linux	43	Emerging	voice-controlled-desktop-automation	133	Svelte
1131	AIFSH/ComfyUI-XTTS a custom comfyui node for coqui-ai/TTS's xtts module! support 17 languages...	43	Emerging	comfyui-tts-nodes	67	Python
1132	soundhound/hound-sdk-web-example An example of how to work with text and voice requests using the Houndify...	43	Emerging	web-speech-api-libraries	7	JavaScript
1133	hujingshuang/MTrans Multi-source Translation	43	Emerging	java-tts-libraries	809	Java
1134	rishikksh20/melgan MelGAN implementation with Multi-Band and Full Band supports...	43	Emerging	neural-vocoder-implementations	62	Jupyter Notebook
1135	JosefAlbers/WTM Blazing fast whisper turbo for ASR (speech-to-text) tasks	43	Emerging	whisper-transcription-apps	222	Python
1136	wangkaisine/mrcp-plugin-with-freeswitch 使用FreeSWITCH接受用户手机呼叫，通过UniMRCP...	43	Emerging	vosk-asr-implementations	350	—
1137	FireRedTeam/FireRedASR2S A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc...	43	Emerging	audio-transcription-tools	365	Python
1138	SamYuan1990/flet_sherpa_onnx flet_sherpa_onnx an ASR/STT library for flet basing on sherpa-onnx	43	Emerging	dotnet-tts-libraries	3	Dart
1139	Picovoice/speech-to-intent-benchmark benchmark for Speech-to-Intent engines	43	Emerging	speech-ai-coursework	17	Python
1140	George0828Zhang/torch_cif A fast parallel PyTorch implementation of the "CIF: Continuous...	43	Emerging	end-to-end-asr-frameworks	36	Python
1141	qianchang/zici 字词：收集国学/汉语字词拼音相关资源	43	Emerging	multilingual-speech-datasets	31	—
1142	Appen/UHV-OTS-Speech A data annotation pipeline to generate high-quality, large-scale speech...	43	Emerging	speech-corpora-datasets	106	Forth
1143	chandran-jr/Noteify 🔎A Currency Detection app for the visually impaired which automatically...	43	Emerging	educational-voice-apps	62	Dart
1144	tomasz-oponowicz/spoken_language_identification Identify a spoken language using artificial intelligence (LID).	43	Emerging	next-word-prediction	124	Python
1145	keonlee9420/WaveGrad2 PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement...	43	Emerging	neural-vocoder-implementations	69	Python
1146	zceng/LVCNet LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation	43	Emerging	neural-vocoder-implementations	80	Python
1147	haguro/elevenlabs-go A Go API client library for the ElevenLabs speech synthesis platform	43	Emerging	ai-terminal-agents	31	Go
1148	Ezdokz1337/sunona-v0.001 🎤 Build and deploy intelligent voice AI agents in minutes with Sunona, your...	43	Emerging	voice-agent-applications	2	Python
1149	mitchib1440/SpeakThat The world's most comprehensive notification reader for Android devices.	43	Emerging	android-speech-apps	100	Kotlin
1150	darkautism/sensevoice-rs A Rust-based, SenseVoiceSmall	43	Emerging	rust-speech-recognition	27	Rust
1151	xyqfer/reader 毕业设计-基于智能手机的报纸阅读器	43	Emerging	android-speech-apps	62	Java
1152	GinoShun/Accent-Activation-Steering Official code for "Activation Steering for Accent Adaptation in Speech...	43	Emerging	end-to-end-asr-frameworks	3	Python
1153	HachiroSan/google-pronouncer 🔊 Download pronunciation audio files from Google's dictionary service....	43	Emerging	lightweight-tts-libraries	3	Python
1154	jonatasgrosman/asrecognition ASRecognition: just an easy-to-use library for Automatic Speech Recognition.	43	Emerging	automatic-speech-recognition	50	Python
1155	ai-learning-tools/viva-translate Real-time translation copilot for your browser	43	Emerging	machine-translation-systems	56	TypeScript
1156	karim23657/Persian-tts-coqui Persian/Farsi text to speech(TTS) training using coqui tts	43	Emerging	text-to-speech-frameworks	199	Jupyter Notebook
1157	felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch Classification of 11 types of audio clips using MFCCs features and LSTM....	43	Emerging	keyword-speech-recognition	43	Jupyter Notebook
1158	sevangelatos/py-ttspico Python svox picotts wrapper	43	Emerging	cross-platform-tts-frameworks	6	Python
1159	thetobysiu/Deepstory Deepstory turns a text/generated text into a video where the character is...	43	Emerging	ai-children-storytelling	103	Python
1160	thewh1teagle/piper-onnx Use piper TTS with onnxruntime	43	Emerging	piper-tts-ecosystem	8	Python
1161	aws-solutions/content-localization-on-aws Automatically generate multi-language subtitles using AWS AI/ML services....	43	Emerging	speech-to-text-transcription	43	Vue
1162	MohammedRashad/FPGA-Speech-Recognition Expiremental Speech Recognition System using VHDL & MATLAB.	43	Emerging	keyword-speech-recognition	50	VHDL
1163	R1ckShi/AESRC2020 [ICASSP2021] Data preperation scripts, training pipeline and baseline...	43	Emerging	end-to-end-asr-frameworks	56	Python
1164	rorpage/openfaas-text-to-speech Generate an MP3 of text using Google's Text-to-Speech	43	Emerging	openai-tts-applications	11	Dockerfile
1165	dbklim/Voice_ChatBot Chatbot in russian with speech recognition using PocketSphinx and speech...	43	Emerging	voice-chatbot-applications	60	Python
1166	wit-ai/android-voice-demo Example on how to build a voice-enabled Android app with Wit.ai	43	Emerging	text-to-speech-conversion	41	Java
1167	lablab-ai/OpenAI_Whisper_Streamlit A minimalistic automatic speech recognition streamlit based webapp powered...	43	Emerging	speech-to-text-converters	40	Python
1168	gooofy/py-marytts Python MaryTTS HTTP client library	43	Emerging	lightweight-tts-libraries	8	Python
1169	rainygirl/rspeaker 말귀를 알아듣고 뉴스도 요약해 읽어줍니다	43	Emerging	news-audio-bulletins	26	Python
1170	yl4579/StyleTTS-VC Official Implementation of StyleTTS-VC	43	Emerging	text-to-speech-frameworks	197	Python
1171	upskyy/Transformer-Transducer PyTorch implementation of "Transformer Transducer: A Streamable Speech...	43	Emerging	end-to-end-asr-frameworks	113	Python
1172	LiberSonora/LiberSonora LiberSonora，寓意“自由的声音”，是一个 AI 赋能的、强大的、开源有声书工具集，包含智能字幕提取、AI标题生成、多语言翻译等功能，支持...	43	Emerging	ai-podcast-generation	463	Python
1173	developers-cosmos/Mimasa Real time multilingual face translator	43	Emerging	real-time-voice-translation	38	Python
1174	keonlee9420/Cross-Speaker-Emotion-Transfer PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based...	43	Emerging	zero-shot-voice-synthesis	194	Python
1175	opensource-spraakherkenning-nl/Kaldi_NL Code related to the Dutch instance and user groups of the KALDI speech...	43	Emerging	kaldi-asr-ecosystem	68	Shell
1176	hopkira/k9 Latest main K9 robot repository with 3D vision, local STT/TTS with GPT-3 and...	43	Emerging	voice-controlled-robotics	24	Python
1177	Gmzxdotzz/Dia-TTS-Server Self-host the powerful Dia TTS model. This server offers a user-friendly Web...	43	Emerging	self-hosted-tts-servers	4	Python
1178	taresh18/TTSizer 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific...	43	Emerging	tts-dataset-creation	135	Python
1179	pandeydivesh15/AVSR-Deep-Speech Google Summer of Code 2017 Project: Development of Speech Recognition Module...	43	Emerging	ctc-asr-implementations	45	Python
1180	yuhr/langue A modern platform for conlanging. Currently in the planning stage.	43	Emerging	vosk-asr-implementations	44	TypeScript
1181	mozilla/DeepSpeech-examples Examples of how to use or integrate DeepSpeech	43	Emerging	wake-word-detection	858	Python
1182	niker/EdgeTtsSharp EdgeTTS Sharp is a library that provides an easy-to-use, realtime-streaming,...	43	Emerging	edge-tts-implementations	18	C#
1183	alex-vt/WhisperInput Offline voice input panel & keyboard with punctuation for Android.	43	Emerging	whisper-framework-ports	111	Java
1184	candlewill/Speech-Corpus-Collection A Collection of Speech Corpus for ASR and TTS	43	Emerging	speech-corpora-datasets	113	—
1185	Hecate2/sukasuka-vocal-dataset-builder すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from...	43	Emerging	tts-dataset-creation	49	Python
1186	AmphionTeam/FlexiCodec [ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates	43	Emerging	neural-vocoder-implementations	42	Python
1187	jtkim-kaist/VAD Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM...	43	Emerging	ios-speech-frameworks	869	MATLAB
1188	kaituoxu/Speech-Transformer A PyTorch implementation of Speech Transformer, an End-to-End ASR with...	43	Emerging	end-to-end-asr-frameworks	809	Python
1189	Pankaj-Baranwal/pocketsphinx Updated ROS bindings to pocketsphinx	43	Emerging	automatic-speech-recognition	38	Python
1190	ttop32/coqui_tts_korea Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS	43	Emerging	voice-cloning-synthesis	64	Jupyter Notebook
1191	bawangxx/XZVoice Free and open source text-to-speech software	43	Emerging	web-speech-api-tts	1,180	Vue
1192	journey-ad/CosyVoice2-Ex CosyVoice2 功能扩充（预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API）	43	Emerging	coqui-tts-applications	189	Python
1193	tover0314-w/opentypeless Talkmore with Opentypeless. Type with your voice. Anywhere. Talk -...	43	Emerging	audio-transcription-tools	40	TypeScript
1194	nyrahealth/CrisperWhisper Verbatim Automatic Speech Recognition with improved word-level timestamps...	43	Emerging	whisper-diarization	927	Python
1195	chenmingxiang110/Chinese-automatic-speech-recognition Chinese speech recognition	43	Emerging	speaker-diarization-embedding	159	Jupyter Notebook
1196	jojojaeger/whisper-streamlit this master thesis project is based on OpenAI Whisper with the goal to...	43	Emerging	speech-to-text-converters	48	Python
1197	flogy/gatsby-mdx-tts 🗣 Adds speech output to your Gatsby site using Amazon Polly.	43	Emerging	aws-polly-tts	9	TypeScript
1198	jsugg/ser The AI-powered ser Python package is a tool for recognizing and analyzing...	43	Emerging	speech-emotion-recognition	6	Python
1199	linux-speakup/espeakup a light weight connector for espeak-ng and speakup	43	Emerging	espeak-ng-ecosystem	36	C
1200	seanghay/KLEA An open-source Khmer Word to Speech Model. Just single word not sentence!	43	Emerging	tts-model-finetuning	19	Python

« Prev 1 2 3 … 10 11 12 13 14 … 80 81 82 Next »