All Voice AI Tools

8,165 tools ranked by quality score · Page 32 of 82

Showing 3101–3200 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
3101	AlasdairKing/Calendar-VB6 Simple, accessible Calendar for screenreader and blind users.	31	Emerging	dotnet-tts-libraries	3	VBA
3102	tigjaw/remyme ReMyMe - a basic "Read My Messages" Android application (old)	31	Emerging	android-speech-apps	3	Java
3103	Infineon/i2s-microphone A collection of documentation and examples for Infineon's I2S microphones.	31	Emerging	edge-device-ml-frameworks	7	HTML
3104	The-Data-Dilemma/Medibeng-Orpheus-3b-0.1-ft-Fine-Tuning Medibeng-Orpheus-3b-0.1-ft- A TTS model for bilingual Bengali-English...	31	Emerging	tts-model-finetuning	6	Python
3105	BenjaminPoncet/bobby-snips-tts bobby-snips-tts is an implementation of snips-tts written in Node.js with...	31	Emerging	google-tts-libraries	4	JavaScript
3106	Abhradipta/OCR-With-Read-Out-Loud-Using-Python An Optical Character Recognition (OCR) System designed using Python to read...	31	Emerging	image-to-speech-synthesis	3	Python
3107	taeefnajib/Vocazee A voice cloning and text-to-speech application that can generate speech in any voice.	31	Emerging	voice-cloning-tools	3	Python
3108	viig99/esolafast Fast C++ implementation of ESOLA using KFRLib, can be used for online...	31	Emerging	end-to-end-asr-frameworks	16	C++
3109	koesan/Evoars A multi-model AI platform for comics, manga, and videos. It colorizes...	31	Emerging	video-dubbing-tools	16	Python
3110	PiasRoY/Bangla-Spoken-Number-Recognition recognizing spoken Bangla numbers using MFCCs and CNN.	31	Emerging	keyword-speech-recognition	4	Jupyter Notebook
3111	suzumushi0/SoundObject_binary SoundObject binary distribution.	31	Emerging	audio-source-separation	57	—
3112	palahsu/Greeting-PC Greeting PC, made with simple Visual Basic Script. Run file it will executes...	31	Emerging	python-voice-assistants	3	VBScript
3113	dhdaines/soundswallower-demo Simple demo of client-side speech recognition	31	Emerging	web-speech-api-libraries	3	TypeScript
3114	TCL606/Speech-Number-Recognition 基于数字信号处理的语音数字识别器	31	Emerging	keyword-speech-recognition	4	MATLAB
3115	baocin/hugging_face_example_STT_api Demonstration of Hugging Face's (https://huggingface.co/) newly released...	31	Emerging	wav2vec2-asr-models	3	Python
3116	vinbhaskara/Digit-Speech-Recognition Using MFCC features on Speech Signals to classify Digits after matching...	31	Emerging	keyword-speech-recognition	4	Matlab
3117	idiap/TIDIGITSRecipe.jl A Julia recipe for training an ASR system using the TIDIGITS database	31	Emerging	automatic-speech-recognition	4	Julia
3118	marvinborner/CTC-LSTM Spoken word recognition using CTC LSTMs for SWR2 Tübingen	31	Emerging	ctc-asr-implementations	4	Python
3119	vectominist/rspin Official inference code for NAACL 2024 paper "R-Spin: Efficient Speaker and...	31	Emerging	end-to-end-asr-frameworks	4	Python
3120	SzLeaves/asr-model-ctc ASR deep learning models (use BiGRU & WaveNet & CTC), use Tensorflow2...	31	Emerging	ctc-asr-implementations	3	Python
3121	loglux/FlexAudioPrint FlexAudioPrint is a Python-based app for transcribing audio to text using...	31	Emerging	whisper-transcription-apps	10	Python
3122	SEPIA-Framework/sepia-web-audio Create modular, cross-browser, web audio pipelines to record and process...	31	Emerging	web-speech-api-libraries	46	JavaScript
3123	oeschsec/Sidekick---voice-controlled-keyboard-and-mouse Voice controlled keyboard and mouse that is lightweight (minimal...	31	Emerging	vosk-asr-implementations	2	Python
3124	aeleraqi/gTTS---Arabic-text-to-multiple-languages Converting Arabic text to speech in various languages with the versatile...	31	Emerging	lightweight-tts-libraries	2	Jupyter Notebook
3125	BobRandomNumber/ComfyUI-KyutaiTTS A non real-time ComfyUI implementation of Kyutai TTS	31	Emerging	comfyui-tts-nodes	6	Python
3126	papercast-dev/papercast A Python pipeline tool and plugin ecosystem for processing technical...	31	Emerging	content-to-podcast-converters	54	Python
3127	deepgram/deepgram-js-captions This package is the JavaScript implementation of Deepgram's WebVTT and SRT...	31	Emerging	deepgram-starter-projects	16	TypeScript
3128	khanld/Wav2vec2-Pretraining Wav2vec 2.0 Self-Supervised Pretraining	31	Emerging	wav2vec2-asr-models	59	Python
3129	heptacode/interactivekiosk 다양한 사용자를 위한 키오스크 개선 프로젝트 ✨	31	Emerging	vue-speech-recognition	3	Vue
3130	elie-atia/talk-to-chat-gpt Enable to talk to ChatGPTusing voice-to-text (record and recognize the...	31	Emerging	voice-chatgpt-interfaces	3	Python
3131	X-LANCE/VoiceFlow-TTS [ICASSP 2024] This is the official code for "VoiceFlow: Efficient...	31	Emerging	fastspeech-tts-models	372	Python
3132	tsengia/SphinxTrainHelper A Bash script designed to make training sphinx4 and pocketsphinx acoustic...	31	Emerging	kaldi-asr-ecosystem	3	Shell
3133	Phe0nix/Speech-Email-Sender Send email with speech recognition means just start talking and send emails....	31	Emerging	web-speech-api-libraries	2	JavaScript
3134	Philipinho/ThreadVoice Source code for https://twitter.com/threadvoice	31	Emerging	java-tts-libraries	3	Java
3135	yeyupiaoling/VITS-PaddlePaddle 本项目是基于PaddlePaddle的语音合成项目，使用的是VITS，VITS是一种语音合成方法，这种时端到端的模型使用起来非常简单，不需要文本对齐等太复...	31	Emerging	vits-tts-implementations	3	Python
3136	bookbot-hive/OpenBible-TTS Building Text-to-Speech Systems using OpenBible!	31	Emerging	sacred-text-nlp	2	Jupyter Notebook
3137	falabrasil/cmusphinx-br Scripts e recursos para ASR em Português Brasileiro	31	Emerging	kaldi-asr-ecosystem	4	Shell
3138	arcb01/g-narrator A screen reading accessibility tool	31	Emerging	openai-tts-applications	4	Python
3139	kofemann/streetguide An Android app to discover where you drive	31	Emerging	android-speech-apps	2	Java
3140	Ryan5453/lyricscribe Automated Lyric Transcription Research	31	Emerging	live-caption-generation	2	Python
3141	pragmatrix/context-switch Audio Streaming for FreeSWITCH with backends powered by Azure, OpenAI, and Aristech	31	Emerging	ai-avatar-platforms	1	Rust
3142	ASR-project/Multilingual-PR Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....	31	Emerging	speaker-diarization-embedding	258	Python
3143	savg92/voice-cloning This project provides a comprehensive testing and comparison platform for...	31	Emerging	voice-cloning-tools	5	Python
3144	repodiac/espeak-ng_german_loan_words Brief tutorial with code where you can automatically create a dictionary...	31	Emerging	espeak-ng-ecosystem	3	Python
3145	tongplw/ASR-web-based-restaurant 🍔 Foody, a smart voice-assistant web-based restaurant using Kaldi, React, and WebRTC	31	Emerging	react-speech-recognition	3	JavaScript
3146	vishalnagda1/text-to-speech Python program to convert text to speech.	31	Emerging	lightweight-tts-libraries	5	Python
3147	KernelOverseer/caLLMe Realtime voice conversation with llm models using an asynchronous Voice to...	31	Emerging	local-voice-assistants	20	Python
3148	USSLab/DolphinAttack Inaudible Voice Commands	31	Emerging	dotnet-tts-libraries	108	—
3149	Arbazkhan4712/Text-To-Speech A program that can convert Text into Speech using python	31	Emerging	lightweight-tts-libraries	1	Python
3150	auroraapi/aurora-python Aurora SDK for Python	31	Emerging	voice-ai-sdks	4	Python
3151	belambert/asr-scripts Lots of miscellaneous scripts to work with Sphinx ASR files and other...	31	Emerging	automatic-speech-recognition	2	Python
3152	mehdichaouch/nabstory Let your Nabaztag 🐰 read you a story 📖	31	Emerging	ebook-to-audiobook-conversion	3	Python
3153	hanifabd/voice-activity-detection-vad-realtime Real-time Voice Activity Detection (VAD) with some example use case like...	31	Emerging	speaker-diarization-embedding	106	Python
3154	visu123s/MimicKit 🤖 Learn motion imitation with MimicKit, a framework offering advanced...	31	Emerging	voice-cloning-tools	2	Python
3155	Inviro/Illud Illud is a smart text analyzer written in pure Java that displays different...	31	Emerging	ai-avatar-platforms	3	Java
3156	speechly/api Speechly public API definitions and generated code	31	Emerging	ios-speech-frameworks	17	Swift
3157	lpkpaco/Bocchi-The-Rock-GPT-SoVITS-Models Contains voice models based on the GPT-SoVITS architecture of different...	31	Emerging	vits-tts-implementations	3	Python
3158	ggh-png/EMOTIBOT emotion robot using gpt model3.5 EMOTIBOT	31	Emerging	voice-controlled-robotics	21	C++
3159	nikkiw/realtime_translator Python tool for real-time voice recognition and multilingual translation	31	Emerging	real-time-voice-translation	2	Python
3160	SEPIA-Framework/sepia-docs Documentation and Wiki for SEPIA. Please post your questions and bug-reports...	31	Emerging	voice-chatbot-applications	251	—
3161	m1n1v1rus/futuristic-calculator A futuristic, AI-powered advanced calculator with voice control, graph...	31	Emerging	voice-controlled-calculators	2	Python
3162	wamich/personal-vocabulary 「个人词库」是一款浏览器插件。用于英文阅读时，不断记住生词，构建个人词库。	31	Emerging	ai-powered-ereaders	20	JavaScript
3163	in03/squawk Automatic subtitles for DaVinci Resolve with OpenAI Whisper	31	Emerging	whisper-transcription-apps	38	Python
3164	indri-voice/audiotoken Audio tokenization, in the fastest way possible!	31	Emerging	tokenization-libraries	53	Python
3165	charlescao460/SpeechRecognitionByGoogleCloud A .NET program that captures local audio and recognizes speech	31	Emerging	dotnet-tts-libraries	4	C#
3166	milosgajdos/go-playht PlayHT API client Go module	31	Emerging	go-tts-libraries	7	Go
3167	binglel/asr_baidu_web_server asr web server based on flask	31	Emerging	funasr-speech-recognition	4	Python
3168	aks-devs/mod_whisper_asr Freeswitch ASR module	31	Emerging	vosk-asr-implementations	22	C
3169	theawless/sr-lib Automatic Speech Recognition library for my BTech Project.	31	Emerging	keyword-speech-recognition	4	C++
3170	kouyt5/lightning-asr 基于pytorch-lighting框架搭建的端到端语音识别模型，目前还在实验中，性能在不断优化	31	Emerging	end-to-end-asr-frameworks	4	Python
3171	AppleHolic/FastSpeech2 Refactored version of https://github.com/ming024/FastSpeech2	31	Emerging	fastspeech-tts-models	14	Python
3172	denizariyan/Real-Time-Auto-Transcriber Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to...	31	Emerging	video-transcription-extraction	4	Python
3173	naturalDesign/fusion-remote Chatbot for Autodesk Fusion 360 with speech recognition	31	Emerging	voice-command-assistants	4	JavaScript
3174	cjh0613/vosk-android-demo-chinese 中文 vosk-android-demo	31	Emerging	java-tts-libraries	4	Java
3175	MatteoM95/Smart-Home-Vigilance-System An indoor video surveillance system capable of recognizing the presence of a...	31	Emerging	face-recognition-systems	4	Python
3176	kehlawicode/audiblez 🎧 Create high-quality audiobooks from e-books with ease using Audiblez,...	31	Emerging	ebook-to-audiobook-conversion	4	Python
3177	guibranco/talabat-hackathon-2022 🏃 💡 Talabat Hackathon 2022 API project	31	Emerging	dotnet-tts-libraries	2	C#
3178	egorsmkv/radtts-uk 🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model	31	Emerging	ukrainian-voice-ai	4	—
3179	zhurlik/smart-home A multi-project that contains UDP server, MQTT broker and a few sub-projects...	31	Emerging	voice-controlled-robotics	4	HTML
3180	1epalpyrgou/smartbell-server Ένα έξυπνο κουδούνι για το σχολείο μας - 1ο Επαγγελματικό Λύκειο Πύργου	31	Emerging	voice-controlled-robotics	4	Python
3181	nisiddharth/TextToSpeech A Simple Java based Text to Speech converter made using NetBeans 8.2	31	Emerging	java-tts-libraries	2	Java
3182	burrmill/sph2pipe sph2pipe v2.5. We do not maintain this, and/or accept pull requests; just...	31	Emerging	automatic-speech-recognition	4	C
3183	MaikeMota/comando-voz Utilizando HTML5 SpeechRecognizer para Reconhecimento de Comandos.	31	Emerging	voice-interactive-games	4	HTML
3184	Zuhef/Text-to-Speech USING HTML , CSS AND JAVASCRIPT I HAVE BUILD A SIMPLE TEXT TO SPEECH CONVERTER.	31	Emerging	web-speech-api-tts	4	CSS
3185	pkprajapati7402/Darvin-Chatbot Darvin is a Python-based voice-activated chatbot that interacts with users...	31	Emerging	general-purpose-voice-assistants	14	Python
3186	GitPolyakoff/voice-assistant Voice Assistant — приложение на C# для управления компьютером голосом....	31	Emerging	voice-command-assistants	4	C#
3187	wukan1986/KWebSpeaker 保持原排版可选段的网页朗读神器	31	Emerging	ai-powered-ereaders	4	Java
3188	Flux9665/ArticulatoryTextFrontend This is a text-processing frontend that converts graphemes to phonemes and...	31	Emerging	lightweight-tts-libraries	14	Python
3189	Ex094/VoiceCom A Simple Voice Command Application powered by Java and Sphinx4 Speech...	31	Emerging	android-voice-assistants	18	Java
3190	ognistik/alfred-superwhisper Use Alfred to Control Superwhisper - AI Powered Voice to Text	31	Emerging	audio-transcription-tools	122	JavaScript
3191	speechnotes/speechnotes-speech-recognizer The speech recognition engine behind Speechnotes, based on the Webspeech-API	31	Emerging	web-speech-api-libraries	4	—
3192	backpropper/DNN-Activation-Brain Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016)	31	Emerging	keyword-speech-recognition	4	Python
3193	Alan-6666/chinese_asr a demo of chinese asr	31	Emerging	ctc-asr-implementations	4	Python
3194	mayank-kumar-giri/Speech-Recognizer-cum-Voice-Typing-Editor Speech Recognizer cum text editor that facilitates voice typing using Google...	31	Emerging	speech-recognition-apis	4	Python
3195	CodingWithEnjoy/Speech-To-Text-Python متن به صدا \| Text To Speech 😊🤩	31	Emerging	lightweight-tts-libraries	4	Python
3196	HawksLab/narratify e-book to audiobook convertor	31	Emerging	ebook-to-audiobook-conversion	4	Python
3197	PalabraAI/palabra-ai-java Java SDK for Palabra AI's real-time speech-to-speech translation API. Break...	30	Emerging	java-tts-libraries	1	Java
3198	grayhatdevelopers/deepdub 🗣️ Videos for everyone. Implementation of "Automated Dubbing and Facial...	30	Emerging	voice-cloning-synthesis	5	Shell
3199	mallorbc/brillibot-client Easy to use voice commands API python client. Create your own commands in...	30	Emerging	voice-chatbot-applications	2	Python
3200	VisionBrain/Neural_Voice_Cloning Open Source Implementation of Neural Voice Cloning with Few Audio Samples...	30	Emerging	voice-cloning-synthesis	17	Python

« Prev 1 2 3 … 30 31 32 33 34 … 80 81 82 Next »