All Voice AI Tools

8,165 tools ranked by quality score · Page 26 of 82

Showing 2501–2600 of 8,165

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
2501	sandy1990418/ChineseTaiwaneseWhisper This repository focuses on leveraging OpenAI's Whisper model for speech...	34	Emerging	whisper-fine-tuning	70	Python
2502	nhaouari/local11labs Local11Labs allows generating high-quality text-to-speech and podcast...	34	Emerging	kokoro-tts-ecosystem	52	Python
2503	LucaDe/text_to_speech_api A simple wrapper for Google's Text-To-Spech API for Dart and Flutter projects.	34	Emerging	educational-voice-apps	2	Dart
2504	Gaurav890/vocal-stack vocal-stack is a high-performance utility library for developers building...	34	Emerging	ai-tutoring-platforms	2	TypeScript
2505	alkhimey/esp32-flite Speech synthesis running on ESP32 based on Flite engine.	34	Emerging	embedded-tts-systems	75	C
2506	jiwidi/DeepSpeech-pytorch Pytorch implementation for DeepSpeech 2.0	34	Emerging	end-to-end-asr-frameworks	31	Python
2507	jianchang512/gemini-speech2srt 使用 Gemini AI 转写音视频为 SRT 字幕	34	Emerging	content-to-podcast-converters	54	Python
2508	ale-grassi/discord-elevenlabs-tts-bot A simple Discord TTS bot that uses the Eleven Labs API	34	Emerging	discord-tts-bots	5	Python
2509	medokin/soundpad-text-to-speech Text-To-Speech for Soundpad	34	Emerging	dotnet-tts-libraries	47	C#
2510	hwk06023/SONATA SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced...	34	Emerging	rust-tts-libraries	5	Python
2511	simalexan/speechy Voice command tool for an easy web speech recognition for your web...	34	Emerging	web-speech-api-libraries	5	JavaScript
2512	EuleMitKeule/speaker-recognition Speaker recognition service for Home Assistant using voice embeddings. Train...	34	Emerging	speaker-diarization-embedding	17	Python
2513	sskorol/respeaker-websockets This project reveals full Respeaker Core V2 potential by using bundled...	34	Emerging	vosk-asr-implementations	7	C++
2514	JensBorrisholt/GoogleSpeak This repository demonstrates how to Use Google for implementing Text to...	34	Emerging	dotnet-tts-libraries	28	Pascal
2515	r1di/neutts-fastapi OpenAI-compatible Text-to-Speech API server powered by NeuTTS. Drop-in...	34	Emerging	self-hosted-tts-servers	1	Python
2516	makeabilitylab/ProtoSound ProtoSound is a deployable interactive system for personalizing a sound...	34	Emerging	audio-event-classification	6	Java
2517	ZhuoZhuoCrayon/AcousticKeyBoard-Web ❓声学键盘｜脑洞大开：做一个能听懂键盘敲击键位的「玩具」，学习信号处理 / 深度学习 / 安卓 / Django。	34	Emerging	audio-music-learning	88	Python
2518	Pallas1303/FestPB FestPB é um projeto com objetivo de oferecer suporte ao Português Brasileiro...	34	Emerging	cross-platform-tts-frameworks	10	Shell
2519	Speech-to-text-Kafka-Airflow-Spark/StoTkas Data engineering pipeline that allows recording millions of Amharic and...	34	Emerging	voice-ai-learning-collections	2	Jupyter Notebook
2520	Supremolink81/TTSCeleb A TTS app where you can clone the voices of any person you wish.	34	Emerging	voice-cloning-tools	9	Python
2521	felipefacundes/guglinatts Guglina TTS é um sintetizador de voz, em português do Brasil, que lê telas...	34	Emerging	php-tts-libraries	7	Perl
2522	teyang-lau/YOListenO Building an AI-powered tool for auto converting audio from lectures/meetings...	34	Emerging	audio-transcription-apps	6	Python
2523	laszukdawid/cracker Usable GUI for text-to-speech services	34	Emerging	lightweight-tts-libraries	5	Python
2524	freakingrocky/EmoCh Emotion Analysis from Speech AI in Python using mfcc, mel, chroma	34	Emerging	speech-emotion-recognition	9	Python
2525	Jen-Hung-Ho/ros2_jetbot_voice Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson...	34	Emerging	text-to-speech-conversion	13	Python
2526	ThisModernDay/f5-tts F5-TTS is a web application that allows users to clone voices and generate...	34	Emerging	voice-cloning-tools	8	Python
2527	nay-cat/LiveKit-PiperTTS-Plugin Quick integration of Piper TTS (super lightweight, high-quality model) with LiveKit	34	Emerging	gradio-tts-webuis	5	Python
2528	shaheennabi/Multi-lingual-AI-Assistant-with-gTTS-and-Gemini-Pro An end-to-end AI assistant using gTTS for multi-lingual text-to-speech and...	34	Emerging	multimodal-medical-assistants	5	Python
2529	adrxLV/J.A.R.V.I.S.AI A AI-powered voice assistant based on JARVIS using ollama.	34	Emerging	python-voice-assistants	3	Python
2530	sudonitin/Audio-book-generator Convert your ebooks to audiobooks. 📖->🎧	34	Emerging	ebook-to-audiobook-conversion	74	Python
2531	TharanaBope/whisper-v3-diarization Production-ready audio transcription & speaker diarization CLI & GUI using...	34	Emerging	whisper-diarization	1	Python
2532	ctkqiang/ZhuYing 竹影是一款创新的视频语音转录与翻译工具，专注于提供高质量的视频音频转文字服务和多语言翻译功能。本项目采用先进的人工智能技术，为用户提供便捷的视频内容处理解决方案。	34	Emerging	viral-clip-generation	13	Python
2533	Dark2C/Viral-Faceless-Shorts-Generator Automatically generate faceless YouTube Shorts from trending topics using AI...	34	Emerging	ai-video-generation	41	HTML
2534	ARAI-Telegram/teledash-backend-processing Optional AI-powered features of Teledash, an open-source software for...	34	Emerging	telegram-voice-transcription	4	Python
2535	boochow/TFLite_Micro_MicroSpeech_M5Stack M5Stack (ESP32) port of TensorFlow Lite for Microcontrollers demo "Micro Speech"	34	Emerging	wake-word-detection	31	C++
2536	kaituoxu/Tacotron2 A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS)...	34	Emerging	tacotron-tts-models	52	Python
2537	dfop02/auto-sub Automatically subtitle a video from almost any language to your native...	34	Emerging	whisper-subtitle-generation	18	Python
2538	rezkyatinnov/capetangjs A JavaScript library for text to speech vice versa using Web Speech API	34	Emerging	web-speech-api-tts	6	JavaScript
2539	DePasqualeOrg/swift-tiktoken A pure Swift implementation of OpenAI's tiktoken tokenizer	34	Emerging	text-tokenization-libraries	3	Swift
2540	twangodev/speak-mintlify Automatically generate voice narration for your Mintlify documentation.	34	Emerging	web-speech-api-tts	2	TypeScript
2541	upskyy/Paper-Review Paper Review about Speech Recognition · NLP	34	Emerging	speech-ai-coursework	10	—
2542	vibhasdutta/PC-ASSISTANT A voice-operated PC assistant for Windows , enabling hands-free control for...	34	Emerging	voice-controlled-desktop-automation	15	Python
2543	tuanio/nextformer PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For...	34	Emerging	conformer-asr-implementations	10	Python
2544	GENIVI/VCIVING-SpeechRecognition GENIVI GSoC 2018 and 2019	34	Emerging	voice-controlled-robotics	6	Python
2545	GeoHaberC/Story-to-Video Create a Movie animation plus Audio plus Subtitle from a text file	34	Emerging	ai-video-generation	44	Python
2546	spandan114/AI-realtime-voice-agent A Python-based real-time voice-to-voice conversation system that lets you...	34	Emerging	voice-agent-applications	6	Python
2547	Llamacha/asr-htk-quechua ASR for quechua language is an open source which can run in real time using...	33	Emerging	automatic-speech-recognition	3	—
2548	anooptoffy/DLJeju2018CodeRepoASR Details on my work on using GANs for speech synthesis for improving Speech...	33	Emerging	neural-vocoder-implementations	8	—
2549	eazhary/dctts2 Deep Convolution Text to Speech	33	Emerging	fastspeech-tts-models	34	Python
2550	nowickam/facial-animation Audio-driven facial animation generator with BiLSTM used for transcribing...	33	Emerging	ai-avatar-platforms	36	Jupyter Notebook
2551	lucasnewman/e2-tts-mlx Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive...	33	Emerging	fastspeech-tts-models	21	Python
2552	Bassamejlaoui/Voice-Cloning-Translation-Transcription Voice cloning, a revolutionary technology, allows us to replicate and...	33	Emerging	voice-assistant-devices	8	—
2553	zoebchhatriwala/CamWord CamWord Is an android application that uses character recognition and voice...	33	Emerging	android-voice-assistants	8	Java
2554	victor369basu/End2EndAutomaticSpeechRecognition In this repository, I have developed an end to end Automatic speech...	33	Emerging	speaker-diarization-embedding	34	Python
2555	aishoot/Multi-Hotword_Spotting Won't it be cool to build a speech assistant like Alexa or Siri yourself...	33	Emerging	wake-word-detection	34	Jupyter Notebook
2556	pnkvalavala/digitaltwin Using a single image and just 10 seconds of sample audio, our project...	33	Emerging	voice-cloning-tools	40	Jupyter Notebook
2557	prathamsolanki/gender-recognition-by-voice Identify a voice as male or female.	33	Emerging	speech-ai-coursework	33	Jupyter Notebook
2558	tabahi/WebSpeechAnalyzer JS speech analyzer for fast speech analysis and labeling	33	Emerging	web-speech-api-libraries	39	JavaScript
2559	CypherousSkies/reading-for-listeners A deep-learning powered accessibility application which turns pdfs into...	33	Emerging	ocr-document-extraction	25	Python
2560	AASHISHAG/DeepSpeech-API The code enables users to use Mozilla's Deep Speech model over the Web Browser.	33	Emerging	web-speech-api-libraries	32	TypeScript
2561	bhattbhavesh91/speech-python-demos pyttsx3 is a text-to-speech conversion library in Python. Its a Python-based...	33	Emerging	lightweight-tts-libraries	3	Jupyter Notebook
2562	Issac-Moses/Beacon Beacon – A lightweight voice-controlled AI assistant using Whisper.cpp. ...	33	Emerging	local-voice-assistants	8	C++
2563	Enforcer03/voice-cloning Voice cloning with tortoise-tts	33	Emerging	voice-cloning-tools	30	Jupyter Notebook
2564	HerambVD/spoken2written A source of python package which converts language styles in speech to its...	33	Emerging	speech-recognition-apis	2	Python
2565	MrAliHasan/Sophia-AI-Assistant Sophia AI Assistant is a Python-based desktop AI that performs a variety of...	33	Emerging	voice-controlled-desktop-automation	30	CSS
2566	Ishan7390/Jarvis_AI This is my attempt at building a not so much of an AI, Jarvis	33	Emerging	python-voice-assistants	30	Python
2567	Zuellni/Orpheus-GGUF Orpheus-TTS inference.	33	Emerging	gradio-tts-webuis	3	Python
2568	thewh1teagle/vad-rs Speech detection using silero vad in Rust	33	Emerging	rust-speech-recognition	30	Rust
2569	The-Data-Dilemma/MediBeng-Whisper-Tiny MediBeng Whisper Tiny improves doctor-patient transcription by training the...	33	Emerging	whisper-speech-transcription	29	Python
2570	RF5/transfusion-asr Transcribing Speech with Multinomial Diffusion, training code and models.	33	Emerging	end-to-end-asr-frameworks	80	Python
2571	stellarloop/bitbat.ai My father, a journalist, used to painstakingly transcribe interviews from a...	33	Emerging	audio-transcription-tools	81	Svelte
2572	yakhyo/kokoro-onnx Kokoro-82m TTS ONNX Runtime inference \| Gradio Demo \| HuggingFace Demo \| Docker	33	Emerging	kokoro-tts-ecosystem	3	Jupyter Notebook
2573	rhulha/Speech2Speech A web application that converts speech to speech 100% private	33	Emerging	voice-ai-assistants	84	JavaScript
2574	mravanelli/pytorch_MLP_for_ASR This code implements a basic MLP for speech recognition. The MLP is trained...	33	Emerging	end-to-end-asr-frameworks	40	Perl
2575	orhun/dialogflowbot Google's Dialogflow implementation on Android with additional features.	33	Emerging	voice-command-assistants	11	Java
2576	gogyzzz/beamformit_matlab A MATLAB implementation of CHiME4 baseline Beamformit	33	Emerging	keyword-speech-recognition	27	HTML
2577	neosapience/n8n-nodes-typecast Integrate Typecast AI TTS into your n8n workflows with this community node.	33	Emerging	google-tts-libraries	1	TypeScript
2578	agentvoiceresponse/avr-tts-deepgram This project demonstrates the integration of Agent Voice Response with...	33	Emerging	deepgram-starter-projects	1	JavaScript
2579	aydinnyunus/LinuxVoiceAssistant Linux Voice Assistant for to Make Your Work Easier	33	Emerging	general-purpose-voice-assistants	38	Python
2580	Serkali-sudo/auto-subtitle-generator An Android app that automatically generates subtitles for videos locally,...	33	Emerging	whisper-subtitle-generation	27	Java
2581	KathyReid/opensource-voice-tools A repo listing known open source voice tools, ordered by where they sit in...	33	Emerging	voice-ai-learning-collections	27	TeX
2582	pschatzmann/arduino-simple-tts A simple TTS solution based on pre-recorded audio	33	Emerging	embedded-tts-systems	21	C
2583	Madhur215/Chatbot-cum-voice-Assistant An AI chatbot with features like conversation through voice, fetching events...	33	Emerging	general-purpose-voice-assistants	37	Python
2584	va-kiet/Voice-Assistant-wake-word-detection-model Build a Wake Word Detection model for Voice Assistant using PyTorch	33	Emerging	virtual-assistants-nlp	26	Python
2585	daanzu/wav2vec2_stt_python Simple Python library, distributed via binary wheels with few direct...	33	Emerging	wav2vec2-asr-models	23	Python
2586	codename0og/codename-rvc-fork-3 Codename's rvc fork version 3, based on Applio.	33	Emerging	voice-cloning-tools	37	Python
2587	theoomoregbee/paysense-backend This is our paysense backend , a sails app	33	Emerging	audio-transcription-apps	5	JavaScript
2588	lucadellalib/audiocodecs A collections of audio codecs with a standardized API	33	Emerging	neural-vocoder-implementations	36	Python
2589	mtokar3v/ReversoAPI-NET 🌐 An API Client for the reverso.net, written in C#/.NET (Based on Site API...	33	Emerging	dotnet-tts-libraries	4	HTML
2590	ignabelitzky/easy-subber A Python-based tool that that takes video files and generates .srt subtitle...	33	Emerging	whisper-subtitle-generation	8	Python
2591	hanxiao/mls MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon	33	Emerging	ios-speech-frameworks	10	HTML
2592	gunarakulangunaretnam/voice-typer A voice recognition based typing tool for English, Tamil, Sinhala languages.	33	Emerging	speech-recognition-apis	3	C#
2593	shawnrushefsky/talky-talky MCP server for Audio Generation and Analysis with a Variety of Open Models.	33	Emerging	voice-enabled-coding-assistants	2	Python
2594	revsic/tf-glow-tts Tensorflow implementation of Glow-TTS	33	Emerging	fastspeech-tts-models	7	Python
2595	echo8795/react-native-android-text-to-speech React Native Text-To-Speech wrapper module for android	33	Emerging	react-native-voice-libraries	7	Java
2596	Animator617/jasper Jasper is a AI asistence programm based on deeplearning	33	Emerging	automatic-speech-recognition	2	C++
2597	m0wer/aibot Telegram bot powered by Ollama, capable of handling text and voice messages,...	33	Emerging	local-voice-assistants	7	Python
2598	fquirin/speech-recognition-experiments Experiments to test different speech recognition systems for SEPIA Framework	33	Emerging	automatic-speech-recognition	63	Python
2599	Ahmed5attab/Qaf-QuranSearchAndMemorization iOS Islamic application for the holy Quran, helps the Muslims to have the...	33	Emerging	ios-speech-frameworks	2	Swift
2600	rt400/ReversoTTS-HA ReversoTTS component for HomeAssistant	33	Emerging	home-assistant-tts	41	Python

« Prev 1 2 3 … 24 25 26 27 28 … 80 81 82 Next »