All Voice AI Tools

8,165 tools ranked by quality score · Page 23 of 82

Showing 2201–2300 of 8,165

#	Tool	Score	Tier	Category	Stars	Language
2201	deepgram-devs/flask-live-chatgpt-text-to-speech Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app	36	Emerging	deepgram-starter-projects	6	Python
2202	silenterus/deepspeech-cleaner Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework	36	Emerging	speech-corpora-datasets	48	Python
2203	parzibyte/tts-js Demonstration of speechSynthesis with JavaScript: TTS, or speech synthesis	36	Emerging	web-speech-api-tts	6	JavaScript
2204	Hamahmi/kaldi-tut This is a Kaldi tutorial for beginners	36	Emerging	kaldi-asr-ecosystem	6	Shell
2205	OssiaAI/OssiaVoice Ossia is an accessibility tool for those unable to speak or type; Ossia...	36	Emerging	audio-transcription-tools	5	Vue
2206	nico-byte/whisper-web The Whisper Web Transcription Server is a Python-based real-time...	36	Emerging	speech-to-text-converters	3	Python
2207	BayramAnnakov/gmail-to-podcast Transform Gmail newsletters into AI-generated podcast conversations using...	36	Emerging	content-to-podcast-converters	5	Python
2208	LonePheasantWarrior/TalkifyTTS An Android speech synthesis app (TTS engine) driven by cloud-based large models. Supports Doubao, Tencent, Microsoft, Qwen, and other models. An Android text-to-speech...	36	Emerging	java-tts-libraries	60	Kotlin
2209	LonePheasantWarrior/VolcengineTTS An online TTS Android app built on Volcengine's Doubao speech service (An online TTS Android application based on the...	36	Emerging	java-tts-libraries	18	Kotlin
2210	MiguelsPizza/local-transcription-mcp--parakeet-tdt-0.6b-v2-- Local MCP server that converts and transcribes video and audio files 100% on device	36	Emerging	voice-enabled-coding-assistants	10	Python
2211	rishikksh20/LightSpeech LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search	36	Emerging	fastspeech-tts-models	94	Python
2212	prohetamine/tor-speech 🔉 Yandex & Google + Tor	36	Emerging	google-tts-libraries	6	JavaScript
2213	ankushbhatia2/django-speech-to-text A small API for speech to text made in Django.	36	Emerging	web-based-tts-apps	5	Python
2214	6Morpheus6/Chattered All in one Gradio interface for chatterbox. Voice cloning from uploaded...	35	Emerging	self-hosted-tts-servers	8	JavaScript
2215	ikfly/java-tts java-tts: text-to-speech	35	Emerging	java-tts-libraries	59	Java
2216	golemfactory/g-flite g-flite: flite app distributed over Golem Network	35	Emerging	rust-tts-libraries	8	Rust
2217	purvanshjoshi/IndiVoice-DeepASR Deep Learning framework for Indian-accented Speech-to-Text using Whisper and...	35	Emerging	whisper-speech-transcription	2	Python
2218	Lightning-Universe/Echo Production-ready audio and video transcription app that can run on your...	35	Emerging	speech-to-text-converters	71	TypeScript
2219	adhadse/Deepdubpy A complete end-to-end Deep Learning system to generate high quality human...	35	Emerging	lip-reading-synthesis	13	Jupyter Notebook
2220	innovatorved/whisper-openai-gradio-implementation Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation	35	Emerging	speech-to-text-converters	75	Python
2221	jaoafa/ChatWatcher 🗣 Discord voice-chat speech recognition	35	Emerging	discord-tts-bots	1	Java
2222	timoil/whisper-subtitles 🎬 AI-powered localhost subtitle generator for hearing-impaired users....	35	Emerging	whisper-subtitle-generation	38	Python
2223	M86xKC/edge-tts Simple TTS using MS Edge built-in voices	35	Emerging	edge-tts-implementations	28	JavaScript
2224	PareekshithPalat/Transcriptor The Transcriptor is a subtitle extractor, lightweight web application built...	35	Emerging	youtube-transcript-summarization	1	Python
2225	jim11662418/General_Instrument_CTS256_SP0256_Speech_Synthesizer Vintage General Instrument Speech Synthesizer CTS256 with SP0256	35	Emerging	embedded-tts-systems	11	Assembly
2226	samsad35/source-filter-vae [SpeechCom Journal] Learning and controlling the source-filter...	35	Emerging	audio-noise-reduction	45	Python
2227	BenLubar/espeak Package espeak is a wrapper around espeak-ng that works both natively and in...	35	Emerging	espeak-ng-ecosystem	10	Go
2228	Kaljurand/Diktofon An Android app, a dictaphone with Estonian speech-to-text	35	Emerging	android-speech-apps	14	Java
2229	nexxeln/spotify-voice-control Voice control for Spotify through the terminal	35	Emerging	voice-controlled-robotics	79	Python
2230	junjie-xyz/whisper-video Generate subtitles for all the videos in a folder with OpenAI's Whisper...	35	Emerging	audio-transcription-tools	35	Python
2231	heartsuit/BaiduASRAndTTS Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech;...	35	Emerging	dotnet-tts-libraries	47	C#
2232	jx1100370217/DFCNN-master A speech recognition system based on a fully convolutional neural network	35	Emerging	ctc-asr-implementations	79	Python
2233	Yukaii/gakuon Review Anki cards using Generative AI voice	35	Emerging	anki-tts-integration	5	TypeScript
2234	JustinGOSSES/spoken-floodplain Website that verbally tells users when they enter or leave a floodplain in...	35	Emerging	web-speech-api-libraries	6	Jupyter Notebook
2235	Babakinha/Dectalk A Simple package for using Dectalk	35	Emerging	dotnet-tts-libraries	5	TypeScript
2236	zerospeech/benchmarks A command line tool that helps use the "Zero Resource Challenge" benchmarks	35	Emerging	ml-benchmarking-frameworks	12	Python
2237	MelvilQ/stacksrs A simple Spaced Repetition app for Android.	35	Emerging	android-speech-apps	9	Java
2238	vectominist/spin Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for...	35	Emerging	end-to-end-asr-frameworks	64	Python
2239	Leonard2310/LibrAI iOS app with AI for an immersive audiobook experience, text-to-speech and...	35	Emerging	ai-image-generation-platforms	17	Swift
2240	ikarago/Talkinator Talkinator is an easy to use text-to-speech-app for Windows 10-devices	35	Emerging	dotnet-tts-libraries	10	C#
2241	lelosaiyan/J.A.R.V.I.S. A voice virtual desktop assistant for Windows 7/10	35	Emerging	python-voice-assistants	8	Python
2242	matusstas/openai-whisper-microservice This is an OpenAI Whisper automatic speech recognition microservice	35	Emerging	speech-to-text-converters	24	Python
2243	noir-neo/UniSpeech iOS speech framework native plugin for Unity	35	Emerging	dotnet-tts-libraries	14	C#
2244	qkl9527/voice-assistant A [real-time] AI voice assistant based on Funasr	35	Emerging	funasr-speech-recognition	24	Python
2245	orianemartin/WhispGrid A Whisper to TextGrid script that I use to automatize Corpus Annotation on...	35	Emerging	whisper-diarization	13	Python
2246	charstorm/vilberta Voice chatbot with voice+screen output to show that "not everything needs to...	35	Emerging	voice-chatbot-applications	6	Python
2247	dcavar/ELAN2split Split ELAN Annotation Files and corresponding speech files into a corpus...	35	Emerging	automatic-speech-recognition	11	C++
2248	systoolz/dosbtalk unofficial API implementation for Text-to-Speech Engine by First Byte	35	Emerging	dotnet-tts-libraries	5	C
2249	alisolphp/EchoTalk A browser-based language training app using Shadowing technique with...	35	Emerging	ai-tutoring-platforms	2	TypeScript
2250	tuhinpal/text-to-speech Text to Speech using Google's Library (Made for Fun)	35	Emerging	lightweight-tts-libraries	6	HTML
2251	SupernovifieD/FreeSpeechToText A python program that extracts text from audio files - .mp3 or .wav - for free!	35	Emerging	speech-recognition-apis	9	Python
2252	MazueraAlvaro/speech-recognition-asterisk A script for speech recognition in asterisk	35	Emerging	web-speech-api-libraries	6	PHP
2253	ORI-Muchim/One-Click-VITS-Training VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification...	35	Emerging	vits-tts-implementations	37	Python
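2254	chienhsiang-hung/voice-and-wav-cloning Generates high-quality voice and video clones from a small number of voice and video samples (AI talking-portrait narration generation), and provides a range of audio processing techniques to improve sound quality and realism.	35	Emerging	voice-cloning-synthesis	9	Jupyter Notebook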
2255	codekraft-studio/vue-speech Vue integration and components for the Web Speech API	35	Emerging	vue-speech-recognition	8	Vue
2256	yc9701/pansori-tedxkr-corpus Korean ASR Corpus generated from TEDx talks	35	Emerging	speech-corpora-datasets	27	—
2257	dialpad/mucs_2021_dialpad Dialpad team's submission to the MUCS 2021 workshop	35	Emerging	automatic-speech-recognition	5	Python
2258	huckiyang/QuantumSpeech-QCNN IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing...	35	Emerging	ctc-asr-implementations	107	Jupyter Notebook
2259	hebbihebb/MBook EPUB to M4B using Maya1	35	Emerging	ebook-to-audiobook-conversion	5	Python
2260	nhut-ngnn/Voice-Based-Age-and-Gender-Recogniton [ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of...	35	Emerging	facial-attribute-classification	10	Python
2261	HarunoriKawano/BEST-RQ Implementation of the paper "Self-supervised Learning with Random-projection...	35	Emerging	neural-vocoder-implementations	91	Python
2262	placebokkk/e6870 assignments for e6870 ASR class	35	Emerging	keyword-speech-recognition	42	C
2263	maetshju/flux-blstm-implementation An implementation of the Graves & Schmidhuber (2005) bidirectional LSTM in Flux.	35	Emerging	neural-vocoder-implementations	11	Julia
2264	mattzzz/rick-voice Give any bot the voice of Rick Sanchez	35	Emerging	discord-tts-bots	14	Python
2265	indonesian-nlp/multilingual-asr Multilingual Speech Recognition for Indonesian Languages	35	Emerging	voice-cloning-synthesis	70	Python
2266	HuuHuy227/XphoneBert_Vits2 VITS2 extended with XPhoneBERT encoder	35	Emerging	text-to-speech-frameworks	10	Python
2267	markhliu/mpt Code repository for the book Make Python Talk	35	Emerging	speech-recognition-apis	46	Python
2268	darsh-1010/Jarvis-A-Voice-Based-Assistant-Powered-by-LLaMA Jarvis is a voice-based assistant built in Python that simplifies daily...	35	Emerging	python-voice-assistants	6	Python
2269	kostas2370/Video-Creator This project is to automate the video creation.	35	Emerging	ai-video-generation	25	Python
2270	thevickypedia/Jarvis_UI Light weight UI to interact with Jarvis via API calls	35	Emerging	python-voice-assistants	6	Python
2271	yanorei32/winrt-tts-server A simple Web Based Windows Runtime (WinRT) Speech Synthesis API	35	Emerging	rust-tts-libraries	1	Rust
2272	mo7amedaliEbaid/run-tracker A flutter run tracker app - clean architecture	35	Emerging	educational-voice-apps	24	Dart
2273	go-restream/supertts 🎧 Supertonic TTS ONNX Inference Openai Speech REST API	35	Emerging	lightweight-tts-runtimes	5	Rust
2274	opensource-spraakherkenning-nl/asr_nl Dutch Speech Recognition webservice	35	Emerging	automatic-speech-recognition	8	Python
2275	Vaibhavs10/ml-with-audio HF's ML for Audio study group	35	Emerging	speech-ai-coursework	202	Jupyter Notebook
2276	botbahlul/Live-Subtitle ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free...	35	Emerging	live-caption-generation	17	Java
2277	void-xtreme/audible-text-editor An automated Sinhala audio Text Editor for visually impaired and blind students	35	Emerging	web-speech-api-tts	2	TypeScript
2278	drivendataorg/childrens-speech-recognition-benchmark-pub Tutorial code for the On Top of Pasketti: Children’s Speech Recognition Challenge	35	Emerging	automatic-speech-recognition	2	Jupyter Notebook
2279	shreyasnisal/SpeechProgrammer The Speech Programmer writes code based on voice commands. Right now it only...	35	Emerging	speech-recognition-apis	5	JavaScript
2280	chimechallenge/chime-utils Scripts for data generation, scoring and data manifest preparation for...	35	Emerging	automatic-speech-recognition	24	Python
2281	Tristan296/Universal-MacAssistant Advanced Personal Assistant created for macOS that utilises AppleScripts,...	35	Emerging	voice-controlled-desktop-automation	12	Python
2282	saurabhchalke/whisper-meta-quest Running speech-to-text in a Meta Quest headset using OpenAI's Whisper tiny model	35	Emerging	whisper-transcription-apps	45	C#
2283	Hamtech-ai/wav2vec2-fa fine-tune Wav2vec2. an ASR model released by Facebook	35	Emerging	wav2vec2-asr-models	36	Jupyter Notebook
2284	HaoQChen/iflytek_awaken_asr use iflytek's technology to realize awaken and order recognition	35	Emerging	automatic-speech-recognition	71	C
2285	pncnmnp/phoenix10.1 Creates personalized radio stations with your own radio jockey!	35	Emerging	news-audio-bulletins	118	Python
2286	heyfoz/python-youtube-transcription This repository contains Python scripts and a local Flask web application...	35	Emerging	video-transcription-extraction	5	Python
2287	Ralireza/spoken-digit-recognition Classifying English spoken digit by Hidden Markov Model	35	Emerging	keyword-speech-recognition	13	Python
2288	syntithenai/opensnips Open source projects related to Snips https://snips.ai/.	35	Emerging	voice-ai-learning-collections	55	JavaScript
2289	yokawasa/vscode-translator-voice VS Code extension for multi-language text translation and TTS...	35	Emerging	ai-powered-ereaders	7	TypeScript
2290	AceCentre/pasco Phrase Auditory Scanning COmmunicator - AAC App for iOS and the Web	35	Emerging	react-native-voice-libraries	16	JavaScript
2291	theamazing0/global-subtitles-main Closed Captioning Everywhere, With Assembly AI	35	Emerging	whisper-subtitle-generation	6	Python
2292	candlewill/Ossian Ossian: A simple language-independent Text-to-speech frontend	35	Emerging	lightweight-tts-libraries	17	Python
2293	atomicoo/Tacotron2-PyTorch PyTorch implementation of Tacotron-2.	35	Emerging	tacotron-tts-models	14	Python
2294	dokuniev/claude-voice Hear which Claude Code session needs you — speaks the repo and branch name out loud	35	Emerging	voice-enabled-coding-assistants	2	Shell
2295	Helther/voice-pick-tbot Text To Speech Synthesis Telegram Bot with voice customization	35	Emerging	telegram-voice-transcription	5	Python
2296	18F/tts-buy-challengegov-ideation Market research documents related to the Challenge.gov Ideation Platform.	35	Emerging	government-procurement-docs	4	—
2297	BullShark/JSpeak A Text to Speech Reader Front-end that Reads from the Clipboard and with...	35	Emerging	java-tts-libraries	16	Java
2298	GetProjectsIdea/Convert-Text-to-Speech-in-Python Text to speech is a process to convert any text into voice. Text to speech...	35	Emerging	lightweight-tts-libraries	5	Python
2299	HasnainDarkNet/DarKVoice DarKVoice is an open-source voice assistant and audio processing tool built...	35	Emerging	general-purpose-voice-assistants	5	Python
2300	AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming speech-enhancement	35	Emerging	keyword-speech-recognition	60	Python
