Speech-to-Text Transcription Voice AI Tools

31 speech-to-text transcription tools are tracked. Two score above 50 (Established tier). The highest-rated is AbdullahHendy/live-translation at 56/100 with 13 stars.

Get all 31 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speech-to-text-transcription&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
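
The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_dataset` are illustrative and not part of the service. It reuses the query parameters from the curl command as-is, including `limit=20`. Whether a larger `limit` is accepted (for example, to cover all 31 projects) is not confirmed here, and the shape of the returned JSON is not assumed.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai",
              subcategory="speech-to-text-transcription",
              limit=20):
    """Build the same query string the curl command uses."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch_dataset(url, timeout=30):
    """Fetch the URL and decode the body as JSON.

    The structure of the decoded value is not assumed; inspect it
    before relying on any particular field.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_dataset(build_url())` issues one unauthenticated request, which counts against the 100 requests/day allowance.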

#	Tool	Score	Tier	Stars	Language
1	AbdullahHendy/live-translation Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM...	56	Established	13	Python
2	i4Ds/whisper-finetune This repository contains code for fine-tuning the Whisper speech-to-text model.	51	Established	22	Jupyter Notebook
3	512z/podlens Free Podwise: AI Podcast & Youtube Transcription & Understanding Agent |...	47	Emerging	5	Python
4	Gr122lyBr/voicetag Speaker identification powered by pyannote and resemblyzer	44	Emerging	32	Python
5	aws-solutions/content-localization-on-aws Automatically generate multi-language subtitles using AWS AI/ML services....	43	Emerging	43	Vue
6	fizamusthafa/whisper-app This repository contains a web application for multi-lingual transcription...	42	Emerging	32	Python
7	AEmotionStudio/ComfyUI-FFMPEGA Intelligent FFMPEG agent node for ComfyUI - transforms natural language...	40	Emerging	5	Python
8	gkrsv/split_audio A rough and ready Python utility which splits audio files based on silence...	38	Emerging	16	Python
9	i4Ds/whisper-prep Data preparation utility for the finetuning of OpenAI's Whisper model.	38	Emerging	11	Python
10	vdutts7/ai-rapper Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise...	33	Emerging	48	Python
11	stevenlawton/GPT-Whisper-captions Automate subtitle generation for videos using OpenAI's Whisper API and...	29	Experimental	1	Go
12	nalbion/whisper-server streaming speech to text server using Whisper	28	Experimental	101	Python
13	Jayem-11/Swahili_speech_to_text Speech to Text for Swahili Language with Whisper-small.	27	Experimental	6	Jupyter Notebook
14	lukereichold/SpeechTimestamper Generate an accurate, timestamped transcript given an audio file and its...	26	Experimental	21	Swift
15	RizhongLin/PolyglotWhisperer Transcribe, translate, and learn — Whisper + LLM video pipeline with dual...	25	Experimental	7	Python
16	somosnlp/wav2vec2-spanish Pre-train a Spanish Wav2Vec2 model using the Spanish portion of the Common...	25	Experimental	5	Python
17	uqqu/sync_book audiobook generator with smart personalized translation	22	Experimental	1	Python
18	LexMainye/Kasuku-Transcriber A speech to text web app for people with speech impairments that has support...	22	Experimental	1	Python
19	stellarloop/video2text Python API & command-line tool to easily transcribe speech-based video files...	21	Experimental	5	Jupyter Notebook
20	eray-yuztyurk/python-ai-audio-transcriber-summarizer AI-powered tool for fast, accurate audio transcription and summarization....	20	Experimental	1	Python
21	Jyotibrat/Speech-To-Text Speech to Text model	19	Experimental	1	Jupyter Notebook
22	danielsobrado/audio-processor Audio processor, focused on english and arabic with diarization and summarization	19	Experimental	2	Python
23	udit-rawat/whisper-space An ASR Gradio GUI based project that transcribes the audio and provides NLP...	17	Experimental	1	Python
24	antarades/emotion-aware-automatic-speech-recognition An intelligent speech recognition system that combines OpenAI's Whisper for...	13	Experimental	—	Python
25	MaharshPatelX/Speechitive A Video analytics tool converting videos to M3U8 playlists using HLS and...	12	Experimental	—	Python
26	singleshade8/japanese-subtitle-generator GPU-accelerated Japanese → English subtitle generator using faster-whisper...	11	Experimental	—	HTML
27	bivex/voice_to_text A Python application for real-time Russian voice-to-text transcription and...	11	Experimental	—	Python
28	Ashishkumar-hub/Text-to-Speech-using-gTTS Text to speech conversion NLP based end to end project	11	Experimental	4	HTML
29	ChaoticByte/audio-summarize An audio summarizer (faster-whisper and BART glued together)	11	Experimental	2	Python
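30	zyovo-0829/LingxiCouplet An interactive AI voice couplet system built on FastAPI, Vue3, Whisper, and the DeepSeek API, supporting voice input, automatic generation of the second line, and intelligent scoring. An...	10	Experimental	1	HTML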
31	kalindasiaminwe/ChitongaASR A natural language processing and machine learning project for a low...	10	Experimental	2	Jupyter Notebook
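
If the table is copied as tab-separated text, the summary figures at the top of the page (for example, how many tools score above 50) can be recomputed from it. The sketch below assumes the six-column layout shown in the header row, that the Tool column holds the repository name followed by a space and its description, and that a dash in the Stars column means no star count is listed. `parse_row` is a hypothetical helper, and only a handful of rows are included for brevity.

```python
from collections import Counter

# Sample rows copied from the table above
# (tab-separated: rank, tool, score, tier, stars, language).
ROWS = [
    "1\tAbdullahHendy/live-translation Real-time speech-to-text translation over WebSocket.\t56\tEstablished\t13\tPython",
    "2\ti4Ds/whisper-finetune This repository contains code for fine-tuning the Whisper speech-to-text model.\t51\tEstablished\t22\tJupyter Notebook",
    "4\tGr122lyBr/voicetag Speaker identification powered by pyannote and resemblyzer\t44\tEmerging\t32\tPython",
    "12\tnalbion/whisper-server streaming speech to text server using Whisper\t28\tExperimental\t101\tPython",
    "24\tantarades/emotion-aware-automatic-speech-recognition An intelligent speech recognition system that combines OpenAI's Whisper for...\t13\tExperimental\t—\tPython",
]


def parse_row(line):
    """Split one tab-separated table row into a dict of typed fields."""
    rank, tool, score, tier, stars, language = line.split("\t")
    # The repository name is everything before the first space.
    repo, _, description = tool.partition(" ")
    return {
        "rank": int(rank),
        "repo": repo,
        "description": description,
        "score": int(score),
        "tier": tier,
        # A dash in the Stars column is treated as "not listed".
        "stars": None if stars == "—" else int(stars),
        "language": language,
    }


tools = [parse_row(line) for line in ROWS]

# Repositories scoring above 50, matching the summary at the top of the page.
above_50 = [t["repo"] for t in tools if t["score"] > 50]

# Number of sampled tools per tier.
tier_counts = Counter(t["tier"] for t in tools)
```

Keeping missing star counts as `None` rather than 0 avoids treating an unlisted value as a real measurement when sorting or averaging.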

Comparisons in this category

whisper-finetune and whisper-prep (51 vs 38)