Pikurrot/whisper-gui
A simple GUI to use Whisper.
This tool helps you quickly convert audio and video files into text. You input your media, and it outputs a written transcript in formats like SRT, JSON, or plain text, with optional timestamps and language detection. It's ideal for anyone who needs to transcribe interviews, meetings, lectures, or other spoken content without manual typing.
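As a rough illustration of what the SRT output format involves, the sketch below turns timestamped segments into SRT text. It is not whisper-gui's own code. The segment shape (dicts with `start`, `end`, and `text` keys, times in seconds) is an assumption, and the helper names `format_srt_timestamp` and `segments_to_srt` are hypothetical.

```python
def format_srt_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    ms_total = int(round(seconds * 1000))
    hours, rem = divmod(ms_total, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as numbered SRT cues separated by blank lines.

    Assumed input: an iterable of dicts with 'start', 'end' (seconds)
    and 'text' keys. This shape is an assumption, not the tool's API.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_srt_timestamp(seg["start"])
        end = format_srt_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello and welcome."},
        {"start": 2.5, "end": 5.0, "text": " Let's get started."},
    ]
    print(segments_to_srt(demo))
```

Each cue is an index line, a `start --> end` line, and the text, which is why a transcript with timestamps maps onto SRT directly while plain-text output can simply join the `text` fields.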
414 stars. No commits in the last 6 months.
Use this if you need an easy way to get accurate text transcripts from audio or video recordings, especially if you want to save them in different formats.
Not ideal if you need speaker identification or batch transcription of many files, as those are not yet fully supported.
Stars: 414
Forks: 39
Language: Python
License: MIT
Category:
Last pushed: Jul 18, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Pikurrot/whisper-gui"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
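The same request can be made from Python with only the standard library. This is a minimal sketch, not an official client: it assumes the endpoint returns JSON, and it assumes the three path segments after `quality/` are a category, the repository owner, and the repository name, as the curl URL suggests. The function names are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data, assuming the response body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Each call counts against the daily request limit.
    print(fetch_quality("voice-ai", "Pikurrot", "whisper-gui"))
```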
Higher-rated alternatives
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data,...
vasistalodagala/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets...
xenova/whisper-web
ML-powered speech recognition directly in your browser