ADT109119/WhisperX-GUI

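A user-friendly graphical interface for easily running WhisperX, an automatic speech recognition (ASR) tool that provides accurate transcription, robust speaker diarization, and word-level timestamp alignment. This GUI simplifies transcribing audio and video files: it requires no command-line use, and installation is nearly one click after first launch.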

Overall score: 21 / 100 (Experimental)

This tool helps you quickly and accurately convert audio and video files into text. You provide your audio or video, and it outputs a written transcript, optionally with speaker identification and precise word-level timestamps. It's designed for anyone who needs to transcribe interviews, lectures, meetings, or multimedia content without using complex command-line tools.
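To make the output described above concrete, here is a minimal sketch of how word-level timestamps and speaker labels could be rendered as a readable transcript. The `Word` record, the `SPEAKER_00`-style labels, and the helper names are hypothetical illustrations, not the data format or code used by WhisperX-GUI.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Word:
    """Hypothetical record for one transcribed word (not the tool's real schema)."""
    text: str
    start: float  # start time in seconds
    end: float    # end time in seconds
    speaker: Optional[str] = None  # speaker label, if diarization was run


def format_timestamp(seconds: float) -> str:
    """Format seconds as MM:SS.mmm."""
    total_ms = int(round(seconds * 1000))
    minutes, remainder_ms = divmod(total_ms, 60_000)
    secs, ms = divmod(remainder_ms, 1000)
    return f"{minutes:02d}:{secs:02d}.{ms:03d}"


def _render_group(group: list[Word]) -> str:
    """Render one run of consecutive words spoken by the same speaker."""
    speaker = group[0].speaker or "UNKNOWN"
    text = " ".join(word.text for word in group)
    span = f"{format_timestamp(group[0].start)} - {format_timestamp(group[-1].end)}"
    return f"[{span}] {speaker}: {text}"


def render_transcript(words: list[Word]) -> list[str]:
    """Group consecutive words by speaker and return one line per speaker turn."""
    lines: list[str] = []
    group: list[Word] = []
    for word in words:
        # A change of speaker closes the current turn.
        if group and word.speaker != group[-1].speaker:
            lines.append(_render_group(group))
            group = []
        group.append(word)
    if group:
        lines.append(_render_group(group))
    return lines
```

Calling `render_transcript` on a list of `Word` records yields one line per speaker turn, each prefixed with the start time of its first word and the end time of its last word.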

No commits in the last 6 months.

Use this if you need to transcribe multiple audio or video files into text, identify different speakers, and get exact timestamps for each word, all through an easy-to-use visual interface.

Not ideal if you prefer to type out transcripts manually or require human-level accuracy for highly sensitive or nuanced legal or medical transcription.

transcription media-analysis content-creation interview-documentation lecture-notes
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 4 / 25
Maturity 15 / 25
Community 0 / 25


Stars: 7
Forks:
Language: Python
License: Apache-2.0
Last pushed: May 27, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ADT109119/WhisperX-GUI"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
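The same request can be made from Python using only the standard library. This is a minimal sketch: the URL is the one shown in the curl command above, while the `category/owner/repo` path pattern, the helper names, and the assumption that the endpoint returns JSON are illustrative guesses rather than documented behavior.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# assumed to follow a category/owner/repo pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-score URL for one repository (hypothetical helper)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the score data, assuming the response body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

For this repository the call would be `fetch_quality("voice-ai", "ADT109119", "WhisperX-GUI")`. With the keyless limit of 100 requests/day, caching the result locally is advisable when polling more than occasionally.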