ADT109119/WhisperX-GUI

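A user-friendly graphical interface for easily invoking WhisperX, an automatic speech recognition (ASR) tool that provides accurate transcription, robust speaker diarization, and word-level timestamp alignment. This GUI simplifies transcribing audio and video files, requires no command-line use, and can be installed with almost a single click after it is opened.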

Experimental

This tool helps you quickly and accurately convert audio and video files into text. You provide your audio or video, and it outputs a written transcript, optionally with speaker identification and precise word-level timestamps. It's designed for anyone who needs to transcribe interviews, lectures, meetings, or multimedia content without using complex command-line tools.
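As a rough illustration of what a transcript with speaker labels and timestamps might look like once it is written out, here is a minimal sketch. The segment layout (`start`, `end`, `text`, `speaker` keys) and the helper names `format_timestamp` and `render_transcript` are assumptions made for this example; they are not taken from this project's code, and the sample data is invented.

```python
from typing import Dict, List


def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as HH:MM:SS.mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def render_transcript(segments: List[Dict]) -> str:
    """Turn diarized segments into one readable line per segment.

    Each segment is assumed (hypothetically) to carry 'start', 'end',
    and 'text', plus an optional 'speaker' label.
    """
    lines = []
    for seg in segments:
        speaker = seg.get("speaker", "UNKNOWN")  # fallback when no label exists
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        lines.append(f"[{start} - {end}] {speaker}: {seg['text'].strip()}")
    return "\n".join(lines)


# Invented sample data, for illustration only.
sample_segments = [
    {"start": 0.0, "end": 2.5, "speaker": "SPEAKER_00", "text": " Hello there."},
    {"start": 2.5, "end": 4.0, "text": " Hi, thanks for joining."},
]

if __name__ == "__main__":
    print(render_transcript(sample_segments))
```

Keeping the formatting step separate from transcription means the same segments could be rendered to other layouts (for example, subtitles) without rerunning the speech model.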

No commits in the last 6 months.

Use this if you need to transcribe multiple audio or video files into text, identify different speakers, and get exact timestamps for each word, all through an easy-to-use visual interface.

Not ideal if you prefer to type out transcripts manually or require human-level accuracy for highly sensitive or nuanced legal or medical transcription.

transcription media-analysis content-creation interview-documentation lecture-notes

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 4 / 25

Maturity 15 / 25

Community 0 / 25

Stars

Forks

—

Language

Python

License

Apache-2.0

Higher-rated alternatives

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI
