mrhallonline/WhisperXTranscription4Researchers
This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).
This project helps qualitative researchers transcribe and organize audio and video files. It takes research recordings (such as interviews or focus groups) and generates text transcripts with individual speakers identified. The output is written as CSV, TXT, JSON, and VTT text files that are ready for analysis.
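To make the output formats concrete, here is a minimal sketch of how diarized segments could be written out as VTT, CSV, TXT, and JSON using only the Python standard library. It is not taken from the notebook: the segment layout (`start`, `end`, `speaker`, `text`) and all function names are assumptions chosen for illustration.

```python
import csv
import io
import json


def vtt_timestamp(seconds: float) -> str:
    """Format a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def to_vtt(segments) -> str:
    """Render segments as WebVTT cues, tagging each cue with its speaker."""
    lines = ["WEBVTT", ""]
    for seg in segments:
        lines.append(f"{vtt_timestamp(seg['start'])} --> {vtt_timestamp(seg['end'])}")
        lines.append(f"<v {seg['speaker']}>{seg['text'].strip()}")
        lines.append("")  # blank line terminates the cue
    return "\n".join(lines)


def to_csv(segments) -> str:
    """Render segments as CSV with one row per segment."""
    buf = io.StringIO()
    fields = ["start", "end", "speaker", "text"]
    writer = csv.DictWriter(buf, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for seg in segments:
        row = {name: seg[name] for name in fields}
        row["text"] = row["text"].strip()
        writer.writerow(row)
    return buf.getvalue()


def to_txt(segments) -> str:
    """Render segments as plain 'speaker: text' lines."""
    return "\n".join(f"{seg['speaker']}: {seg['text'].strip()}" for seg in segments)


def to_json(segments) -> str:
    """Serialize segments as indented JSON, keeping non-ASCII text readable."""
    return json.dumps(segments, indent=2, ensure_ascii=False)


# Hypothetical example data; real segments would come from the transcription step.
example_segments = [
    {"start": 0.0, "end": 2.5, "speaker": "SPEAKER_00", "text": " Hello there."},
    {"start": 2.5, "end": 5.25, "speaker": "SPEAKER_01", "text": " Thanks for joining."},
]
```

Calling `to_vtt(example_segments)` yields a subtitle file that can be loaded next to the recording, while `to_csv` gives one row per utterance for import into spreadsheet or qualitative analysis tools.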
Use this if you are a qualitative researcher needing to accurately transcribe and identify speakers in a batch of audio or video recordings, with options for anonymization.
Not ideal if you need real-time transcription or only have a single, short audio file to transcribe, as the setup is geared towards processing multiple files.
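The listing does not say how the anonymization options work. As one possible approach, the sketch below replaces diarization labels with neutral pseudonyms in order of first appearance. The segment layout, the `anonymize_speakers` name, and the `Participant` prefix are all hypothetical.

```python
def anonymize_speakers(segments, prefix="Participant"):
    """Return copies of the segments with speaker labels replaced by pseudonyms.

    Pseudonyms are numbered in order of first appearance. The mapping is
    returned as well so it can be stored separately from the transcript.
    """
    mapping = {}
    anonymized = []
    for seg in segments:
        label = seg["speaker"]
        if label not in mapping:
            mapping[label] = f"{prefix} {len(mapping) + 1}"
        # Copy the segment so the caller's data is left unchanged.
        anonymized.append({**seg, "speaker": mapping[label]})
    return anonymized, mapping


# Hypothetical example data.
raw_segments = [
    {"speaker": "SPEAKER_01", "text": "Can you describe your role?"},
    {"speaker": "SPEAKER_00", "text": "I coordinate the program."},
    {"speaker": "SPEAKER_01", "text": "For how long?"},
]
anonymized_segments, speaker_map = anonymize_speakers(raw_segments)
```

This only relabels speakers. Names or places mentioned inside the transcript text would need a separate redaction pass.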
Stars: 9
Forks: not listed
Language: Jupyter Notebook
License: GPL-3.0
Category: not listed
Last pushed: Oct 15, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mrhallonline/WhisperXTranscription4Researchers"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
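The same request can be made from Python with the standard library. This is a sketch under stated assumptions: the listing does not document the response, so the code assumes a JSON body and makes no claims about its fields, and the reading of the URL path as category, owner, and repository is inferred from the curl example above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the endpoint returns JSON; the schema is not documented here.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "mrhallonline", "WhisperXTranscription4Researchers")` sends the same request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is a sensible precaution when looking up many repositories.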
Higher-rated alternatives
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning