pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

/ 100

Established

This tool converts audio and video recordings into text transcripts, identifies the different speakers, and aligns the text with the audio. You provide an audio or video file (such as an interview, meeting recording, or lecture) and receive a detailed transcript that makes the spoken content easier to analyze. It is aimed at anyone who needs written text from recordings, such as journalists, researchers, and content creators.
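
To make "speaker-separated, time-aligned transcript" concrete, here is a minimal Python sketch of how such output could be turned into readable lines. The segment layout (a list of dicts with `start`, `end`, `speaker`, and `text` keys) is an assumption modeled on typical diarized transcription output, and `format_transcript` is a hypothetical helper, not part of this project.

```python
from typing import Dict, List


def format_transcript(segments: List[Dict]) -> str:
    """Render diarized, time-aligned segments as one line per speaker turn.

    Assumed (hypothetical) segment shape:
    {"start": float, "end": float, "speaker": str, "text": str}
    """
    turns: List[Dict] = []
    for seg in segments:
        speaker = seg.get("speaker", "UNKNOWN")  # fall back if no speaker label
        text = seg["text"].strip()
        if turns and turns[-1]["speaker"] == speaker:
            # Same speaker keeps talking: extend the current turn.
            turns[-1]["end"] = seg["end"]
            turns[-1]["text"] += " " + text
        else:
            # Speaker changed: start a new turn.
            turns.append(
                {"speaker": speaker, "start": seg["start"], "end": seg["end"], "text": text}
            )
    return "\n".join(
        f"[{t['start']:.2f}-{t['end']:.2f}] {t['speaker']}: {t['text']}" for t in turns
    )


if __name__ == "__main__":
    example = [
        {"start": 0.0, "end": 1.5, "speaker": "SPEAKER_00", "text": " Welcome to the meeting."},
        {"start": 1.5, "end": 3.0, "speaker": "SPEAKER_00", "text": " Let's begin."},
        {"start": 3.2, "end": 4.0, "speaker": "SPEAKER_01", "text": " Thanks."},
    ]
    print(format_transcript(example))
```

Merging consecutive segments from the same speaker keeps the transcript readable as a dialogue while preserving the start and end time of each turn.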

174 stars.

Use this if you regularly need accurate, speaker-separated transcripts from audio or video files and want to automate this process.

Not ideal if you only need occasional, simple transcriptions without speaker identification or precise timing, as it requires a specific technical setup.

transcription audio-analysis video-processing speaker-diarization content-analysis

No Package, No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25


Stars

174

Forks

Language

Python

License

MIT

Compare

whisperX-FastAPI and whisper-asr-webservice

whisperX-FastAPI and whisper.api

Related tools

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI

kurianbenoy/whisper_normalizer

A python package for whisper normalizer
