dimonier/batch-speech-to-text

Python wrapper for OpenAI's Whisper for processing all audio files in a specified folder and creating raw text + transcript with time stamps

/ 100

Emerging

This tool transcribes spoken words from audio and video files into text, helping you convert interviews, lectures, or meetings into written records. You provide one or more media files, and it outputs raw text and optionally a time-coded transcript. It is ideal for researchers, journalists, or anyone needing to analyze spoken content, especially in Russian.

No commits in the last 6 months.

Use this if you need to quickly and accurately convert a collection of audio or video recordings into written Russian text with correct punctuation and case, and optionally with timestamps.

Not ideal if you primarily work with languages other than Russian and require automatic punctuation and case recovery, or if you prefer a graphical user interface over command-line usage.

transcription media-analysis content-creation interview-analysis lecture-notes

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI

Explore Voice AI Tools

All categories Trending Voice AI directory Insights