dimonier/batch-speech-to-text
Python wrapper for OpenAI's Whisper for processing all audio files in a specified folder and creating raw text + transcript with time stamps
This tool transcribes spoken words from audio and video files into text, helping you convert interviews, lectures, or meetings into written records. You provide one or more media files, and it outputs raw text and optionally a time-coded transcript. It is ideal for researchers, journalists, or anyone needing to analyze spoken content, especially in Russian.
No commits in the last 6 months.
Use this if you need to quickly and accurately convert a collection of audio or video recordings into written Russian text with correct punctuation and case, and optionally with timestamps.
Not ideal if you primarily work with languages other than Russian and require automatic punctuation and case recovery, or if you prefer a graphical user interface over command-line usage.
Stars
20
Forks
2
Language
Python
License
MIT
Category
Last pushed
Apr 05, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/dimonier/batch-speech-to-text"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI