loglux/FlexAudioPrint
FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file
This tool helps you convert spoken audio into written text, perfect for turning recordings like meetings, lectures, or podcasts into readable documents. You simply upload an audio file (like MP3 or WAV), and it produces a text file with a precise transcription, including speaker labels and formatted dialogue. It's designed for anyone who needs to quickly and accurately get text from spoken words.
Use this if you need to transcribe audio files like interviews, presentations, or audio notes into well-formatted text, with options for translation and subtitle generation.
Not ideal if you require real-time transcription during live conversations or if you're working with extremely short, low-quality audio snippets where 'turbo' model's word-swallowing might be an issue.
Stars
10
Forks
—
Language
Python
License
MIT
Category
Last pushed
Jan 29, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/loglux/FlexAudioPrint"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning