shashikg/WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
This tool helps convert audio recordings into written text quickly and accurately. You feed in audio files, and it produces a transcript in various formats like TXT, JSON, or SRT. It's designed for anyone who needs fast and reliable transcriptions, such as journalists, researchers, content creators, or meeting facilitators.
541 stars. No commits in the last 6 months.
Use this if you need to transcribe audio files into text exceptionally fast, especially for large volumes of audio, and require high accuracy.
Not ideal if you primarily need to translate speech into a different language without transcription, or if you don't work with audio transcription.
Stars
541
Forks
73
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Aug 27, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shashikg/WhisperS2T"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning