sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
This project helps convert spoken audio into written text, or translate it into other languages, much faster than other tools. You provide an audio file, and it outputs an accurate transcription or translation. It's designed for data scientists or machine learning engineers who need to process large volumes of audio data efficiently.
4,690 stars. No commits in the last 6 months.
Use this if you need to quickly transcribe or translate long audio files using advanced machine learning models, especially if you have access to powerful hardware like TPUs or GPUs.
Not ideal if you are looking for a simple, out-of-the-box application for everyday audio transcription without needing to write code or manage machine learning infrastructure.
Stars
4,690
Forks
414
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Apr 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/sanchit-gandhi/whisper-jax"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning