sanchit-gandhi/whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

/ 100

Emerging

This project helps convert spoken audio into written text, or translate it into other languages, much faster than other tools. You provide an audio file, and it outputs an accurate transcription or translation. It's designed for data scientists or machine learning engineers who need to process large volumes of audio data efficiently.

4,690 stars. No commits in the last 6 months.

Use this if you need to quickly transcribe or translate long audio files using advanced machine learning models, especially if you have access to powerful hardware like TPUs or GPUs.

Not ideal if you are looking for a simple, out-of-the-box application for everyday audio transcription without needing to write code or manage machine learning infrastructure.

audio-transcription speech-to-text language-translation audio-processing machine-learning-operations

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

4,690

Forks

414

Language

Jupyter Notebook

License

Apache-2.0

Higher-rated alternatives

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

shhossain/BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...

oseiskar/autosubsync

Automatically synchronize subtitles with audio using machine learning

Explore Voice AI Tools

All categories Trending Voice AI directory Insights