vasistalodagala/whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

/ 100

Emerging

This project helps machine learning engineers and researchers improve Automatic Speech Recognition (ASR) performance for specific languages or accents. It takes audio recordings and their human-transcribed text as input, then customizes an existing Whisper ASR model. The output is a specialized ASR model that is more accurate for your unique audio data.

361 stars. No commits in the last 6 months.

Use this if you need an ASR model that performs exceptionally well on audio data with characteristics not fully captured by general-purpose models.

Not ideal if you simply need to transcribe general audio with an off-the-shelf ASR model, or if you don't have labeled audio-text pairs for fine-tuning.

speech-to-text voice-recognition audio-transcription natural-language-processing machine-learning-engineering

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

361

Forks

Language

Python

License

MIT

Compare

whisper-finetune and Whisper-Finetune

Higher-rated alternatives

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data,...

xenova/whisper-web

ML-powered speech recognition directly in your browser

Pikurrot/whisper-gui

A simple GUI to use Whisper.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights