i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

Score: 51 / 100 (Established)

This project helps machine learning engineers and researchers adapt the Whisper speech-to-text model for specific audio environments or accents. You provide an existing Whisper model and specialized audio datasets, and the project outputs a refined Whisper model that performs better on your unique data. It's designed for professionals working with speech recognition models.

Use this if you need to improve speech-to-text accuracy on audio that differs significantly from standard training sets, for example recordings with domain-specific jargon or unusual acoustic conditions.

Not ideal if you are looking for a pre-trained, off-the-shelf transcription service or a tool for basic audio file conversion without needing to customize the underlying model.

Tags: speech-recognition, audio-processing, machine-learning-engineering, natural-language-processing, model-adaptation
No package, no dependents
Maintenance 13 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 16 / 25

Stars: 22
Forks: 6
Language: Jupyter Notebook
License: MIT
Last pushed: Mar 18, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/i4Ds/whisper-finetune"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
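
The same request can be made from a script. The following is a minimal sketch using only the Python standard library. It assumes the endpoint returns JSON, which the page does not state, and it assumes other projects follow the same `/quality/<category>/<owner>/<repo>` pattern as the URL above. The helper names `quality_url` and `fetch_quality` are hypothetical and chosen for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# assumed to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a category and an 'owner/name' repository."""
    # Keep the slash between owner and repository name unescaped.
    return f"{BASE_URL}/{quote(category, safe='')}/{quote(repo, safe='/')}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record. Assumption: the response body is JSON."""
    request = urllib.request.Request(
        quality_url(category, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "i4Ds/whisper-finetune")` issues the same GET request as the curl command. Because the keyless tier is limited to 100 requests per day, caching the result locally is a reasonable precaution when polling several repositories.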