naver/multilingual-distilwhisper

This repository contains all the code needed to run the multilingual DistilWhisper model from the Ferraz et al. 2024 IEEE ICASSP paper.

Score: 21 / 100 (Experimental)

This project helps build more accurate automatic speech recognition (ASR) systems for specific languages. It takes audio as input and outputs transcribed text, with a focus on improving performance for less common or niche languages. It is aimed at machine learning engineers and researchers building speech-to-text applications where accuracy across diverse languages is critical.

Use this if you need to improve the accuracy of speech-to-text transcription for a specific language beyond what general multilingual models typically offer.

Not ideal if you primarily work with common languages where general ASR models already perform well, or if you don't have the technical expertise to train and fine-tune machine learning models.

Tags: speech-to-text, ASR, machine-learning-engineering, language-technology, audio-transcription
No License, No Package, No Dependents
Maintenance 6 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 0 / 25

Stars: 33
Forks: (not shown)
Language: Python
License: None
Last pushed: Oct 23, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/naver/multilingual-distilwhisper"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
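
The same request can be made from Python. The sketch below is a minimal, standard-library-only version of the curl call above. Two things in it are assumptions rather than documented behavior: that the URL path follows a category/owner/repo pattern (inferred from the single example URL), and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical and exist only for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL shown above; the API may define it differently.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send an unauthenticated GET request and decode the body.

    Assumption: the response body is JSON; its fields are not documented
    on this page, so the result is returned as-is without interpretation.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "naver", "multilingual-distilwhisper")` issues the same request as the curl command. Because unauthenticated access is limited to 100 requests/day, it is worth caching the result rather than fetching it repeatedly.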