naver/multilingual-distilwhisper

This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.

/ 100

Experimental

This project helps create highly accurate automated speech recognition (ASR) systems for specific languages. It takes audio input and outputs transcribed text, focusing on improving performance for less common or niche languages. It's designed for machine learning engineers or researchers building speech-to-text applications where precision in diverse languages is critical.

Use this if you need to improve the accuracy of speech-to-text transcription for a specific language beyond what general multilingual models typically offer.

Not ideal if you primarily work with common languages where general ASR models already perform well, or if you don't have the technical expertise to train and fine-tune machine learning models.

speech-to-text ASR machine-learning-engineering language-technology audio-transcription

No License No Package No Dependents

Maintenance 6 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

—

Higher-rated alternatives

YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech...

adi-gov-tw/Taiwan-Tongues-ASR-CE

Taiwan Tongues ASR CE 是一個開源語音辨識（Automatic Speech Recognition, ASR）模型專案，專為台灣多元語言環境設計。本模型支援...

huggingface/distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

phineas-pta/fine-tune-whisper-vi

jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2

KevKibe/African-Whisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights