di37/speech-to-text-fine-tuning-on-unseen-language

This project aims to show how a Whisper model can be fine-tuned on a language it was not trained on, provided it was trained on a similar language.


Experimental

This project helps speech technologists and AI engineers adapt existing speech-to-text AI models to new, previously unsupported languages. You provide audio recordings and their corresponding text transcripts in the target language. The output is an improved speech-to-text model capable of transcribing the new language more accurately.
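The input described above (audio recordings paired with transcripts) could be gathered roughly as follows. This is a minimal sketch, not the project's own loader: the function name `pair_audio_with_transcripts`, the folder layout (each audio file sitting next to a same-named `.txt` transcript), and the list of audio extensions are all assumptions made for illustration.

```python
from pathlib import Path


def pair_audio_with_transcripts(data_dir, audio_exts=(".wav", ".mp3", ".flac")):
    """Return (audio_file_name, transcript_text) pairs found in data_dir.

    Assumed layout (hypothetical): clip001.wav sits next to clip001.txt.
    Audio files with a missing or empty transcript are skipped.
    """
    data_dir = Path(data_dir)
    pairs = []
    for audio_path in sorted(data_dir.iterdir()):
        if audio_path.suffix.lower() not in audio_exts:
            continue  # not an audio file we recognise
        transcript_path = audio_path.with_suffix(".txt")
        if not transcript_path.exists():
            continue  # no transcript, so the clip cannot be used for training
        text = transcript_path.read_text(encoding="utf-8").strip()
        if text:
            pairs.append((audio_path.name, text))
    return pairs
```

Skipping unpaired or empty entries up front keeps later training steps from failing on incomplete examples; a real pipeline would likely also check audio duration and sampling rate.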

No commits in the last 6 months.

Use this if you need to create a high-performing speech-to-text model for a language that current general-purpose models don't support well but that is similar to a language they do support.
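Whether a fine-tuned model is "high-performing" is commonly judged by word error rate (WER): the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below is a plain-Python illustration of that metric; the listing does not say which metric the project reports, so treat its use here as an assumption, and the function name `word_error_rate` as hypothetical.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (minus the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

Comparing the WER of the base model and the fine-tuned model on the same held-out recordings gives a direct measure of how much the adaptation helped. Text normalization (casing, punctuation) before scoring is left out here and would matter in practice.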

Not ideal if you need a speech-to-text solution for a widely supported language, or if you don't have existing audio and text data for your target language.

speech-recognition natural-language-processing AI-model-adaptation low-resource-languages machine-learning-engineering

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 0 / 25


Language: Jupyter Notebook

Higher-rated alternatives

LM-Kit/LynxTranscribe

LynxTranscribe is a comprehensive, professional-grade audio transcription application built with...

RiteshGenAI/openai_whisper_transcribe_yt_videos

This project is a Streamlit-based application that allows users to download audio from YouTube...

victorwoo/transcript-video

A PowerShell script that automatically generates subtitles in bulk for video files using...

SouthernMethodistUniversity/whisper-transcription

Helm chart repo for application developed by OIT STARs students for audio transcription using...
