matlab-deep-learning/wav2vec-2.0

This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB.

/ 100

Emerging

This tool helps researchers, audio analysts, and developers working with speech translate spoken audio directly into written text. You feed it an audio file containing speech, and it outputs the transcribed text. It's designed for anyone needing to quickly and accurately convert spoken words into a written format within a MATLAB environment.

No commits in the last 6 months.

Use this if you need to convert spoken English audio files into written text within MATLAB, especially for research or analysis where accurate transcription is critical.

Not ideal if you need to transcribe audio in languages other than English or require real-time transcription from a live audio stream.

speech-to-text audio-analysis linguistics voice-processing transcription

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

felixbur/nkululeko

Machine learning speaker characteristics

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

juanmc2005/diart

A python package to build AI-powered real-time audio applications

astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Explore ML Frameworks

All categories Trending ML Framework directory Insights