tarun-bisht/wav2vec2-asr

wav2vec2 asr with transformers

/ 100

Experimental

This project helps convert spoken words into written text. It takes an audio recording or a live voice input and produces a transcription of what was said. This is useful for anyone who needs to quickly convert spoken content into a readable format, such as journalists, researchers, or meeting facilitators.

No commits in the last 6 months.

Use this if you need a way to automatically transcribe audio files or live speech into text.

Not ideal if you require real-time, highly accurate transcription for very specialized or noisy audio without prior language model training.

audio-transcription speech-to-text voice-recording-analysis content-creation data-entry-automation

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

felixbur/nkululeko

Machine learning speaker characteristics

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

juanmc2005/diart

A python package to build AI-powered real-time audio applications

astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Explore ML Frameworks

All categories Trending ML Framework directory Insights