daanzu/wav2vec2_stt_python

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

/ 100

Emerging

This is a simple Python library for converting spoken audio into written text. You provide audio files, typically in WAV format, and it outputs the transcribed text. This is useful for developers who need to integrate high-quality speech-to-text capabilities into their applications on Linux.

No commits in the last 6 months.

Use this if you are a Python developer building an application that needs to accurately convert audio recordings into text using Wav2Vec2 2.0 models.

Not ideal if you need a graphical user interface, an out-of-the-box solution for non-developers, or robust support for macOS or Windows.

audio-transcription speech-to-text natural-language-processing developer-tool

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

AGPL-3.0

Higher-rated alternatives

liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

louiskirsch/speechT

An opensource speech-to-text software written in tensorflow

Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights