daanzu/wav2vec2_stt_python

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

33
/ 100
Emerging

This is a simple Python library for converting spoken audio into written text. You provide audio files, typically in WAV format, and it outputs the transcribed text. This is useful for developers who need to integrate high-quality speech-to-text capabilities into their applications on Linux.

No commits in the last 6 months.

Use this if you are a Python developer building an application that needs to accurately convert audio recordings into text using Wav2Vec2 2.0 models.

Not ideal if you need a graphical user interface, an out-of-the-box solution for non-developers, or robust support for macOS or Windows.

audio-transcription speech-to-text natural-language-processing developer-tool
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 11 / 25

How are scores calculated?

Stars

23

Forks

3

Language

Python

License

AGPL-3.0

Last pushed

Aug 16, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/daanzu/wav2vec2_stt_python"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.