daanzu/wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
This is a simple Python library for converting spoken audio into written text. You provide audio files, typically in WAV format, and it outputs the transcribed text. This is useful for developers who need to integrate high-quality speech-to-text capabilities into their applications on Linux.
No commits in the last 6 months.
Use this if you are a Python developer building an application that needs to accurately convert audio recordings into text using wav2vec 2.0 models.
Not ideal if you need a graphical user interface, an out-of-the-box solution for non-developers, or robust support for macOS or Windows.
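As a rough illustration of the input side, the sketch below uses only Python's standard wave module to load a WAV file and report its format before the audio would be handed to a recognizer. The names WavAudio, load_wav, and is_16khz_mono_16bit are made up for this example, and the 16 kHz, mono, 16-bit check is an assumption (wav2vec 2.0 models are generally trained on 16 kHz audio). The library's own decoding call is left out because this page does not describe it.

```python
import wave
from typing import NamedTuple


class WavAudio(NamedTuple):
    """Hypothetical container for this sketch, not part of the library."""
    samples: bytes      # raw PCM frames as read from the file
    sample_rate: int    # frames per second, in Hz
    channels: int       # 1 = mono, 2 = stereo
    sample_width: int   # bytes per sample (2 = 16-bit)


def load_wav(path: str) -> WavAudio:
    """Read all PCM frames and basic format info from a WAV file."""
    with wave.open(path, "rb") as wav_file:
        return WavAudio(
            samples=wav_file.readframes(wav_file.getnframes()),
            sample_rate=wav_file.getframerate(),
            channels=wav_file.getnchannels(),
            sample_width=wav_file.getsampwidth(),
        )


def is_16khz_mono_16bit(audio: WavAudio) -> bool:
    """Check the input format assumed in this sketch: 16 kHz, mono, 16-bit PCM."""
    return (audio.sample_rate, audio.channels, audio.sample_width) == (16000, 1, 2)
```

Checking the format up front makes a mismatched recording (for example 44.1 kHz stereo) fail early instead of producing a poor transcription later.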
Stars: 23
Forks: 3
Language: Python
License: AGPL-3.0
Category
Last pushed: Aug 16, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/daanzu/wav2vec2_stt_python"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
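The same request can be made from Python with the standard library. This is a minimal sketch: the helper names quality_url and fetch_quality are made up for illustration, reading the first path segment as a category is a guess based on the shape of the curl URL, and parsing the response as JSON is an assumption, since this page does not document the response format.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command on this page.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl command.

    Calling the first segment "category" is an assumption of this sketch.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return API_BASE + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and parse the body as JSON (assumed response format)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling fetch_quality("voice-ai", "daanzu", "wav2vec2_stt_python") would request the URL shown in the curl command. Unauthenticated calls count against the 100 requests/day limit.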
Higher-rated alternatives
liangstein/Chinese-speech-to-text
Chinese Speech To Text Using Wavenet
louiskirsch/speechT
Open-source speech-to-text software written in TensorFlow
Open-Speech-EkStep/vakyansh-models
Open source speech to text models for Indic Languages
oliverguhr/wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Open-Speech-EkStep/vakyansh-wav2vec2-experimentation
Repository containing an experimentation platform for training and running inference on wav2vec2 models.