vadimkantorov/inferspeech

PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant

/ 100

Experimental

This is a tool for converting spoken audio recordings into written text. You provide an audio file, and it outputs a transcription of the speech within that file. This is ideal for developers who need to integrate basic speech-to-text functionality into their applications, especially those experimenting with AI models.

No commits in the last 6 months.

Use this if you are a developer looking for a basic, script-based solution to convert English or Russian audio files into text for proof-of-concept or integration work.

Not ideal if you need a robust, production-ready speech-to-text system that can handle large volumes of audio or requires advanced features like chunking and different decoding strategies.

speech-to-text audio-transcription AI-model-inference natural-language-processing developer-tooling

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

louiskirsch/speechT

An opensource speech-to-text software written in tensorflow

Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights