vadimkantorov/inferspeech

PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant

28
/ 100
Experimental

This is a tool for converting spoken audio recordings into written text. You provide an audio file, and it outputs a transcription of the speech within that file. This is ideal for developers who need to integrate basic speech-to-text functionality into their applications, especially those experimenting with AI models.

No commits in the last 6 months.

Use this if you are a developer looking for a basic, script-based solution to convert English or Russian audio files into text for proof-of-concept or integration work.

Not ideal if you need a robust, production-ready speech-to-text system that can handle large volumes of audio or requires advanced features like chunking and different decoding strategies.

speech-to-text audio-transcription AI-model-inference natural-language-processing developer-tooling
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 15 / 25

How are scores calculated?

Stars

10

Forks

4

Language

Python

License

Last pushed

Aug 12, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/vadimkantorov/inferspeech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.