Narasimha1997/wavenet-stt

An end-to-end speech recognition system with Wavenet. Built using C++ and python.

22
/ 100
Experimental

This project offers an end-to-end speech recognition system based on DeepMind's Wavenet research, converting spoken audio directly into text. It takes audio inputs and outputs the transcribed words. This tool is for developers and researchers who need to implement or experiment with state-of-the-art speech-to-text capabilities.

No commits in the last 6 months.

Use this if you are a developer looking for a C++ and Python implementation of a Wavenet-based speech recognition system for integration into applications or for research.

Not ideal if you are an end-user needing a ready-to-use application for transcribing audio without any programming or technical setup.

speech-to-text audio-processing machine-learning-development natural-language-processing AI-research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

21

Forks

Language

Python

License

GPL-3.0

Last pushed

Sep 21, 2020

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Narasimha1997/wavenet-stt"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.