Narasimha1997/wavenet-stt

An end-to-end speech recognition system with Wavenet. Built using C++ and python.

/ 100

Experimental

This project offers an end-to-end speech recognition system based on DeepMind's Wavenet research, converting spoken audio directly into text. It takes audio inputs and outputs the transcribed words. This tool is for developers and researchers who need to implement or experiment with state-of-the-art speech-to-text capabilities.

No commits in the last 6 months.

Use this if you are a developer looking for a C++ and Python implementation of a Wavenet-based speech recognition system for integration into applications or for research.

Not ideal if you are an end-user needing a ready-to-use application for transcribing audio without any programming or technical setup.

speech-to-text audio-processing machine-learning-development natural-language-processing AI-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

GPL-3.0

Higher-rated alternatives

liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

louiskirsch/speechT

An opensource speech-to-text software written in tensorflow

Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights