tacotron and tacotron_asr

These are ecosystem siblings: tacotron implements the Tacotron architecture for text-to-speech (TTS), while tacotron_asr adapts the same architecture to the reverse task, automatic speech recognition (ASR).

                tacotron                              tacotron_asr
Score           51                                    47
Status          Established                           Emerging
Maintenance     0/25                                  0/25
Adoption        10/25                                 10/25
Maturity        16/25                                 16/25
Community       25/25                                 21/25
Stars           1,833                                 164
Forks           431                                   39
Downloads       (none listed)                         (none listed)
Commits (30d)   0                                     0
Language        Python                                Python
License         Apache-2.0                            Apache-2.0
Flags           Stale 6m, No Package, No Dependents   Stale 6m, No Package, No Dependents

About tacotron

Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

This project transforms written text into natural-sounding spoken audio. You input text, and it produces an audio file of that text being read aloud, much like an audiobook. It suits content creators, educators, or anyone who needs to generate speech from text.

text-to-speech audiobook-creation content-localization narration e-learning

About tacotron_asr

Kyubyong/tacotron_asr

Speech Recognition Using Tacotron

This project transcribes spoken audio into written text. You provide audio files containing spoken English, and it outputs the corresponding text. This is useful for anyone who needs to generate text from voice recordings automatically, such as researchers working with spoken language data or content creators subtitling audio.

audio-transcription voice-to-text content-creation linguistics data-processing

Scores updated daily from GitHub, PyPI, and npm data.