sp-nitech/DNN-HSMM
PyTorch implementation of DNN-HSMM for TTS
This project helps researchers in speech technology develop and evaluate new text-to-speech (TTS) systems. It takes linguistic and acoustic features from speech data as input and produces a trained model and generated acoustic features as output. The primary user is a speech synthesis researcher or engineer.
No commits in the last 6 months.
Use this if you are a speech synthesis researcher interested in experimenting with DNN-HSMM models for building statistical parametric text-to-speech systems.
Not ideal if you are looking for an out-of-the-box text-to-speech system for end-user applications or if you want to synthesize audio directly from text without needing to work with acoustic features.
Stars: 70
Forks: 15
Language: Python
License: BSD-3-Clause
Category:
Last pushed: Mar 14, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/sp-nitech/DNN-HSMM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
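The same request can be made from Python. The following is a minimal sketch, not an official client. It assumes the endpoint returns a JSON body and that the path follows the category/owner/repository pattern seen in the curl example. The names `build_quality_url` and `fetch_quality` are hypothetical helpers introduced here for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Compose the endpoint URL.

    Assumption: other repositories follow the same
    /<category>/<owner>/<repo> pattern as the curl example.
    """
    path = "/".join(quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/{path}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data for one repository.

    Assumption: the response body is JSON. The schema is not documented
    here, so the parsed object is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    return json.loads(body)
```

Calling `fetch_quality("voice-ai", "sp-nitech", "DNN-HSMM")` issues the same GET request as the curl command. Without a key, each call counts against the limit of 100 requests per day, so caching the result locally is worth considering.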
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model