Barbany/Multi-speaker-Neural-Vocoder

Bachelor's thesis carried at Universitat Politecnica de Catalunya in partial fullfilment of the requirements for the degree in Telecommunications Technologies and Services Engineering

/ 100

Emerging

This project helps researchers in speech technology by synthesizing human-like speech from existing voice recordings, generating new audio in multiple Spanish voices. It takes acoustic parameters extracted from speech and produces synthesized speech. Speech technology researchers or academics working on voice synthesis would use this.

No commits in the last 6 months.

Use this if you are a researcher developing new neural vocoders and need a multi-speaker speech synthesis baseline model for Spanish voices.

Not ideal if you need a readily deployable, production-ready speech synthesis system for general use.

speech-synthesis voice-generation audio-research neural-networks computational-linguistics

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights