huawei-noah/Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Score: 43 / 100 (Emerging)

This project offers advanced speech technology models, including Grad-TTS for creating natural-sounding speech from text, SPIRAL for self-supervised speech representation learning, and DiffVC for converting one person's voice to another while preserving content. It's designed for researchers and engineers working on speech synthesis and voice manipulation.

602 stars. No commits in the last 6 months.

Use this if you are a speech researcher or engineer developing new text-to-speech systems or voice conversion applications.

Not ideal if you need a ready-to-use speech application or an SDK for general audio processing tasks.

speech-synthesis voice-conversion audio-generation machine-learning-research speech-technology
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 25 / 25

Stars: 602
Forks: 130
Language: Jupyter Notebook
License: none listed
Last pushed: Sep 18, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/huawei-noah/Speech-Backbones"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
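
The same request can be issued from a script. The following is a minimal sketch in Python using only the standard library. It assumes the endpoint returns a JSON body, which this page does not document; `quality_url` and `fetch_quality` are hypothetical helper names, and the `voice-ai` path segment is simply copied from the curl command above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = (quote(p, safe="") for p in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for one repository.

    Assumption: the response body is JSON. The field names are not
    documented on this page, so the parsed object is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(quality_url("voice-ai", "huawei-noah", "Speech-Backbones"))
```

With the keyless limit of 100 requests/day, it may be worth caching the result of `fetch_quality` rather than calling it on every run.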