ScottishFold007/TTSAudioNormalizer

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

32
/ 100
Emerging

This tool helps professionals creating Text-to-Speech (TTS) voices to analyze and standardize their audio datasets. It takes raw audio recordings, often from multiple speakers, and processes them to ensure consistent volume, clarity, and format. The output is a set of high-quality, uniformly processed audio files optimized for training robust TTS models. This is ideal for voice talent managers, audio engineers, or AI researchers building custom voice models.

111 stars. No commits in the last 6 months.

Use this if you need to prepare diverse raw audio recordings into a consistent, high-quality dataset for training Text-to-Speech models, ensuring uniform loudness and clarity across all samples.

Not ideal if you are looking for a general-purpose audio editor or need to process single audio files for non-TTS applications like music production or podcasting.

Text-to-Speech production voice dataset preparation audio engineering AI voice modeling speech synthesis
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 8 / 25
Community 15 / 25

How are scores calculated?

Stars

111

Forks

15

Language

Python

License

Last pushed

Dec 20, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ScottishFold007/TTSAudioNormalizer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.