ScottishFold007/TTSAudioNormalizer
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.
This tool helps professionals creating Text-to-Speech (TTS) voices to analyze and standardize their audio datasets. It takes raw audio recordings, often from multiple speakers, and processes them to ensure consistent volume, clarity, and format. The output is a set of high-quality, uniformly processed audio files optimized for training robust TTS models. This is ideal for voice talent managers, audio engineers, or AI researchers building custom voice models.
111 stars. No commits in the last 6 months.
Use this if you need to prepare diverse raw audio recordings into a consistent, high-quality dataset for training Text-to-Speech models, ensuring uniform loudness and clarity across all samples.
Not ideal if you are looking for a general-purpose audio editor or need to process single audio files for non-TTS applications like music production or podcasting.
Stars
111
Forks
15
Language
Python
License
—
Category
Last pushed
Dec 20, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ScottishFold007/TTSAudioNormalizer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechio/chinese_text_normalization
Chinese text normalization for speech processing
NickZaitsev/ru-normalizr
ru-normalizr — лучший open-source нормализатор русского текста. Приводит числа, даты, время,...
gladiaio/normalization
A lightweight library for normalizing speech transcripts before computing WER
34j/mecab-text-cleaner
Simple Python package (CLI/Python API) for getting japanese readings (yomigana) and accents using MeCab.
repodiac/german_transliterate
Python module to clean and transliterate (i.e. normalize) German text including abbreviations,...