crlandsc/torch-log-wmse

logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.

46
/ 100
Emerging

This tool helps audio engineers and researchers evaluate and improve audio source separation and denoising models. It takes unprocessed, processed, and target audio files as input, and outputs a quality score that accurately reflects human perception, especially for segments containing digital silence. It's designed for anyone working on machine learning models that process audio.

Available on PyPI.

Use this if you are training or evaluating audio processing models, particularly for source separation or denoising, and need a robust quality metric that handles digital silence and aligns with human hearing.

Not ideal if you need a metric that is invariant to arbitrary scaling or polarity inversion, or if you require a full model of human auditory perception like auditory masking.

audio-processing sound-engineering machine-learning-audio audio-quality-assessment speech-enhancement
Maintenance 10 / 25
Adoption 8 / 25
Maturity 25 / 25
Community 3 / 25

How are scores calculated?

Stars

45

Forks

1

Language

Python

License

Apache-2.0

Last pushed

Jan 29, 2026

Commits (30d)

0

Dependencies

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/crlandsc/torch-log-wmse"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.