A-SHOJAEI/cross-lingual-phoneme-aware-speech-enhancement-with-adaptive-masking
Multi-stage speech enhancement system that leverages cross-lingual phoneme embeddings to guide adaptive time-frequency masking for noise reduction in low-resource languages. The model uses phoneme-conditioned attention to learn language-agnostic acoustic patterns from high-resource languages (English, Spanish) and transfers them to low-resource lan
Stars
—
Forks
—
Language
Python
License
MIT
Category
Last pushed
Feb 21, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/A-SHOJAEI/cross-lingual-phoneme-aware-speech-enhancement-with-adaptive-masking"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the...
Rikorose/DeepFilterNet
Noise supression using deep filtering
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
marl/openl3
OpenL3: Open-source deep audio and image embeddings
archinetai/audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.