kaistmm/seed-pytorch

[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"

/ 100

Experimental

This project helps speech technologists improve the accuracy of speaker recognition systems when dealing with noisy audio. It takes existing speaker embeddings (digital representations of a voice) from a pre-trained model and refines them to be more robust to background noise. The result is more accurate speaker identification, even in challenging acoustic environments, benefiting professionals working on voice authentication or speaker diarization.

Use this if your speaker recognition or verification system struggles with performance degradation due to environmental noise and you need to enhance the robustness of your speaker embeddings.

Not ideal if your primary goal is to train a speaker recognition model from scratch, as this tool focuses on enhancing existing speaker embeddings rather than building them.

speaker-recognition voice-biometrics speech-enhancement audio-processing speech-technology

No License No Package No Dependents

Maintenance 6 / 25

Adoption 8 / 25

Maturity 7 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

felixbur/nkululeko

Machine learning speaker characteristics

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

juanmc2005/diart

A python package to build AI-powered real-time audio applications

astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Explore ML Frameworks

All categories Trending ML Framework directory Insights