theolepage/ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

/ 100

Experimental

This project provides pre-trained models that can identify who is speaking or what language is being spoken from audio recordings. It takes raw audio files as input and outputs classifications about the speaker or language. This is useful for researchers and developers working on speech technology applications like voice assistants or call center automation.

No commits in the last 6 months.

Use this if you need robust, self-supervised models for tasks involving speaker identification or language recognition from audio data.

Not ideal if you are looking for ready-to-use applications for end-users, as this project focuses on the underlying models for developers.

speaker-recognition language-identification audio-processing speech-technology voice-biometrics

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

AdaptiveMotorControlLab/CEBRA

Learnable latent embeddings for joint behavioral and neural analysis - Official implementation of CEBRA

theolepage/sslsv

Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker...

PaddlePaddle/PASSL

PASSL包含 SimCLR，MoCo v1/v2，BYOL，CLIP，PixPro，simsiam, SwAV, BEiT，MAE 等图像自监督算法以及 Vision...

YGZWQZD/LAMDA-SSL

30 Semi-Supervised Learning Algorithms

ModSSC/ModSSC

ModSSC: A Modular Framework for Semi Supervised Classification

Explore ML Frameworks

All categories Trending ML Framework directory Insights