theolepage/ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

Score: 22 / 100 (Experimental)

This project provides pre-trained models that can identify who is speaking or what language is being spoken from audio recordings. It takes raw audio files as input and outputs classifications about the speaker or language. This is useful for researchers and developers working on speech technology applications like voice assistants or call center automation.

No commits in the last 6 months.

Use this if you need robust, self-supervised models for tasks involving speaker identification or language recognition from audio data.

Not ideal if you are looking for ready-to-use applications for end-users, as this project focuses on the underlying models for developers.

speaker-recognition language-identification audio-processing speech-technology voice-biometrics
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 8 / 25


Stars: 19
Forks: 2
Language: Jupyter Notebook
License: None
Last pushed: Jan 18, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/theolepage/ssl-for-slr"

Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
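
The same request can be made from Python. The sketch below uses only the standard library and makes two assumptions that the page does not confirm: that the URL path follows a category/owner/repo pattern (inferred from the single curl example), and that the response body is JSON. The names `quality_url` and `fetch_quality` are hypothetical helpers for illustration, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, as the one
    curl example suggests. Each segment is percent-encoded.
    """
    segments = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(s, safe="") for s in segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send the unauthenticated GET request and decode the body.

    Assumption: the body is JSON. Its fields are not documented on
    this page, so the caller should inspect the result before use.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; no network traffic is generated here.
    print(quality_url("ml-frameworks", "theolepage", "ssl-for-slr"))
    # To perform the request (counts against the daily limit):
    # data = fetch_quality("ml-frameworks", "theolepage", "ssl-for-slr")
```

Keeping URL construction separate from the network call makes it possible to check the URL without spending any of the daily request allowance.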