habla-liaa/encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

/ 100

Experimental

This is a feature extractor designed for researchers and machine learning engineers working with audio data. It takes raw audio files as input and outputs a structured set of features (embeddings) that represent the audio's content. These embeddings can then be used for various downstream tasks like audio classification or similarity search, making it easier to analyze and understand complex audio patterns.

101 stars. No commits in the last 6 months.

Use this if you need to transform raw audio into meaningful, numerical representations for machine learning models, especially if you're exploring universal audio representation learning.

Not ideal if you primarily need to manipulate or generate audio waveforms directly without an intermediate feature extraction step.

audio-analysis speech-processing sound-recognition audio-research machine-learning-engineering

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 7 / 25

How are scores calculated?

Stars

101

Forks

Language

Python

License

—

Higher-rated alternatives

Westlake-AI/openmixup

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

YU1ut/MixMatch-pytorch

Code for "MixMatch - A Holistic Approach to Semi-Supervised Learning"

kamata1729/QATM_pytorch

Pytorch Implementation of QATM:Quality-Aware Template Matching For Deep Learning

nttcslab/msm-mae

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations

rgeirhos/generalisation-humans-DNNs

Data, code & materials from the paper "Generalisation in humans and deep neural networks" (NeurIPS 2018)

Explore ML Frameworks

All categories Trending ML Framework directory Insights