palonso/MAEST
Pre-training, fine-tuning, and inference code with the MAEST models for music analysis applications.
This tool helps music researchers and analysts automatically categorize and understand musical pieces. You input raw audio files or mel-spectrograms, and it outputs detailed musical embeddings and label predictions, telling you about the characteristics of the music. It's designed for anyone working with large collections of audio who needs to extract meaningful features for tasks like genre classification or mood detection.
No commits in the last 6 months.
Use this if you need to extract advanced features and predictions from music audio for research, content analysis, or building music information retrieval systems.
Not ideal if you're looking for a simple, off-the-shelf music player or a tool for basic audio editing.
Stars
54
Forks
4
Language
Python
License
AGPL-3.0
Category
Last pushed
Jun 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/palonso/MAEST"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
CouncilDataProject/speakerbox
Speakerbox: Fine-tune Audio Transformers for speaker identification.
CVxTz/music_genre_classification
music genre classification : LSTM vs Transformer
HHousen/speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
icon-lab/HST
Official implementation of Hierarchical Spectrogram Transformers (HST)
aaronstevenwhite/spectrans
Modular spectral transformer implementations in PyTorch with Fourier, wavelet, and other...