galilai-group/stable-pretraining
Reliable, minimal and scalable library for pretraining foundation and world models
This project helps machine learning engineers and researchers efficiently train 'foundation models' using self-supervised learning techniques. It takes raw, unlabeled datasets (like images or text) and produces highly effective, general-purpose models that can then be adapted for many specific tasks. The core value is providing real-time visibility into the model's learning process, helping to quickly identify and fix issues.
Use this if you are a machine learning engineer or researcher focused on developing powerful, general-purpose AI models from large, unlabeled datasets using self-supervised methods, and you need robust tools for monitoring and debugging the training process.
Not ideal if you are primarily working with traditional supervised learning tasks, or if you are not deeply involved in the development and pre-training of large-scale foundation models.
Stars: 133
Forks: 27
Language: Python
License: MIT
Category:
Last pushed: Mar 05, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/galilai-group/stable-pretraining"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
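As a minimal sketch, the same request can be made from Python. The URL layout (`/quality/<category>/<owner>/<repo>`) is taken from the curl example above; the assumption that the endpoint returns JSON is not confirmed here, and the response schema is not documented on this page.

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-data URL for a repository.

    The path layout mirrors the curl example; "transformers" is the
    category segment used there.
    """
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch a repo's quality data.

    Assumes the endpoint returns a JSON body; this is an assumption,
    not something the page documents.
    """
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    data = fetch_quality("transformers", "galilai-group", "stable-pretraining")
    print(data)
```

Without a key this stays within the 100 requests/day free tier; how a key would be attached (header vs. query parameter) is not specified on this page.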
Related models
CognitiveAISystems/MAPF-GPT
[AAAI-2025] This repository contains MAPF-GPT, a deep learning-based model for solving MAPF...
UKPLab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled...
larslorch/avici
Amortized Inference for Causal Structure Learning, NeurIPS 2022
svdrecbd/mhc-mlx
MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by DeepSeek-AI.
kyegomez/MHMoE
Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch