SymbioticLab/ModelKeeper
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
ModelKeeper helps machine learning engineers and researchers speed up the training of deep neural networks (DNNs) in a cluster environment. It takes DNN models and their training configurations as input, then reuses components of previously trained models to "warm up" new training runs. The result is faster DNN training, which cuts the time and compute needed for experimentation and deployment.
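To make the "warm up" idea concrete, here is a minimal sketch of reusing trained parameters in a new model. It is not ModelKeeper's API or its matching algorithm: the `warm_start` function, the layer names, and the rule "same name and same size" are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: layer name -> flat list of weights.
Params = Dict[str, List[float]]


def warm_start(new_params: Params, trained_params: Params) -> Tuple[Params, List[str]]:
    """Copy weights from a trained model into a new one where layers line up.

    Assumption for this sketch: a layer can be reused only if the trained
    model has a layer with the same name and the same number of weights.
    Everything else keeps its fresh initialization.
    """
    warmed: Params = {}
    reused: List[str] = []
    for name, values in new_params.items():
        donor = trained_params.get(name)
        if donor is not None and len(donor) == len(values):
            warmed[name] = list(donor)   # reuse trained weights
            reused.append(name)
        else:
            warmed[name] = list(values)  # keep fresh initialization
    return warmed, reused


# A previously trained model and a new model that shares only "conv1".
trained = {"conv1": [0.5, -0.2, 0.1], "fc": [0.3, 0.7]}
fresh = {"conv1": [0.0, 0.0, 0.0], "fc": [0.0, 0.0, 0.0, 0.0], "head": [0.0]}

warmed, reused = warm_start(fresh, trained)
print(reused)  # ['conv1']
```

Here only `conv1` is reused: `fc` differs in size and `head` has no counterpart in the trained model, so both keep their initial values.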
No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher frequently training many deep neural network models on a compute cluster and want to significantly reduce training times.
Not ideal if you are working with non-DNN models, training on a single machine, or not actively managing multiple model training runs.
Stars: 36
Forks: 5
Language: Python
License: —
Category:
Last pushed: Jan 09, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/SymbioticLab/ModelKeeper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
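The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the URL pattern is copied from the curl command above, the names `build_url` and `fetch_quality` are hypothetical, and the response is assumed (not confirmed by this page) to be JSON.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl command; the meaning of the first path
# segment ("ml-frameworks") is an assumption, so it is just called "group".
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(group: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (group, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(group: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the endpoint and parse the body as JSON (assumed format)."""
    with urlopen(build_url(group, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Building the URL needs no network access.
print(build_url("ml-frameworks", "SymbioticLab", "ModelKeeper"))

# Uncomment to perform the request (counts against the daily quota):
# data = fetch_quality("ml-frameworks", "SymbioticLab", "ModelKeeper")
```

Without a key the stated limit is 100 requests/day, so repeated lookups are worth caching locally.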
Higher-rated alternatives
treeverse/dvc
🦉 Data Versioning and ML Experiments
runpod/runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
microsoft/vscode-jupyter
VS Code Jupyter extension
4paradigm/OpenMLDB
OpenMLDB is an open-source machine learning database that provides a feature platform computing...
uber/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning...