IMvision12/AdEMAMix-Optimizer-Keras
A Keras 3 Implementation of AdEMAMix Optimizer
This project helps machine learning engineers and researchers train their neural networks more effectively, especially large language models and image classification models. By adjusting how past training information is used, it takes your model and training data, and produces a trained model that converges faster and often achieves better performance. This is for machine learning practitioners building and optimizing complex AI models.
No commits in the last 6 months.
Use this if you are training large machine learning models, particularly for tasks like language modeling or image classification, and want to achieve faster convergence or better final model performance than standard optimizers like Adam.
Not ideal if you are working with very small models, simple datasets, or if your primary concern is not training speed or achieving the absolute best performance for complex tasks.
Stars
10
Forks
2
Language
Python
License
MIT
Category
Last pushed
Sep 19, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/IMvision12/AdEMAMix-Optimizer-Keras"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
nschaetti/EchoTorch
A Python toolkit for Reservoir Computing and Echo State Network experimentation based on...
metaopt/torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
gpauloski/kfac-pytorch
Distributed K-FAC preconditioner for PyTorch
opthub-org/pytorch-bsf
PyTorch implementation of Bezier simplex fitting
pytorch/xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)