IMvision12/AdEMAMix-Optimizer-Keras

A Keras 3 Implementation of AdEMAMix Optimizer

/ 100

Emerging

This project helps machine learning engineers and researchers train their neural networks more effectively, especially large language models and image classification models. By adjusting how past training information is used, it takes your model and training data, and produces a trained model that converges faster and often achieves better performance. This is for machine learning practitioners building and optimizing complex AI models.

No commits in the last 6 months.

Use this if you are training large machine learning models, particularly for tasks like language modeling or image classification, and want to achieve faster convergence or better final model performance than standard optimizers like Adam.

Not ideal if you are working with very small models, simple datasets, or if your primary concern is not training speed or achieving the absolute best performance for complex tasks.

deep-learning-training large-language-models image-classification model-optimization neural-network-training

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

nschaetti/EchoTorch

A Python toolkit for Reservoir Computing and Echo State Network experimentation based on...

metaopt/torchopt

TorchOpt is an efficient library for differentiable optimization built upon PyTorch.

gpauloski/kfac-pytorch

Distributed K-FAC preconditioner for PyTorch

opthub-org/pytorch-bsf

PyTorch implementation of Bezier simplex fitting

pytorch/xla

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Explore ML Frameworks

All categories Trending ML Framework directory Insights