kyegomez/Sophia
Effortless plug-and-play optimizer that cuts model training costs by 50%. A new optimizer that is 2x faster than Adam on LLMs.
This project offers an optimized training method for large language models that lets machine learning engineers reduce the computational resources required. Drop the optimizer into your existing training setup and it produces a trained model faster and at roughly half the cost of standard methods like Adam. It is designed for AI/ML engineers and researchers working on large-scale model development and pre-training.
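Sophia refers to the "Second-order Clipped Stochastic Optimization" method: it preconditions the gradient moving average by an estimate of the Hessian diagonal and clips the resulting step elementwise, which is where the speedup over Adam comes from. Below is a minimal NumPy sketch of that update rule under stated assumptions — the function name `sophia_step`, the `state` dictionary, and the default hyperparameters are illustrative and do not necessarily match this repository's actual API:

```python
import numpy as np

def sophia_step(theta, grad, hess_diag, state, lr=1e-4,
                betas=(0.965, 0.99), rho=0.04, eps=1e-12):
    """One Sophia-style update: EMA of gradients preconditioned by an
    EMA of the Hessian diagonal, with an elementwise clip on the step.

    theta, grad, hess_diag: NumPy arrays of the same shape.
    state: dict holding the running averages "m" (gradient) and "h" (Hessian).
    Hyperparameter names follow the Sophia paper; this is a simplified sketch,
    not the repository's implementation.
    """
    b1, b2 = betas
    state["m"] = b1 * state["m"] + (1 - b1) * grad       # gradient EMA
    state["h"] = b2 * state["h"] + (1 - b2) * hess_diag  # Hessian-diagonal EMA
    # Precondition by the curvature estimate, then clip elementwise to
    # [-rho, rho] so the step stays bounded where curvature is underestimated.
    update = np.clip(state["m"] / np.maximum(state["h"], eps), -rho, rho)
    return theta - lr * update

# Toy usage on the quadratic loss 0.5 * theta^2 (grad = theta, Hessian = 1)
theta = np.array([1.0, -2.0])
state = {"m": np.zeros_like(theta), "h": np.zeros_like(theta)}
for _ in range(3):
    theta = sophia_step(theta, grad=theta,
                        hess_diag=np.ones_like(theta), state=state)
```

In a real training loop the Hessian diagonal would be estimated periodically (e.g. via a Gauss-Newton-Bartlett or Hutchinson estimator) rather than supplied exactly as in this toy example.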
381 stars. No commits in the last 6 months.
Use this if you are training large language models and want to significantly cut down on compute costs, training time, and energy consumption without changing your model architecture or infrastructure.
Not ideal if you are working with small models or non-language tasks, as the primary benefit appears in the computationally demanding setting of LLM pre-training.
Stars
381
Forks
26
Language
Python
License
Apache-2.0
Category
Last pushed
Jun 04, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/kyegomez/Sophia"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
LLM-Tuning-Safety/LLMs-Finetuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially...
appier-research/robust-llm-finetunes
Accepted to NeurIPS 2025
uthmandevsec/Self-Distillation
🤖 Enable continual learning by reproducing the On-Policy Self-Distillation algorithm for robust...
jmcentire/apprentice
Train cheap models on expensive ones. Automatically. With receipts.
phonism/LLMNotes
LLM study notes: Transformer architecture, reinforcement learning (RLHF/DPO/PPO), distributed training, and inference optimization. Includes full mathematical derivations and slides.