epfml/llm-optimizer-benchmark
Benchmarking Optimizers for LLM Pretraining
This project offers a standardized way to compare optimization techniques used in training Large Language Models (LLMs). Given a set of optimizer configurations, model sizes, and training durations, it produces benchmark results showing which optimizer performs best under each condition. LLM researchers and practitioners can use it to inform their choice of optimization method for pretraining.
Use this if you are pretraining Large Language Models and need to systematically evaluate and select the most effective optimization technique for your specific model size, batch size, or training duration.
Not ideal if you are looking for a ready-made pipeline to train LLMs for immediate use, or if your primary focus is fine-tuning an existing LLM for a downstream task.
Stars: 56
Forks: 4
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 30, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/epfml/llm-optimizer-benchmark"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
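For scripted use, the curl call above can be reproduced with the Python standard library. This is a minimal sketch: it assumes only that the endpoint returns a JSON body, and makes no assumptions about the response schema.

```python
import json
import urllib.request

# Endpoint from the curl example above.
API_URL = ("https://pt-edge.onrender.com/api/v1/quality/"
           "transformers/epfml/llm-optimizer-benchmark")

def fetch_quality(url: str = API_URL) -> dict:
    """Fetch the repo-quality record and decode the JSON body."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)

if __name__ == "__main__":
    # Pretty-print whatever fields the API returns.
    print(json.dumps(fetch_quality(), indent=2))
```

Unauthenticated calls count against the 100-request daily quota, so cache responses rather than polling.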
Higher-rated alternatives
stanfordnlp/axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
aidatatools/ollama-benchmark
LLM Benchmark for Throughput via Ollama (Local LLMs)
LarHope/ollama-benchmark
Ollama-based benchmark with detailed I/O tokens-per-second stats; Python, with a DeepSeek R1 example.
qcri/LLMeBench
Benchmarking Large Language Models
THUDM/LongBench
LongBench v2 and LongBench (ACL '25 & '24)