AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
MaxText helps AI engineers and researchers efficiently train and fine-tune large language models (LLMs) on powerful hardware like Google Cloud TPUs and GPUs. You provide raw text data and choose from a library of existing model architectures like Gemma or Llama. MaxText then outputs a highly optimized, custom-trained LLM ready for integration into your applications or further research.
2,169 stars. Actively maintained with 321 commits in the last 30 days. Available on PyPI.
Use this if you need to pre-train LLMs from scratch or fine-tune existing ones for specific tasks, with high performance and scalability on accelerator hardware.
Not ideal if you're looking for an off-the-shelf API for LLM inference or if you don't have access to specialized AI accelerator hardware for training.
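To make the workflow above concrete, here is a minimal sketch of a training launch. It assumes MaxText's key=value config-override convention and its stock base.yml config; the run name, bucket paths, model name, and step count are placeholders, so check the repository's README for the exact keys in your version.

# A minimal sketch of a pre-training/fine-tuning launch (assumed flags;
# run name, bucket paths, and step count are placeholders).
python3 MaxText/train.py MaxText/configs/base.yml \
  run_name=my-first-run \
  model_name=llama2-7b \
  base_output_directory=gs://my-bucket/maxtext-output \
  dataset_path=gs://my-bucket/my-dataset \
  steps=1000

This reflects MaxText's configuration design: any field in the YAML config can be overridden the same way from the command line.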
Stars: 2,169
Forks: 485
Language: Python
License: Apache-2.0
Last pushed: Mar 13, 2026
Commits (30d): 321
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/AI-Hypercomputer/maxtext"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
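Assuming the endpoint returns JSON (the /api/v1 path suggests it, but the response schema isn't documented here), you can pretty-print the response with jq:

# -s silences curl's progress output; jq pretty-prints the JSON body.
curl -s "https://pt-edge.onrender.com/api/v1/quality/transformers/AI-Hypercomputer/maxtext" | jq .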
Related models
rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
mindspore-lab/mindnlp
MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...
mosaicml/llm-foundry
LLM training code for Databricks foundation models
rickiepark/llm-from-scratch
Code repository for the Korean edition of "Build a Large Language Model (From Scratch)" (Gilbut, 2025)
CASE-Lab-UMD/LLM-Drop
The official implementation of the paper "Uncovering the Redundancy in Transformers via a...