young-geng/EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

/ 100

Emerging

This project helps machine learning engineers and researchers efficiently train, fine-tune, evaluate, and deploy large language models (LLMs). It takes raw text data or existing pre-trained models as input and produces custom LLMs ready for specific applications. It is designed for those who work with JAX/Flax and need to scale training across multiple GPUs or TPUs.

2,522 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher focused on developing custom large language models using JAX/Flax and require a streamlined framework for scaling your training efforts across multiple accelerators.

Not ideal if you are looking for a no-code solution or prefer frameworks outside of JAX/Flax, as this tool is specifically designed for developers working with that ecosystem.

large-language-models machine-learning-engineering deep-learning-research model-training natural-language-processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

2,522

Forks

261

Language

Python

License

Apache-2.0

Higher-rated alternatives

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started...

arcee-ai/mergekit

Tools for merging pretrained large language models.

changyeyu/LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）

mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM

Explore Transformer Models

All categories Trending Transformer directory Insights