young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
This project helps machine learning engineers and researchers efficiently train, fine-tune, evaluate, and deploy large language models (LLMs). It takes raw text data or existing pre-trained models as input and produces custom LLMs ready for specific applications. It is designed for those who work with JAX/Flax and need to scale training across multiple GPUs or TPUs.
2,522 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher focused on developing custom large language models using JAX/Flax and require a streamlined framework for scaling your training efforts across multiple accelerators.
Not ideal if you are looking for a no-code solution or prefer frameworks outside of JAX/Flax, as this tool is specifically designed for developers working with that ecosystem.
Stars
2,522
Forks
261
Language
Python
License
Apache-2.0
Category
Last pushed
Aug 13, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/young-geng/EasyLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started...
arcee-ai/mergekit
Tools for merging pretrained large language models.
changyeyu/LLM-RL-Visualized
๐100+ ๅๅ LLM / RL ๅ็ๅพ๐๏ผใๅคงๆจกๅ็ฎๆณใไฝ่ ๅทจ็ฎ๏ผ๐ฅ๏ผ100+ LLM/RL Algorithm Maps ๏ผ
mindspore-lab/step_into_llm
MindSpore online courses: Step into LLM