SafeAILab/EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Score: 57 / 100 (Established)

EAGLE helps practitioners speed up text generation from their large language models (LLMs), for example chatbot responses or marketing copy. It takes an existing LLM and produces a much faster version that generates the same text. It is designed for engineers and ML specialists who deploy and manage LLMs.
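EAGLE belongs to the family of speculative decoding methods, in which a cheap draft step proposes several tokens and the original model verifies them. The toy below is only a sketch of that general idea under greedy decoding, not EAGLE's actual algorithm. The function names and the arithmetic "models" are made up for illustration. It shows why the verified output can match what the original model would have produced on its own.

```python
from typing import Callable, List, Tuple

Token = int
NextFn = Callable[[List[Token]], Token]  # greedy "next token" function


def greedy_decode(target: NextFn, prompt: List[Token], n: int) -> List[Token]:
    """Baseline: ask the target model for one token at a time."""
    seq = list(prompt)
    for _ in range(n):
        seq.append(target(seq))
    return seq[len(prompt):]


def speculative_decode(
    target: NextFn, draft: NextFn, prompt: List[Token], n: int, k: int = 4
) -> Tuple[List[Token], int]:
    """Draft k tokens, then keep only what the target agrees with.

    Returns the generated tokens and the number of verification rounds.
    In a real system each round is a single batched pass of the target model;
    here the target is simply called once per checked position.
    """
    seq = list(prompt)
    rounds = 0
    while len(seq) - len(prompt) < n:
        # 1. The draft model guesses k tokens ahead.
        ctx = list(seq)
        proposal = []
        for _ in range(k):
            tok = draft(ctx)
            proposal.append(tok)
            ctx.append(tok)
        # 2. The target checks each guess. Only target tokens are ever appended,
        #    so the result is identical to plain greedy decoding.
        rounds += 1
        for tok in proposal:
            expected = target(seq)
            seq.append(expected)
            if expected != tok:
                break  # first mismatch: discard the rest of the draft
        else:
            seq.append(target(seq))  # all guesses accepted: one bonus token
    return seq[len(prompt):len(prompt) + n], rounds


# Hypothetical toy models: the draft agrees with the target except after token 5.
def toy_target(seq: List[Token]) -> Token:
    return (seq[-1] * 3 + 1) % 7


def toy_draft(seq: List[Token]) -> Token:
    return 0 if seq[-1] == 5 else toy_target(seq)
```

With these toy models, `speculative_decode(toy_target, toy_draft, [1], 12)` returns the same tokens as `greedy_decode(toy_target, [1], 12)` while using fewer verification rounds than tokens generated. Any real speedup depends on how often the draft is right and how cheap it is to run.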

2,213 stars.

Use this if you need to significantly speed up the text generation of your large language models while maintaining the quality and consistency of the output, especially on less powerful GPUs.

Not ideal if you are looking for a solution to improve the accuracy or factual correctness of your LLM's responses, as this tool focuses solely on inference speed.

Tags: LLM deployment, ML model serving, AI inference optimization, Generative AI, Model acceleration
No Package, No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25


Stars: 2,213
Forks: 260
Language: Python
License: (not listed)
Last pushed: Feb 20, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/SafeAILab/EAGLE"

Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
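The same request can be made from Python with only the standard library. This is a minimal sketch that assumes the endpoint returns JSON and that the path segments are category, owner, and repository, as the curl URL suggests. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response schema is not documented here, so the code does not rely on any particular field.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example; everything after it is assumed to be
# <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and parse it as JSON (assumed response format)."""
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("transformers", "SafeAILab", "EAGLE")` issues the same GET request as the curl command. Without a key, keep the stated limit of 100 requests/day in mind when scripting repeated calls.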