horseee/Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

/ 100

Emerging

This is a curated list of research papers and projects focused on making Large Language Models (LLMs) run more efficiently. It provides a comprehensive collection of resources on topics like pruning, quantization, and efficient training, helping researchers and engineers find solutions to optimize LLMs for speed and resource use. You'll find links to papers and their code repositories, categorized by technical approach.

1,967 stars. No commits in the last 6 months.

Use this if you are a researcher or engineer looking for academic papers and open-source projects to make large language models (LLMs) faster, smaller, or less resource-intensive to run or train.

Not ideal if you are an end-user simply looking for a ready-to-use efficient LLM application, as this list points to research and development resources rather than consumer-facing tools.

AI Research Machine Learning Engineering Large Language Models Model Optimization Deep Learning Efficiency

No License Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 19 / 25

How are scores calculated?

Stars

1,967

Forks

154

Language

Python

License

—

Compare

Awesome-Efficient-LLM and LightCompress

Higher-rated alternatives

ModelTC/LightCompress

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs,...

p-e-w/heretic

Fully automatic censorship removal for language models

Orion-zhen/abliteration

Make abliterated models with transformers, easy and fast

YerbaPage/LongCodeZip

LongCodeZip: Compress Long Context for Code Language Models [ASE2025]

locuslab/wanda

A simple and effective LLM pruning approach.

Explore Transformer Models

All categories Trending Transformer directory Insights