horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
This is a curated list of research papers and projects focused on making Large Language Models (LLMs) run more efficiently. It provides a comprehensive collection of resources on topics like pruning, quantization, and efficient training, helping researchers and engineers find solutions to optimize LLMs for speed and resource use. You'll find links to papers and their code repositories, categorized by technical approach.
1,967 stars. No commits in the last 6 months.
Use this if you are a researcher or engineer looking for academic papers and open-source projects to make large language models (LLMs) faster, smaller, or less resource-intensive to run or train.
Not ideal if you are an end-user simply looking for a ready-to-use efficient LLM application, as this list points to research and development resources rather than consumer-facing tools.
Stars
1,967
Forks
154
Language
Python
License
—
Category
Last pushed
Jun 17, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/horseee/Awesome-Efficient-LLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ModelTC/LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs,...
p-e-w/heretic
Fully automatic censorship removal for language models
Orion-zhen/abliteration
Make abliterated models with transformers, easy and fast
YerbaPage/LongCodeZip
LongCodeZip: Compress Long Context for Code Language Models [ASE2025]
locuslab/wanda
A simple and effective LLM pruning approach.