LLM Pruning Compression Transformer Models

Tools and methods for reducing the size and computational cost of large language models through structural pruning, layer removal, and parameter elimination. Does NOT include quantization, distillation-only approaches, or general model optimization techniques.

19 LLM pruning and compression projects are tracked. Two score above 50 (Established tier). The top-ranked is peremartra/optipfair at 59/100 with 29 stars, tied on score with VainF/Torch-Pruning (3,267 stars).

Get all 19 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-pruning-compression&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
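The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_projects` are illustrative, and since the shape of the JSON response is not documented here, the sketch returns the decoded payload as-is rather than assuming any field names.

```python
import json
import urllib.parse
import urllib.request

# Endpoint and query parameters are taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="llm-pruning-compression",
              limit=20):
    """Build the dataset URL with properly encoded query parameters."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(timeout=30.0):
    """Fetch the listing and return the decoded JSON payload.

    The response schema is an unknown here, so nothing is assumed
    about its structure; inspect the result before indexing into it.
    """
    with urllib.request.urlopen(build_url(), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (performs a network request, counts toward the daily limit):
#   data = fetch_projects()
#   print(json.dumps(data, indent=2)[:2000])
```

No API key is sent in this sketch, so it falls under the keyless 100 requests/day allowance.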

#	Model	Description	Score	Tier	Stars	Language
1	peremartra/optipfair	Structured pruning and bias visualization for Large Language Models. Tools...	59	Established	29	Python
2	VainF/Torch-Pruning	[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision...	59	Established	3,267	Python
3	horseee/LLM-Pruner	[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language...	47	Emerging	1,109	Python
4	CASIA-LMC-Lab/FLAP	[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models	44	Emerging	70	Python
5	princeton-nlp/LLM-Shearing	[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via...	42	Emerging	642	Python
6	VITA-Group/LiGO	[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer...	38	Emerging	92	Python
7	oshindutta/TVAprune	[ICML 2024 Es-FoMo] - Efficient LLM Pruning with Global Token-Dependency...	35	Emerging	5	Python
8	horseee/LLaMA-Pruning	Structural Pruning for LLaMA	33	Emerging	54	Python
9	namgyu-youn/PyTorch-Pruning	Benchmark and profile pruning researches and open-sources	32	Emerging	4	Python
10	ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning	An implementation of the DISP-LLM method from the NeurIPS 2024 paper:...	32	Emerging	25	Python
11	hexuandeng/DRPruning	Implementation for our paper “DRPruning: Efficient Large Language Model...	30	Emerging	7	Python
12	cliang1453/SAGE	No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for...	29	Experimental	29	Python
13	ahazeemi/dPrune	🌿 dPrune: A Framework for Data Pruning	29	Experimental	3	Python
14	gszfwsb/Data-Whisperer	Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for...	26	Experimental	48	Python
15	visresearch/SDMPrune	The official implementation of "SDMPrune: Self-Distillation MLP Pruning for...	23	Experimental	21	Python
16	Adam-Mazur/Lazy-Llama	An implementation of LazyLLM token pruning for LLaMa 2 model family.	21	Experimental	13	Python
17	thegreat-art/pruneren	🛠️ Optimize LLMs with advanced pruning strategies and real-time...	16	Experimental	1	Python
18	kainoj/pruning-bias	Pruning bias from transformers / NAACL 2022	12	Experimental	7	Python
19	eren23/pruneren	Intelligent layer pruning toolkit for LLMs featuring iterative optimization,...	10	Experimental	1	Python
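
The tier labels in the table appear to follow score bands. The sketch below reproduces them under two assumptions inferred from the rows above rather than from any documented rule: scores above 50 are Established (as stated in the summary), and the Emerging/Experimental boundary sits at 30, since 30 is listed as Emerging and 29 as Experimental. The function name `tier_for_score` is hypothetical, and how a score of exactly 50 is treated is not known.

```python
from collections import Counter


def tier_for_score(score: int) -> str:
    """Map a 0-100 score to a tier label.

    Thresholds are inferred from the table, not from a documented rule:
    above 50 is Established, 30 to 50 is Emerging, below 30 is Experimental.
    """
    if score > 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Scores copied from the table, in rank order.
SCORES = [59, 59, 47, 44, 42, 38, 35, 33, 32, 32,
          30, 29, 29, 26, 23, 21, 16, 12, 10]

# Count how many projects fall into each inferred tier.
tier_counts = Counter(tier_for_score(s) for s in SCORES)
```

With the table's scores, this grouping matches the listed tiers row for row: 2 Established, 9 Emerging, and 8 Experimental projects.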

Comparisons in this category

LLM-Pruner and LLM-Shearing (47 vs 42)
LLM-Pruner and LLaMA-Pruning (47 vs 33)
LLM-Shearing and LLaMA-Pruning (42 vs 33)