LLM Pruning Compression Transformer Models
Tools and methods for reducing the size and computational cost of large language models through structural pruning, layer removal, and parameter elimination. Does NOT include quantization, distillation-only approaches, or general model optimization techniques.
There are 19 llm pruning compression models tracked. 2 score above 50 (established tier). The highest-rated is peremartra/optipfair at 59/100 with 29 stars.
Get all 19 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-pruning-compression&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
peremartra/optipfair
Structured pruning and bias visualization for Large Language Models. Tools... |
|
Established |
| 2 |
VainF/Torch-Pruning
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision... |
|
Established |
| 3 |
horseee/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language... |
|
Emerging |
| 4 |
CASIA-LMC-Lab/FLAP
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models |
|
Emerging |
| 5 |
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via... |
|
Emerging |
| 6 |
VITA-Group/LiGO
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer... |
|
Emerging |
| 7 |
oshindutta/TVAprune
[ICML 2024 Es-FoMo] - Efficient LLM Pruning with Global Token-Dependency... |
|
Emerging |
| 8 |
horseee/LLaMA-Pruning
Structural Pruning for LLaMA |
|
Emerging |
| 9 |
namgyu-youn/PyTorch-Pruning
Benchmark and profile pruning researches and open-sources |
|
Emerging |
| 10 |
ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning
An implementation of the DISP-LLM method from the NeurIPS 2024 paper:... |
|
Emerging |
| 11 |
hexuandeng/DRPruning
Implementation for our paper “DRPruning: Efficient Large Language Model... |
|
Emerging |
| 12 |
cliang1453/SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for... |
|
Experimental |
| 13 |
ahazeemi/dPrune
🌿 dPrune: A Framework for Data Pruning |
|
Experimental |
| 14 |
gszfwsb/Data-Whisperer
Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for... |
|
Experimental |
| 15 |
visresearch/SDMPrune
The official implementation of "SDMPrune: Self-Distillation MLP Pruning for... |
|
Experimental |
| 16 |
Adam-Mazur/Lazy-Llama
An implementation of LazyLLM token pruning for LLaMa 2 model family. |
|
Experimental |
| 17 |
thegreat-art/pruneren
🛠️ Optimize LLMs with advanced pruning strategies and real-time... |
|
Experimental |
| 18 |
kainoj/pruning-bias
Pruning bias from transformers / NAACL 2022 |
|
Experimental |
| 19 |
eren23/pruneren
Intelligent layer pruning toolkit for LLMs featuring iterative optimization,... |
|
Experimental |