bupt-ai-club/llm-compression-papers
papers of llm compression
This repository curates academic and industry papers on LLM compression, helping AI researchers and machine learning engineers keep up with techniques for making large language models smaller and more efficient. Some entries also link to accompanying code, so you can study how to reduce the size and computational demands of LLMs in practice.
No commits in the last 6 months.
Use this if you are an AI/ML researcher or practitioner looking for comprehensive resources on optimizing large language models for better performance and reduced resource usage.
Not ideal if you are an end-user simply looking to use an LLM and are not concerned with its underlying technical architecture or compression.
Stars: 13
Forks: —
Language: —
License: Apache-2.0
Category: —
Last pushed: Mar 06, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/bupt-ai-club/llm-compression-papers"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives:
ModelTC/LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs,...
p-e-w/heretic
Fully automatic censorship removal for language models
Orion-zhen/abliteration
Make abliterated models with transformers, easy and fast
YerbaPage/LongCodeZip
LongCodeZip: Compress Long Context for Code Language Models [ASE2025]
locuslab/wanda
A simple and effective LLM pruning approach.