whucs21Mzy/Model-Phase-Transitions
Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A Perspective
This research provides a framework for understanding how to significantly reduce the size of large language models (LLMs) without losing performance. It helps practitioners identify the limits of various compression techniques like pruning and quantization by revealing "phase transition points." By understanding these limits, users can combine different methods to achieve extreme lossless compression, resulting in much smaller, yet equally performant, LLMs.
Use this if you need to deploy large language models in environments with limited computational resources or memory, and you want to reduce their size without sacrificing accuracy.
Not ideal if you are working with small models where resource constraints are not a major concern, or if you are willing to accept some performance degradation for higher compression.
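The description mentions combining methods such as pruning and quantization to compress a model. As a minimal illustration (not this repository's actual method; the helper names, 50% sparsity level, and int8 scheme are assumptions for the sketch), here is how magnitude pruning can be stacked with symmetric 8-bit quantization on a weight matrix:

```python
import numpy as np

def magnitude_prune(w, sparsity):
    """Zero out the smallest-magnitude fraction of weights (illustrative helper)."""
    k = int(w.size * sparsity)
    if k == 0:
        return w.copy()
    threshold = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    return np.where(np.abs(w) <= threshold, 0.0, w)

def quantize_int8(w):
    """Symmetric per-tensor int8 quantization; returns integer codes and a scale."""
    max_abs = float(np.abs(w).max())
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

rng = np.random.default_rng(0)
w = rng.normal(size=(64, 64)).astype(np.float32)

pruned = magnitude_prune(w, sparsity=0.5)  # first zero half the weights
q, scale = quantize_int8(pruned)           # then quantize the survivors to 8 bits
recon = q.astype(np.float32) * scale       # dequantize to check the error

print("sparsity:", np.mean(pruned == 0))
print("max reconstruction error:", np.abs(recon - pruned).max())
```

Whether such a combination stays lossless depends on where each technique's phase transition point lies for the given model, which is what the framework above is meant to characterize.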
Stars
76
Forks
1
Language
—
License
—
Category
Last pushed
Feb 09, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/whucs21Mzy/Model-Phase-Transitions"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ModelTC/LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs,...
p-e-w/heretic
Fully automatic censorship removal for language models
Orion-zhen/abliteration
Make abliterated models with transformers, easy and fast
YerbaPage/LongCodeZip
LongCodeZip: Compress Long Context for Code Language Models [ASE2025]
locuslab/wanda
A simple and effective LLM pruning approach.