StargazerX0/ScaleKV

[NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression

/ 100

Emerging

ScaleKV helps researchers and engineers working with large visual generative models to reduce the significant memory footprint required for image generation. It takes in a trained visual autoregressive model and outputs the same model, but optimized to use substantially less memory during image generation, making it feasible to run on more constrained hardware. This project is ideal for those developing and deploying advanced image generation systems.

Use this if you are developing visual generative AI and need to significantly reduce the memory consumption of your large visual autoregressive models without sacrificing image quality.

Not ideal if you are working with text-based models or do not face memory constraints when generating images with visual autoregressive models.

visual-generative-ai image-synthesis deep-learning-optimization large-scale-models computer-vision-research

No Package No Dependents

Maintenance 6 / 25

Adoption 8 / 25

Maturity 15 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

ModelCloud/GPTQModel

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD...

intel/auto-round

🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality...

pytorch/ao

PyTorch native quantization and sparsity for training and inference

bodaay/HuggingFaceModelDownloader

Simple go utility to download HuggingFace Models and Datasets

NVIDIA/kvpress

LLM KV cache compression made easy

Explore Transformer Models

All categories Trending Transformer directory Insights