NVlabs/EoRA
[ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation
EoRA helps machine learning engineers and researchers deploy large language models (LLMs) more efficiently by improving the accuracy of compressed models without extensive retraining. You provide a pre-compressed LLM, and it outputs an enhanced version with improved accuracy on specific tasks that still runs faster and uses less memory than the original. This is ideal for teams serving LLMs in resource-constrained environments.
Use this if you need to recover the accuracy of a compressed large language model quickly, without the time and computational cost of fine-tuning.
Not ideal if you are working with uncompressed models or if your primary goal is to train a new model from scratch rather than enhance an existing compressed one.
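The core idea, compensating a compressed weight matrix with a low-rank correction term, can be illustrated with a toy sketch. This is a minimal illustration of plain SVD-based error compensation under assumed shapes and rank, not NVlabs' implementation: EoRA itself projects the compression error into an activation eigenspace before truncating, which this sketch omits.

```python
import numpy as np

# Toy illustration (not the EoRA algorithm): given a full-precision
# weight W and its compressed version W_c, fit a rank-r term B @ A to
# the compression error so that W_c + B @ A better approximates W.
rng = np.random.default_rng(0)
W = rng.standard_normal((64, 64))
W_c = np.round(W * 4) / 4           # toy "compression": coarse quantization

r = 8                               # low-rank budget (assumption)
U, S, Vt = np.linalg.svd(W - W_c)   # SVD of the compression error
B = U[:, :r] * S[:r]                # (64, r) factor
A = Vt[:r]                          # (r, 64) factor

err_before = np.linalg.norm(W - W_c)
err_after = np.linalg.norm(W - (W_c + B @ A))
assert err_after < err_before       # the low-rank term recovers part of the error
```

Because the rank-r SVD truncation is the best rank-r approximation of the error in Frobenius norm, the compensated model is never worse than the plain compressed one on this reconstruction objective.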
Stars
29
Forks
3
Language
Python
License
—
Category
Last pushed
Mar 16, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/NVlabs/EoRA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
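The same endpoint can be queried from Python with the standard library. The URL is the one given above; the JSON field names are not documented on this page, so the response is returned as a plain dict.

```python
import json
import urllib.request

# Endpoint from this page; no API key needed for up to 100 requests/day.
API_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers/NVlabs/EoRA"

def fetch_quality(url: str = API_URL, timeout: float = 10.0) -> dict:
    """Fetch the repo-quality record as a dict (field names undocumented here)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)

# print(json.dumps(fetch_quality(), indent=2))  # uses one request of the daily quota
```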
Higher-rated alternatives
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
adithya-s-k/AI-Engineering.academy
Mastering Applied AI, One Concept at a Time
jax-ml/jax-llm-examples
Minimal yet performant LLM examples in pure JAX
young-geng/scalax
A simple library for scaling up JAX programs
riyanshibohra/TuneKit
Upload your data → Get a fine-tuned SLM. Free.