Hong-Lab-UMN-ECE/RoSTE

[ICML 2025] Official code for the paper "RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models"

Score: 22 / 100 (Experimental)

This project helps machine learning engineers and researchers fine-tune large language models (LLMs) efficiently through quantization-aware supervised fine-tuning. You provide a pre-trained LLM and a dataset for a specific task, such as summarization, and it produces a more compact, quantized, fine-tuned model ready for deployment. It is aimed at those deploying LLMs in resource-constrained environments.

No commits in the last 6 months.

Use this if you need to fine-tune a large language model for a specific task and want to reduce its memory footprint and computational cost without significant performance loss.

Not ideal if you are looking for a general-purpose LLM fine-tuning library without specific concerns about quantization or efficiency.
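
For intuition, below is a minimal sketch of the general idea behind quantization-aware fine-tuning with a straight-through estimator (STE), the family of techniques the paper's title points to. This is an illustrative PyTorch example, not the repo's implementation; the bit width, symmetric per-tensor scaling, and toy training step are all assumptions.

import torch

def fake_quantize(w: torch.Tensor, bits: int = 4) -> torch.Tensor:
    # Symmetric per-tensor fake quantization with a straight-through
    # estimator: the forward pass sees the rounded low-bit weights, while
    # the backward pass treats rounding as the identity so gradients
    # still reach the full-precision weights.
    qmax = 2 ** (bits - 1) - 1                  # e.g. 7 for 4-bit symmetric
    scale = w.abs().max().clamp(min=1e-8) / qmax
    w_q = torch.clamp(torch.round(w / scale), -qmax, qmax) * scale
    return w + (w_q - w).detach()               # STE trick

# Toy training step: the loss is computed with quantized weights, but the
# gradient updates the full-precision copy.
w = torch.randn(16, 16, requires_grad=True)
x = torch.randn(8, 16)
loss = (x @ fake_quantize(w, bits=4)).pow(2).mean()
loss.backward()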

large-language-models model-optimization deep-learning-deployment natural-language-processing quantization
No License · Stale (6 months) · No Package · No Dependents

Maintenance: 2 / 25
Adoption: 5 / 25
Maturity: 8 / 25
Community: 7 / 25

Stars: 10
Forks: 1
Language: Python
License: None
Last pushed: May 29, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Hong-Lab-UMN-ECE/RoSTE"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
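
To consume the endpoint from code rather than curl, a minimal Python sketch using only the standard library is below. The URL is the one from the curl example above; the response schema isn't documented here, so the payload is simply pretty-printed rather than assuming field names.

import json
import urllib.request

URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Hong-Lab-UMN-ECE/RoSTE"

# Fetch the quality report and pretty-print the JSON payload.
with urllib.request.urlopen(URL, timeout=10) as resp:
    data = json.load(resp)

print(json.dumps(data, indent=2))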