merantix-momentum/acip

🗜️Codebase of the ACIP algorithm 🗜️

Emerging

This project helps machine learning practitioners and researchers reduce the size of large language models (LLMs). It takes an existing LLM, such as LLaMA or Mistral, and outputs a significantly smaller, compressed version that still performs well. Users can easily choose their desired compression level, making models more efficient for deployment and experimentation.

Use this if you need to make large language models smaller and faster, or reduce their memory footprint, while maintaining good performance.

Not ideal if you need an off-the-shelf pre-trained LLM with no compression requirement, or if you need an extremely small model, where knowledge distillation may be a better fit.

large-language-models model-optimization machine-learning-deployment resource-management natural-language-processing

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 15 / 25

Community 5 / 25

Language

Python

License

—

Higher-rated alternatives

aalok-sathe/surprisal

A unified interface for computing surprisal (log probabilities) from language models! Supports...

EvolvingLMMs-Lab/lmms-engine

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

FunnySaltyFish/Better-Ruozhiba

[Item-by-item processing complete] A QA dataset of curated Ruozhiba questions, with every entry manually reviewed and revised

reasoning-machines/pal

PaL: Program-Aided Language Models (ICML 2023)

microsoft/monitors4codegen

Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static...