Daemoniorum-LLC/haagenti
High-performance compression library for Rust with LZ4, Zstd, Brotli, and Deflate. Features SIMD acceleration, streaming API, and no_std support.
This project helps AI practitioners run very large language models (e.g., 70-billion-parameter models) on a single consumer-grade graphics card instead of expensive cloud servers or multiple high-end GPUs. It compresses massive model weights into a compact format that fits within limited GPU memory. It is aimed at AI engineers, researchers, and data scientists who need to deploy or experiment with large models affordably.
Use this if you need to run inference with large AI models (70B+ parameters) on hardware with limited VRAM, such as a single consumer GPU, while preserving output quality.
Not ideal if your primary need is general-purpose data compression for typical files or network traffic, where traditional algorithms are sufficient.
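The core idea above is streaming compression of large weight buffers so they never need to sit fully decompressed in memory. As a concept-only sketch (not haagenti's actual Rust API), here is chunked Deflate compression using Python's standard `zlib` module; Deflate is one of the algorithms the library lists, and the `compress_stream` helper name is our own:

```python
import zlib

def compress_stream(chunks):
    """Compress an iterable of byte chunks incrementally, without
    holding the whole payload in memory at once."""
    co = zlib.compressobj(level=9)
    out = bytearray()
    for chunk in chunks:
        out += co.compress(chunk)
    out += co.flush()
    return bytes(out)

# Simulate a large, highly redundant buffer fed in 1 MiB chunks.
chunks = [b"\x00" * (1 << 20) for _ in range(8)]
compressed = compress_stream(chunks)
original_size = 8 * (1 << 20)
print(original_size, len(compressed))
```

Real model weights compress far less than this all-zeros buffer, but the streaming pattern is the same: feed fixed-size chunks through a stateful compressor and flush once at the end.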
Stars
23
Forks
1
Language
Rust
License
Apache-2.0
Category
Last pushed
Mar 09, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/Daemoniorum-LLC/haagenti"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
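If you want to query the same endpoint for other repositories, the URL follows an owner/repo pattern. A minimal helper, with the endpoint path taken verbatim from the curl example above (the function name is our own):

```python
# Build the quality-API URL for any owner/repo pair.
def quality_url(owner: str, repo: str) -> str:
    base = "https://pt-edge.onrender.com/api/v1/quality/ai-coding"
    return f"{base}/{owner}/{repo}"

print(quality_url("Daemoniorum-LLC", "haagenti"))
```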