Michael-A-Kuykendall/shimmytok

Pure Rust tokenizer for GGUF models - llama.cpp compatible

/ 100

Emerging

This is a developer tool for building applications that use Large Language Models (LLMs). It helps integrate LLMs by converting human-readable text into numerical tokens that models understand, and vice-versa, directly from a GGUF model file. The primary users are Rust developers building LLM inference engines, WASM applications, or command-line tools.

Use this if you are a Rust developer creating an application that needs to process text with GGUF-formatted LLMs and want a self-contained, C++-free solution for tokenization.

Not ideal if you are not a Rust developer, or if your primary need is general-purpose text processing unrelated to GGUF LLMs.

LLM development Rust programming AI application building natural language processing model inference

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 13 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Rust

License

Apache-2.0

Higher-rated alternatives

ModelCloud/GPTQModel

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD...

intel/auto-round

🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality...

pytorch/ao

PyTorch native quantization and sparsity for training and inference

bodaay/HuggingFaceModelDownloader

Simple go utility to download HuggingFace Models and Datasets

NVIDIA/kvpress

LLM KV cache compression made easy

Explore Transformer Models

All categories Trending Transformer directory Insights