Michael-A-Kuykendall/shimmytok

Pure Rust tokenizer for GGUF models - llama.cpp compatible

Score: 43 / 100 (Emerging)

This is a developer tool for building applications that use Large Language Models (LLMs). It converts human-readable text into the numerical token IDs a model consumes, and token IDs back into text, reading the tokenizer data directly from a GGUF model file. Its primary users are Rust developers building LLM inference engines, WASM applications, or command-line tools.
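
To make the text-to-token round trip concrete, here is a minimal sketch of a vocabulary-based tokenizer. It is an illustration only: `ToyTokenizer`, its methods, and the greedy longest-match strategy are hypothetical and are not shimmytok's API. Tokenizers stored in GGUF files typically follow SentencePiece or BPE rules (scores or merge tables), which this sketch does not implement.

```rust
use std::collections::HashMap;

/// Toy vocabulary-based tokenizer (hypothetical; NOT shimmytok's API).
/// A token's ID is simply its index in the vocabulary list.
pub struct ToyTokenizer {
    id_to_token: Vec<String>,
    token_to_id: HashMap<String, u32>,
    max_token_chars: usize,
}

impl ToyTokenizer {
    /// Build the lookup tables from a list of token strings.
    pub fn new(vocab: &[&str]) -> Self {
        let id_to_token: Vec<String> = vocab.iter().map(|s| s.to_string()).collect();
        let token_to_id = id_to_token
            .iter()
            .enumerate()
            .map(|(i, t)| (t.clone(), i as u32))
            .collect();
        let max_token_chars = id_to_token
            .iter()
            .map(|t| t.chars().count())
            .max()
            .unwrap_or(0);
        Self { id_to_token, token_to_id, max_token_chars }
    }

    /// Greedy longest-match encoding (a simplification, assumed for this sketch).
    /// Returns None if part of the text is not covered by the vocabulary.
    pub fn encode(&self, text: &str) -> Option<Vec<u32>> {
        let chars: Vec<char> = text.chars().collect();
        let mut ids = Vec::new();
        let mut pos = 0;
        while pos < chars.len() {
            let longest = self.max_token_chars.min(chars.len() - pos);
            let mut matched = false;
            // Try the longest candidate first, then shrink.
            for len in (1..=longest).rev() {
                let candidate: String = chars[pos..pos + len].iter().collect();
                if let Some(&id) = self.token_to_id.get(&candidate) {
                    ids.push(id);
                    pos += len;
                    matched = true;
                    break;
                }
            }
            if !matched {
                return None;
            }
        }
        Some(ids)
    }

    /// Decode by concatenating token strings. Returns None on an unknown ID.
    pub fn decode(&self, ids: &[u32]) -> Option<String> {
        let mut out = String::new();
        for &id in ids {
            out.push_str(self.id_to_token.get(id as usize)?);
        }
        Some(out)
    }
}
```

With the vocabulary `["hello", " ", "world", "h", "e", "l", "o", "w", "r", "d"]`, encoding `"hello world"` yields the IDs `[0, 1, 2]`, and decoding those IDs returns the original string. A real GGUF tokenizer differs in how it picks tokens, but the encode/decode contract is the same.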

Use this if you are a Rust developer creating an application that needs to process text with GGUF-formatted LLMs and want a self-contained, C++-free solution for tokenization.

Not ideal if you do not work in Rust, or if you need general-purpose text processing unrelated to GGUF models.

Tags: LLM development, Rust programming, AI application building, natural language processing, model inference
No package, no dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 13 / 25
Community 10 / 25

Stars: 14
Forks: 2
Language: Rust
License: Apache-2.0
Last pushed: Jan 15, 2026
Monthly downloads: 182
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Michael-A-Kuykendall/shimmytok"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.