jorge-menjivar/tekken-rs

Rust implementation of the Mistral Tekken tokenizer

46 / 100 (Emerging)

This is a Rust library for developers building applications that process text and audio with Mistral AI's large language models. It converts raw text or WAV audio files into numerical tokens and reconstructs text from tokens, so developers can prepare model inputs and interpret model outputs.

Use this if you are a Rust developer working with Mistral AI models and need a fast, efficient, and fully compatible tokenizer for both text and audio data.

Not ideal if you are not a Rust developer or if your project does not involve Mistral AI's tokenization scheme.

Rust-development NLP-engineering audio-processing AI-model-integration language-model-tooling
No Package, No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 15 / 25
Community 8 / 25
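
The four category scores listed above add up to the overall 46 / 100. A minimal sketch of that arithmetic follows, assuming the overall score is a plain sum of the categories; the listing itself does not state the formula, and the names `CATEGORY_SCORES` and `total_score` are hypothetical.

```rust
/// Category scores as listed above, each out of 25.
const CATEGORY_SCORES: [(&str, u32); 4] = [
    ("Maintenance", 13),
    ("Adoption", 10),
    ("Maturity", 15),
    ("Community", 8),
];

/// Sums the category scores.
/// Assumption: the overall score is a plain sum; the listing does not say so.
fn total_score(scores: &[(&str, u32)]) -> u32 {
    scores.iter().map(|(_, points)| *points).sum()
}

fn main() {
    let total = total_score(&CATEGORY_SCORES);
    let max = 25 * CATEGORY_SCORES.len() as u32;
    println!("{} / {}", total, max); // prints "46 / 100"
}
```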


Stars: 8
Forks: 1
Language: Rust
License: Apache-2.0
Last pushed: Mar 16, 2026
Monthly downloads: 507
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jorge-menjivar/tekken-rs"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
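
The curl example suggests the endpoint path ends in a category, an owner, and a repository name. A small sketch of building that URL in Rust is shown here under that assumption; the helper `quality_url` is hypothetical, and reading `nlp` as a category segment is a guess from the one URL shown, not documented behavior.

```rust
/// Base URL taken from the curl example above.
const API_BASE: &str = "https://pt-edge.onrender.com/api/v1/quality";

/// Builds the quality endpoint URL for one repository.
/// Assumption: the path is <category>/<owner>/<repo>, inferred from the
/// single example URL; the listing does not document the path layout.
fn quality_url(category: &str, owner: &str, repo: &str) -> String {
    format!("{}/{}/{}/{}", API_BASE, category, owner, repo)
}

fn main() {
    // Reproduces the URL used in the curl example.
    println!("{}", quality_url("nlp", "jorge-menjivar", "tekken-rs"));
}
```

The resulting string can be passed to curl or to any HTTP client; the response format is not described in the listing, so it is left unparsed here.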