openscilab/tocount
ToCount: Lightweight Token Estimator
ToCount estimates the token cost of text inputs for large language models: give it a piece of text and it quickly returns an approximate token count. This matters for anyone budgeting API calls, trimming prompt length, or fitting text into a model's context window.
Available on PyPI.
Use this if you need a quick, reliable way to predict the token count of text before sending it to a large language model API.
Not ideal if you need exact token counts for every model, since estimates may differ slightly from the counts an API actually reports.
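To make the idea of estimation concrete, here is a minimal sketch of a heuristic token estimator in plain Python. This is not ToCount's actual method; it uses two widely cited rules of thumb (roughly 4 characters per token, and roughly 0.75 words per token for English text) and takes the larger of the two estimates.

```python
import re

def estimate_tokens(text: str) -> int:
    """Rough token estimate using common rules of thumb
    (illustrative only; not ToCount's algorithm)."""
    char_estimate = len(text) / 4            # ~4 characters per token
    word_estimate = len(re.findall(r"\S+", text)) * 4 / 3  # ~0.75 words per token
    return round(max(char_estimate, word_estimate))

print(estimate_tokens("Hello, world!"))
```

A dedicated library like ToCount will typically be closer to real tokenizer output than a one-line heuristic, which is exactly why such tools exist.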
Stars: 21
Forks: 1
Language: Python
License: MIT
Category: NLP
Last pushed: Feb 14, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/openscilab/tocount"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
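The same endpoint can be called from Python with the standard library. This sketch assumes the response body is JSON (the schema is not documented here); `quality_url` and `fetch_quality` are illustrative helper names, not part of any published client.

```python
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for a repository's quality record."""
    return f"{BASE}/{category}/{owner}/{repo}"

def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch the quality record. Assumes a JSON response body;
    counts against the 100-requests/day anonymous rate limit."""
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)

print(quality_url("nlp", "openscilab", "tocount"))
```

Calling `fetch_quality("nlp", "openscilab", "tocount")` would perform the same request as the curl command above.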
Higher-rated alternatives
guillaume-be/rust-tokenizers
Rust-tokenizer offers high-performance tokenizers for modern language models, including...
sugarme/tokenizer
NLP tokenizers written in Go
elixir-nx/tokenizers
Elixir bindings for 🤗 Tokenizers
reinfer/blingfire-rs
Rust wrapper for the BlingFire tokenization library
Scurrra/ubpe
Universal (general sequence) Byte-Pair Encoding