aidatatools/ollama-benchmark
LLM Benchmark for Throughput via Ollama (Local LLMs)
This tool helps you quickly understand the real performance of your local Large Language Models (LLMs) running via Ollama. It takes your existing local LLM setup and provides a clear tokens-per-second metric. AI/ML practitioners, researchers, or anyone experimenting with local LLMs can use this to assess different models and hardware configurations.
Use this if you need to measure the raw inference speed (throughput) of various LLMs on your local machine to compare performance or optimize your setup.
Not ideal if you are looking to benchmark the accuracy, quality, or specific application performance of an LLM, as this tool focuses solely on throughput.
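The tokens-per-second figure this tool reports can be derived directly from the counters Ollama returns: each `/api/generate` response includes `eval_count` (tokens generated) and `eval_duration` (nanoseconds spent generating). The helper below is a minimal illustrative sketch of that arithmetic, not code from this repository:

```python
# Hypothetical helper illustrating the throughput metric this tool reports.
# Ollama's /api/generate response carries eval_count (tokens generated) and
# eval_duration (nanoseconds spent generating); tokens/sec follows directly.

def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    """Convert Ollama's raw counters into a tokens/sec throughput figure."""
    if eval_duration_ns <= 0:
        raise ValueError("eval_duration must be positive")
    return eval_count / (eval_duration_ns / 1e9)

# Example: 128 tokens generated in 2 seconds of eval time
print(tokens_per_second(128, 2_000_000_000))  # 64.0
```

In practice you would read both fields from the JSON body returned by a POST to `http://localhost:11434/api/generate` (Ollama's default local endpoint) and feed them into a function like this.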
Stars: 345
Forks: 41
Language: Python
License: MIT
Category:
Last pushed: Jan 17, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/aidatatools/ollama-benchmark"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related repositories
stanfordnlp/axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
LarHope/ollama-benchmark
Ollama based Benchmark with detail I/O token per second. Python with Deepseek R1 example.
qcri/LLMeBench
Benchmarking Large Language Models
THUDM/LongBench
LongBench v2 and LongBench (ACL '25 & '24)
microsoft/LLF-Bench
A benchmark for evaluating learning agents based on just language feedback