mlchrzan/pairadigm
Concept-Guided Chain-of-Thought (CGCoT) pairwise annotation tool for systematic text evaluation using LLMs. Generate breakdowns, compare items, compute scores, and validate against human judgments. Supports Ollama, Hugging Face, Google Gemini, OpenAI, and Anthropic models.
Available on PyPI.
Stars: 4
Forks: —
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Mar 09, 2026
Commits (30d): 0
Dependencies: 14
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/mlchrzan/pairadigm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
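For programmatic use, the endpoint path from the curl example above can be built per repository. A minimal Python sketch (the `quality_url` helper is illustrative, not part of any published client; the response schema is not documented here, so fetching and parsing are left to the reader):

```python
# Base path taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"

def quality_url(owner: str, repo: str) -> str:
    """Return the quality-data API URL for a given owner/repo pair."""
    return f"{BASE}/{owner}/{repo}"

print(quality_url("mlchrzan", "pairadigm"))
# The URL can then be fetched with any HTTP client (e.g. curl, as shown above).
```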
Higher-rated alternatives:
- EvolvingLMMs-Lab/lmms-eval: One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
- vibrantlabsai/ragas: Supercharge Your LLM Application Evaluations 🚀
- open-compass/VLMEvalKit: Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
- EuroEval/EuroEval: The robust European language model benchmark.
- Giskard-AI/giskard-oss: 🐢 Open-Source Evaluation & Testing library for LLM Agents