flashclub/ModelJudge

A multilingual AI model evaluation platform built with Next.js, supporting multi-model comparison and real-time streaming responses. Users can compare responses from multiple models and receive a final judgment.

Score: 39 / 100 (Emerging)

This platform helps AI developers and researchers compare the output of different AI models. You enter a question and select up to three models to generate answers; a fourth model then rates those answers and produces a final answer. It is aimed at anyone working with AI models who needs to compare their outputs side by side and get a consolidated judgment.
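The compare-then-judge flow described above can be sketched roughly as follows. This is a minimal illustration, not code from the ModelJudge repository: the names `ModelCall`, `Judgment`, and `compareAndJudge` are hypothetical, and the `call` parameter stands in for whatever model client the platform actually uses.

```typescript
// Hypothetical types and names: none of these come from the ModelJudge codebase.
type ModelCall = (model: string, prompt: string) => Promise<string>;

interface Judgment {
  answers: { model: string; answer: string }[];
  verdict: string;
}

// The description says "up to three models" generate answers.
const MAX_CANDIDATES = 3;

async function compareAndJudge(
  question: string,
  candidates: string[],
  judge: string,
  call: ModelCall,
): Promise<Judgment> {
  if (candidates.length === 0 || candidates.length > MAX_CANDIDATES) {
    throw new Error(`Select between 1 and ${MAX_CANDIDATES} candidate models`);
  }

  // Ask every candidate model the same question concurrently.
  const answers = await Promise.all(
    candidates.map(async (model) => ({
      model,
      answer: await call(model, question),
    })),
  );

  // Build a single prompt for the judge that contains the question and all answers.
  const judgePrompt = [
    `Question: ${question}`,
    ...answers.map((a, i) => `Answer ${i + 1} (${a.model}): ${a.answer}`),
    "Rate each answer and give a final answer.",
  ].join("\n");

  // The judge model (the "fourth model") produces the rating and final answer.
  const verdict = await call(judge, judgePrompt);
  return { answers, verdict };
}
```

Passing the model client in as a function keeps the sketch independent of any provider SDK and makes it easy to exercise with a stub.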

Use this if you need to quickly compare the responses of multiple AI models to a specific prompt and get a consolidated judgment.

Not ideal if you need to perform deep, statistical analysis of model performance or integrate evaluations into a larger automated pipeline.

AI-evaluation model-comparison prompt-engineering AI-research developer-tool
No Package, No Dependents
Maintenance 6 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 8 / 25


Stars: 95
Forks: 5
Language: TypeScript
License: MIT
Last pushed: Dec 07, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/flashclub/ModelJudge"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
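
The same request can be made from TypeScript. The following is a minimal sketch: the URL is the one from the curl command above, while the helper names `qualityUrl` and `fetchQuality` are hypothetical. It assumes the endpoint returns JSON, which the page does not state, so the result is left typed as `unknown` rather than guessing at its fields. It relies on the global `fetch` available in browsers and Node.js 18+.

```typescript
// Base path taken from the curl example; the helper names are illustrative only.
const BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools";

// Build the endpoint URL for a given repository owner and name.
function qualityUrl(owner: string, repo: string): string {
  return `${BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the quality data without an API key.
// Assumption: the response body is JSON; its shape is not documented here.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(owner, repo));
  if (!res.ok) {
    // A non-2xx status may also mean the daily request limit was reached.
    throw new Error(`Request failed: ${res.status} ${res.statusText}`);
  }
  return res.json();
}
```

For example, `fetchQuality("flashclub", "ModelJudge")` requests the data for this repository. Keyless access is limited to 100 requests per day, so callers that poll should cache the result.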