CommissarSilver/PrismBench
PrismBench: A comprehensive framework for evaluating Large Language Model capabilities through Monte Carlo Tree Search. Systematically maps model strengths, automatically discovers challenging concept combinations, and provides detailed performance analysis with containerized deployment and OpenAI-compatible API support.
Stars
3
Forks
—
Language
Python
License
—
Category
Last pushed
Mar 06, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/CommissarSilver/PrismBench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rynfar/meridian
Use your Claude Max subscription with OpenCode. Proxy that bridges Anthropic's official SDK to...
rullerzhou-afk/clawd-on-desk
A desktop pet that reacts to your Claude Code sessions in real-time — thinking, typing, ...
calesthio/OpenMontage
World's first open-source, agentic video production system. 11 pipelines, 49 tools, 400+ agent...
cruzyjapan/Gemini-CLI-UI
A responsive web-based UI that provides an intuitive interface for Google's Gemini CLI, enabling...
admincodes7/zor
An Open-Source Claude-Code like Terminal based AI Pair Programmer