Evaluation Frameworks Metrics AI Agents

There are 3 evaluation frameworks metrics agents tracked. The highest-rated is broomva/nous at 22/100 with 0 stars.

Get all 3 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=evaluation-frameworks-metrics&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Agent	Score	Tier	Stars	Language
1	broomva/nous Metacognitive evaluation — real-time quality scoring with inline heuristics...	22	Experimental	—	Rust
2	prabdeb/agenteval-sample AgentEval (AutoGen 0.4) Sample Implementation	17	Experimental	1	Jupyter Notebook
3	grgong/agent-exam-model-eval Agent exam built from Posit’s model-eval R LLM benchmark (baseline snapshot...	13	Experimental	—	JavaScript