Llm Comparison Evaluation Generative AI Tools

There are 3 llm comparison evaluation tools tracked. The highest-rated is ml-energy/leaderboard-v2 at 40/100 with 50 stars.

Get all 3 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=generative-ai&subcategory=llm-comparison-evaluation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 ml-energy/leaderboard-v2

A canonical source of GenAI energy benchmark and meausrements

40
Emerging
2 dessertlab/Human_vs_AI_Code_Quality

This repository allows the replication of our study "Human-Written vs....

23
Experimental
3 Trust4AI/MUSE

AI-driven Metamorphic Testing Inputs Generator

13
Experimental