youdotcom-oss/web-search-agent-evals
Extensible benchmarking suite for evaluating AI coding agents on web search tasks. Compare native search vs MCP servers (You.com, expanding) across multiple agents (Claude Code, Gemini, Droid, Codex, expanding) with automated Docker workflows and statistical analysis.
14 / 100
Experimental
No License
No Package
No Dependents
Maintenance: 10 / 25
Adoption: 1 / 25
Maturity: 3 / 25
Community: 0 / 25
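The four sub-scores above add up to the overall score shown at the top of the page. The sketch below checks that arithmetic, assuming the total is a plain unweighted sum of the four categories. The page does not state how the total is derived, so `totalScore` and the `subScores` object are illustrative names, not part of any published scoring code.

```typescript
// Sub-scores as listed on this page; each category is scored out of 25.
const subScores: Record<string, number> = {
  maintenance: 10,
  adoption: 1,
  maturity: 3,
  community: 0,
};

// Assumption: the overall score is the unweighted sum of the sub-scores.
function totalScore(scores: Record<string, number>): number {
  return Object.values(scores).reduce((sum, s) => sum + s, 0);
}

const total = totalScore(subScores);
console.log(`${total} / 100`); // prints "14 / 100"
```

With four categories capped at 25 each, the maximum under this assumption is 100, which matches the denominator shown on the page.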
Stars: 1
Forks: —
Language: TypeScript
License: —
Category:
Last pushed: Feb 27, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/youdotcom-oss/web-search-agent-evals"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
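The same request can be made from TypeScript. This is a minimal sketch, assuming a runtime with a global `fetch` (for example Node 18 or later) and assuming the endpoint returns JSON. The response shape is not documented on this page, so the result is typed as `unknown`. The helper names `qualityUrl` and `fetchQuality` are hypothetical, and treating the last two path segments as owner and repository is inferred from the curl example only.

```typescript
// Base path copied from the curl example on this page.
const BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents";

// Hypothetical helper: builds the endpoint URL for an owner/repo pair.
function qualityUrl(owner: string, repo: string): string {
  return `${BASE_URL}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Hypothetical helper: fetches the quality record without an API key.
// Assumes a JSON body; the schema is unknown, so no fields are read here.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(owner, repo));
  if (!res.ok) {
    throw new Error(`Request failed: HTTP ${res.status}`);
  }
  return res.json();
}

// Example usage (performs a network call, so it is left commented out):
// fetchQuality("youdotcom-oss", "web-search-agent-evals").then(console.log);
```

Unauthenticated calls count against the 100 requests/day limit, so a caller polling many repositories would likely want to cache results.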