usestrix/benchmarks

Evaluation harness for the Strix agent

34 / 100 (Emerging)

This tool helps cybersecurity professionals evaluate how well a Strix agent performs against common web security threats. You provide a Strix agent, and the harness runs it through a series of simulated capture-the-flag (CTF) challenges, then reports on its ability to identify and respond to exploits. Security engineers and red teamers can use it to assess agent effectiveness.

Use this if you need to rigorously test and benchmark the performance of your Strix security agent in identifying web vulnerabilities.

Not ideal if you are looking for a general web vulnerability scanner or a tool to evaluate security products other than Strix agents.

cybersecurity-evaluation web-security-testing ctf-benchmarking security-agent-assessment red-teaming-tools
No Package, No Dependents
Maintenance 10 / 25
Adoption 5 / 25
Maturity 11 / 25
Community 8 / 25
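
As a quick check on the numbers above, the four category scores add up to the headline figure. The sketch below assumes the overall score is a plain sum of the four categories, each out of 25; this page does not state the formula, so treat that as an inference from the arithmetic rather than a documented rule.

```python
# Category scores as listed on this page (each out of 25).
category_scores = {
    "Maintenance": 10,
    "Adoption": 5,
    "Maturity": 11,
    "Community": 8,
}

# Assumption: the overall score is the unweighted sum of the categories.
overall = sum(category_scores.values())
max_overall = 25 * len(category_scores)

print(f"{overall} / {max_overall}")  # prints "34 / 100"
```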

Stars: 9
Forks: 1
Language: Python
License: Apache-2.0
Last pushed: Jan 23, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/usestrix/benchmarks"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
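
The curl request above can also be made from Python. This is a minimal sketch using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which this page does not confirm; the response fields are not documented here, so the sketch returns the parsed payload without interpreting it.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality endpoint URL for an owner/repo pair."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repo.

    Assumption: the response body is JSON. Raises urllib.error.HTTPError
    on non-2xx responses, e.g. if the daily request limit is exceeded.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example (performs a network request, counts against the daily limit):
# data = fetch_quality("usestrix", "benchmarks")
# print(json.dumps(data, indent=2))
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than calling `fetch_quality` in a loop.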