qualifire-dev/rogue
AI Agent Evaluator & Red Team Platform
Rogue rigorously tests AI agents, such as chatbots and automated assistants, to verify that they behave as expected and resist attacks. You point it at your agent and either define your own business rules or choose from various attack simulations; Rogue then interacts with the agent and produces detailed reports on compliance and vulnerabilities. It is aimed at AI product managers, security engineers, and compliance officers who need to validate agent safety and reliability.
1,012 stars. Actively maintained, with 2 commits in the last 30 days.
Use this if you need to automatically verify that your AI agents adhere to business policies, prevent unintended behaviors, or proactively identify security vulnerabilities before deployment.
Not ideal if you are looking for a tool to develop or train AI agents, as its sole purpose is evaluation and red teaming, not agent creation.
Stars: 1,012
Forks: 160
Language: Python
License: —
Category: —
Last pushed: Mar 04, 2026
Commits (30d): 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/qualifire-dev/rogue"
Open to everyone: 100 requests/day with no key. Get a free key for 1,000 requests/day.
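For scripted access, the same endpoint can be called from Python. Below is a minimal sketch using the requests library, assuming the endpoint returns JSON; neither the response schema nor the mechanism for attaching an API key is documented on this page, so the example simply fetches and prints the raw record.

import json
import requests

# Endpoint for this repo's quality data (from the curl example above).
URL = "https://pt-edge.onrender.com/api/v1/quality/agents/qualifire-dev/rogue"

def fetch_agent_quality(url: str = URL) -> dict:
    """Fetch the quality record; no key is needed up to 100 requests/day."""
    resp = requests.get(url, timeout=10)
    resp.raise_for_status()
    return resp.json()  # assumes a JSON body; the schema is not shown here

if __name__ == "__main__":
    data = fetch_agent_quality()
    # Field names are not documented on this page, so pretty-print the
    # whole record rather than guessing at keys.
    print(json.dumps(data, indent=2))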
Related agents
StonyBrookNLP/appworld
🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and...
microsoft/WindowsAgentArena
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of...
future-agi/ai-evaluation
Evaluation Framework for all your AI related Workflows
RouteWorks/RouterArena
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics,...
dreadnode/AIRTBench-Code
Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models