qualifire-dev/rogue
AI Agent Evaluator & Red Team Platform
Rogue rigorously tests AI agents, such as chatbots and automated assistants, to verify that they behave as expected and resist attacks. You point it at your agent and either define your own business rules or choose from various attack simulations; Rogue then interacts with the agent and produces detailed reports on compliance and vulnerabilities. It is aimed at AI product managers, security engineers, and compliance officers who need to validate agent safety and reliability.
1,012 stars. Actively maintained, with 2 commits in the last 30 days.
Use this if you need to automatically verify that your AI agents adhere to business policies, prevent unintended behaviors, or proactively identify security vulnerabilities before deployment.
Not ideal if you are looking for a tool to develop or train AI agents, as its sole purpose is evaluation and red teaming, not agent creation.
Stars: 1,012
Forks: 160
Language: Python
License: —
Category: —
Last pushed: Mar 04, 2026
Commits (30d): 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/qualifire-dev/rogue"
Open to everyone: 100 requests/day with no key. Get a free key for 1,000 requests/day.
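For scripted access, the same endpoint can be called from Python. Below is a minimal sketch using the requests library, assuming the endpoint returns JSON; neither the response schema nor the mechanism for attaching an API key is documented on this page, so the example simply fetches and prints the raw record.

import json
import requests

# Endpoint for this repo's quality data (from the curl example above).
URL = "https://pt-edge.onrender.com/api/v1/quality/agents/qualifire-dev/rogue"

def fetch_agent_quality(url: str = URL) -> dict:
    """Fetch the quality record; no key is needed up to 100 requests/day."""
    resp = requests.get(url, timeout=10)
    resp.raise_for_status()
    return resp.json()  # assumes a JSON body; the schema is not shown here

if __name__ == "__main__":
    data = fetch_agent_quality()
    # Field names are not documented on this page, so pretty-print the
    # whole record rather than guessing at keys.
    print(json.dumps(data, indent=2))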
Related agents
StonyBrookNLP/appworld
🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and...
microsoft/WindowsAgentArena
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of...
future-agi/ai-evaluation
Evaluation Framework for all your AI related Workflows
RouteWorks/RouterArena
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics,...
dreadnode/AIRTBench-Code
Code Repository for: AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models