KazKozDev/system-prompt-benchmark

Test your LLM system prompts against 287 real-world attack vectors including prompt injection, jailbreaks, and data leaks.

/ 100

Experimental

When you're building products with Large Language Models (LLMs), this tool helps you automatically test your LLM's core instructions, known as 'system prompts.' You provide your system prompt and select an LLM provider, and the tool runs hundreds of simulated 'attack' scenarios to see how well your prompt defends against things like jailbreaks or data leaks. This is for product managers, AI safety engineers, and anyone deploying LLM-powered applications who needs to ensure their AI behaves as intended.

Use this if you need to rigorously test your LLM system prompts against real-world adversarial inputs before deploying your AI product, ensuring it's robust and secure.

Not ideal if you're testing the entire application pipeline, including user interface elements and complex multi-turn workflows, rather than primarily evaluating the core system prompt's resilience.

AI-safety LLM-security prompt-engineering AI-product-development red-teaming

No Package No Dependents

Maintenance 6 / 25

Adoption 5 / 25

Maturity 13 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

protectai/llm-guard

The Security Toolkit for LLM Interactions

MaxMLang/pytector

Easy to use LLM Prompt Injection Detection / Detector Python Package with support for local...

utkusen/promptmap

a security scanner for custom LLM applications

agencyenterprise/PromptInject

PromptInject is a framework that assembles prompts in a modular fashion to provide a...

Resk-Security/Resk-LLM

Resk is a robust Python library designed to enhance security and manage context when...

Explore Prompt Engineering Tools

All categories Trending Prompt Engineering directory Insights