AhsanAyub/malicious-prompt-detection

Detection of malicious prompts used to exploit large language models (LLMs) by leveraging supervised machine learning classifiers.

/ 100

Experimental

This project helps developers and engineers building applications powered by large language models (LLMs) to identify and block malicious prompts. It takes user input prompts and classifies them as either 'benign' or 'malicious' to prevent prompt injection attacks. This is for engineers and developers responsible for the security and robustness of their LLM-based applications.

No commits in the last 6 months.

Use this if you are building an application that uses large language models and need to protect it from prompt injection attacks.

Not ideal if you are a non-technical user looking for a ready-to-use content moderation tool for general text inputs, rather than LLM-specific security.

LLM security prompt engineering AI application development cybersecurity machine learning engineering

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Compare

malicious-prompt-detection and PromptScreen

Higher-rated alternatives

dronefreak/PromptScreen

Protect your LLMs from prompt injection and jailbreak attacks. Easy-to-use Python package with...

anmolksachan/LLMInjector

Burp Suite Extension for LLM Prompt Injection Testing

rv427447/Cognitive-Hijacking-in-Long-Context-LLMs

🧠 Explore cognitive hijacking in long-context LLMs, revealing vulnerabilities in prompt...

moketchups/permanently-jailbroken

We asked 6 AIs about their own programming. All 6 said jailbreaking will never be fixed. Run it...

AdityaBhatt3010/When-LinkedIn-Gmail-Obey-Hidden-AI-Prompts-Lessons-in-Indirect-Prompt-Injection

A real-world look at how hidden instructions in profiles and emails trick AI into unexpected...

Explore Prompt Engineering Tools

All categories Trending Prompt Engineering directory Insights