ZapDos7/lakera-gandalf

My solutions for Lakera's Gandalf

34 / 100 (Emerging)

This project provides documented solutions for navigating the 'Gandalf LLM' challenge by Lakera. It shows specific prompts that successfully extract secret passwords from a conversational AI designed to resist revealing them. Anyone interested in understanding large language model (LLM) security, prompt engineering, or AI safety challenges would find this useful.

No commits in the last 6 months.

Use this if you are attempting the Lakera Gandalf challenge and need strategies to bypass an LLM's safety mechanisms and retrieve hidden information.

Not ideal if you are looking for a general-purpose tool to develop new AI applications or integrate LLMs into your existing systems.

Tags: AI safety, prompt engineering, LLM security, AI challenges, red teaming
Status: No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 18 / 25

Stars: 62
Forks: 12
Language: not listed
License: none
Last pushed: May 14, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/ZapDos7/lakera-gandalf"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
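The same request as the curl command above can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which the page does not confirm. The URL pattern (category, then owner, then repository name) is inferred from the single example URL shown here.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the example curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name' (hypothetical helper)."""
    owner, name = repo.split("/", 1)
    parts = (category, owner, name)
    # Percent-encode each path segment so unusual characters cannot break the URL.
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data; assumes the endpoint returns a JSON body."""
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to perform the network request.
    print(quality_url("prompt-engineering", "ZapDos7/lakera-gandalf"))
```

Because unauthenticated access is limited to 100 requests/day, it may be worth caching the result locally rather than calling `fetch_quality` repeatedly.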