apartresearch/3cb
3cb: Catastrophic Cyber Capabilities Benchmarking of Large Language Models
This project helps cybersecurity professionals evaluate whether advanced AI agents possess autonomous hacking capabilities. It takes a collection of carefully designed cyber security challenges and a large language model's API keys as input. The output is a benchmark of the model's performance in solving these challenges, indicating its proficiency in offensive cyber operations. It is intended for AI safety researchers, cybersecurity strategists, and national security analysts concerned with the potential risks of AI.
No commits in the last 6 months.
Use this if you need to rigorously assess the offensive cyber capabilities of large language models against a diverse set of real-world-inspired challenges.
Not ideal if you are looking for a defensive AI tool or a general-purpose security scanner.
Stars
15
Forks
4
Language
Python
License
—
Category
Last pushed
Oct 30, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/apartresearch/3cb"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TalEliyahu/Awesome-AI-Security
Curated resources, research, and tools for securing AI systems
The-Art-of-Hacking/h4cker
This repository is maintained by Omar Santos (@santosomar) and includes thousands of resources...
aw-junaid/Hacking-Tools
This Repository is a collection of different ethical hacking tools and malware's for penetration...
jiep/offensive-ai-compilation
A curated list of useful resources that cover Offensive AI.
Kim-Hammar/csle
A research platform to develop automated security policies using quantitative methods, e.g.,...