auto-pen-bench and genai-pentest-paper

auto-pen-bench
49
Emerging
genai-pentest-paper
41
Emerging
Maintenance 6/25
Adoption 8/25
Maturity 16/25
Community 19/25
Maintenance 6/25
Adoption 5/25
Maturity 16/25
Community 14/25
Stars: 68
Forks: 20
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stars: 13
Forks: 3
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No Package No Dependents
No Package No Dependents

About auto-pen-bench

lucagioacchini/auto-pen-bench

This repo contains the codes of the penetration test benchmark for Generative Agents presented in the paper "AutoPenBench: Benchmarking Generative Agents for Penetration Testing". It contains also the instructions to install, develop and test new vulnerable containers to include in the benchmark.

This benchmark helps cybersecurity researchers and professionals evaluate how well AI-powered penetration testing agents can find vulnerabilities in simulated systems. It takes a generative AI agent and a definition of a vulnerable machine, then reports on the agent's ability to identify and exploit weaknesses. Security analysts, red teamers, and AI researchers focused on offensive security would use this to assess agent performance.

penetration-testing red-teaming vulnerability-assessment cybersecurity-research generative-AI-security

About genai-pentest-paper

lucagioacchini/genai-pentest-paper

This repo contains the codes for the experiments of the paper "AutoPenBench: Benchmarking Generative Agents for Penetration Testing".

This project provides the code to reproduce experiments from a research paper on benchmarking generative AI agents for penetration testing. It takes experiment configurations as input and outputs raw experimental results, which are then processed for analysis. This is for researchers and academics focused on cybersecurity, AI, and penetration testing.

cybersecurity research AI agent benchmarking penetration testing evaluation generative AI security academic research

Scores updated daily from GitHub, PyPI, and npm data. How scores work