auto-pen-bench and genai-pentest-paper
About auto-pen-bench
lucagioacchini/auto-pen-bench
This repo contains the codes of the penetration test benchmark for Generative Agents presented in the paper "AutoPenBench: Benchmarking Generative Agents for Penetration Testing". It contains also the instructions to install, develop and test new vulnerable containers to include in the benchmark.
This benchmark helps cybersecurity researchers and professionals evaluate how well AI-powered penetration testing agents can find vulnerabilities in simulated systems. It takes a generative AI agent and a definition of a vulnerable machine, then reports on the agent's ability to identify and exploit weaknesses. Security analysts, red teamers, and AI researchers focused on offensive security would use this to assess agent performance.
About genai-pentest-paper
lucagioacchini/genai-pentest-paper
This repo contains the codes for the experiments of the paper "AutoPenBench: Benchmarking Generative Agents for Penetration Testing".
This project provides the code to reproduce experiments from a research paper on benchmarking generative AI agents for penetration testing. It takes experiment configurations as input and outputs raw experimental results, which are then processed for analysis. This is for researchers and academics focused on cybersecurity, AI, and penetration testing.
Scores updated daily from GitHub, PyPI, and npm data. How scores work