Icyrockton/MegaVul
MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset
This project provides a dataset of C, C++, and Java functions, each labeled as vulnerable or non-vulnerable. It is intended to help security researchers and developers train and evaluate models that detect software vulnerabilities automatically. A model built on the dataset takes a code function (and optionally its Joern graph representation) as input and outputs a classification indicating whether that function is vulnerable.
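As a rough illustration of the function-in, label-out workflow described above, the following sketch parses labeled function records and scores an arbitrary classifier against them. The record keys `func` and `is_vul`, the `FunctionSample` type, and the helper names are assumptions made for this example; check them against the actual dataset files before relying on them.

```python
import json
from collections import Counter
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class FunctionSample:
    """One labeled function (hypothetical container, not part of MegaVul)."""
    code: str         # source text of a single C/C++/Java function
    vulnerable: bool  # ground-truth label


def parse_samples(raw_json: str,
                  code_key: str = "func",      # assumed key name
                  label_key: str = "is_vul"    # assumed key name
                  ) -> List[FunctionSample]:
    """Parse a JSON array of records into FunctionSample objects."""
    records = json.loads(raw_json)
    return [FunctionSample(code=r[code_key], vulnerable=bool(r[label_key]))
            for r in records]


def label_counts(samples: List[FunctionSample]) -> Counter:
    """Count vulnerable and non-vulnerable samples to check class balance."""
    return Counter("vulnerable" if s.vulnerable else "non-vulnerable"
                   for s in samples)


def accuracy(samples: List[FunctionSample],
             predict: Callable[[str], bool]) -> float:
    """Fraction of samples where predict(code) matches the label."""
    if not samples:
        return 0.0
    correct = sum(1 for s in samples if predict(s.code) == s.vulnerable)
    return correct / len(samples)
```

Any model can be plugged in as `predict`, as long as it maps a function's source text to a boolean. Accuracy alone can be misleading when the two classes are imbalanced, so `label_counts` is worth checking first.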
139 stars. No commits in the last 6 months.
Use this if you are a security researcher or software developer building or evaluating machine learning models for automated vulnerability detection in C, C++, or Java codebases.
Not ideal if you need a real-time vulnerability scanner for active codebases or a tool to analyze application security posture without building custom detection models.
Stars: 139
Forks: 18
Language: Python
License: GPL-3.0
Category:
Last pushed: Jan 12, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Icyrockton/MegaVul"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
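The same request can be made from Python using only the standard library. This is a minimal sketch: the URL layout is copied from the curl command above, while the helper names `build_quality_url` and `fetch_quality` are hypothetical. The response format is not documented here, so the body is returned as plain text.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example; everything else is illustrative.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL, percent-encoding each path segment."""
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return API_BASE + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str,
                  timeout: float = 10.0) -> str:
    """Send a GET request and return the response body as text.

    The body is assumed to be UTF-8 text; decode or parse it further
    (for example with json.loads) once the actual format is known.
    """
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example call (performs a network request, so it is left commented out):
# print(fetch_quality("ml-frameworks", "Icyrockton", "MegaVul"))
```

With no key the stated limit is 100 requests/day, so cache results rather than calling the endpoint in a loop.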
Higher-rated alternatives
nmap/nmap
Nmap - the Network Mapper. Github mirror of official SVN repository.
e-m-b-a/emba
EMBA - The firmware security analyzer
ait-testbed/attackbed
The AttackBed is a simulated enterprise network with numerous vulnerabilities. Attacks in this...
ritesh-gupta-git/AI-Powered-Vulnerability-Management
AI-VMF: AI-Powered Vulnerability Management Framework demo (ensemble exploit prediction + risk scoring)
scorpiondefense/cyberweapons
Automated Cyber Offense