0xh3xa/awesome-malware-benign-datasets

🪲 A list of malware and benign datasets for malware research

44 / 100 (Emerging)

This is a curated list of datasets of both malicious and benign software, including Windows executables, Android applications, and document files. It helps malware analysts and cybersecurity researchers, including those applying AI, find data to develop and test their detection models. Datasets can be located by type, size, and whether they contain malware families, benign samples, or specific types of threats.

Use this if you need comprehensive, pre-collected datasets to train and evaluate malware detection systems or to conduct malware analysis research.

Not ideal if you are looking for real-time threat intelligence feeds or tools to analyze live malware samples.

cybersecurity research, malware analysis, threat intelligence, security data science, binary analysis
No Package, No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 11 / 25
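
The four category scores above, each out of 25, add up to the overall 44 / 100 shown at the top. The page does not state the scoring formula, so treating the total as a plain sum is an assumption that happens to match the listed numbers. A minimal sketch of that arithmetic:

```python
# Category scores as listed on this page; each is out of 25.
subscores = {
    "Maintenance": 10,
    "Adoption": 7,
    "Maturity": 16,
    "Community": 11,
}
MAX_PER_CATEGORY = 25

# Assumption: the overall score is the plain sum of the four categories.
total = sum(subscores.values())
max_total = MAX_PER_CATEGORY * len(subscores)

print(f"{total} / {max_total}")  # prints "44 / 100"
```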


Stars 33
Forks 4
Language (none listed)
License CC0-1.0
Last pushed Jan 31, 2026
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/0xh3xa/awesome-malware-benign-datasets"
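
The same request can be made from Python with only the standard library. This is a rough sketch: the URL pattern is taken from the curl command above, while the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response format is not documented on this page, so the code tries JSON and falls back to the raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper, not part of any official client)."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for one repository.

    Assumption: the body is JSON. If it does not parse, the raw text is returned.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


# Example call (performs a network request):
# data = fetch_quality("ml-frameworks", "0xh3xa", "awesome-malware-benign-datasets")
```

Keeping URL construction separate from the network call makes the helper easy to check offline and to reuse for other repositories.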

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.