iosifache/DikeDataset

Dataset with labeled benign and malicious files 🗃️

44
/ 100
Emerging

This dataset provides a collection of executable (PE) and Office (OLE) files, carefully labeled as either benign or malicious, and further categorized by malware family. It allows cybersecurity researchers and machine learning engineers to train AI models for classifying unknown files, helping to identify threats and specific malware types. You input raw PE and OLE files, and the output is a set of labels indicating malice and malware family membership.

150 stars. No commits in the last 6 months.

Use this if you are a cybersecurity researcher or a machine learning engineer building a system to automatically detect and classify malware from executable or Office files.

Not ideal if you are looking for a tool that directly scans and protects your system, as this is a dataset for training AI, not an active security product.

malware-detection cybersecurity-research threat-intelligence machine-learning-engineering file-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

150

Forks

25

Language

TeX

License

MIT

Last pushed

Jul 19, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/iosifache/DikeDataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.