bliutech/nlp-pdf-malware-detection

ECE 188: Computer Security. Repository for "NLP-based Malware Detection on PDFs", which uses NLP techniques and transformer models to detect malware in PDFs.

24 / 100 (Experimental)

This project helps security analysts and IT professionals determine if a PDF document contains malicious code without needing to open it. You provide a PDF file, and it classifies whether the file is benign or malicious based on an NLP model's analysis of its byte strings. This is for anyone responsible for protecting systems and users from document-borne malware.
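As a rough illustration of what "byte strings" can mean in this setting, the sketch below pulls runs of printable ASCII out of a file's raw bytes without parsing or opening the PDF, much like the Unix `strings` tool. It is an assumption about one plausible preprocessing step, not the repository's actual pipeline; the function name `extract_byte_strings` and the `min_len` parameter are hypothetical.

```python
import re


def extract_byte_strings(data: bytes, min_len: int = 4) -> list[str]:
    """Return runs of at least `min_len` printable ASCII bytes found in `data`.

    Hypothetical helper: the repository may tokenize PDF bytes differently.
    """
    # 0x20-0x7e covers space through tilde; newlines and binary bytes end a run.
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    return [match.decode("ascii") for match in pattern.findall(data)]


# Small in-memory sample standing in for the raw bytes of a PDF file.
sample = b"%PDF-1.7\n\x00\x01/JavaScript (app.alert(1))\n\xff\xfeab\n"
strings = extract_byte_strings(sample)
```

The resulting list of strings is the kind of text sequence that could then be passed to a tokenizer and classifier; because only raw bytes are read, nothing in the document is executed.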

No commits in the last 6 months.

Use this if you need a way to statically analyze PDF files for malware as part of a security screening or content ingestion process.

Not ideal if you need to detect malware in file types other than PDFs, or if you require dynamic analysis by executing the PDF in a sandboxed environment.

cybersecurity malware-analysis document-security threat-detection IT-security
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 9 / 25


Stars: 33
Forks: 3
Language: Python
License: none
Last pushed: Dec 03, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/bliutech/nlp-pdf-malware-detection"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
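The same request can be made from Python using only the standard library. This is a minimal sketch: the path layout (category, owner, repository) is inferred from the single URL shown above, the response format is not documented here so the body is returned as plain text, and the names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment.

    Assumes the path is /<category>/<owner>/<repo>, as in the curl example.
    """
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the quality data and return the raw response body as text."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        # The encoding is assumed to be UTF-8; undecodable bytes are replaced.
        return response.read().decode("utf-8", errors="replace")
```

Calling `fetch_quality("nlp", "bliutech", "nlp-pdf-malware-detection")` issues the same GET request as the curl command and counts against the daily request limit.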