bliutech/nlp-pdf-malware-detection
ECE 188: Computer Security. Repository for "NLP-based Malware Detection on PDFs". Utilizing NLP techniques & transformer models to perform malware detection in PDFs.
This project helps security analysts and IT professionals determine if a PDF document contains malicious code without needing to open it. You provide a PDF file, and it classifies whether the file is benign or malicious based on an NLP model's analysis of its byte strings. This is for anyone responsible for protecting systems and users from document-borne malware.
No commits in the last 6 months.
Use this if you need a way to statically analyze PDF files for malware as part of a security screening or content ingestion process.
Not ideal if you need to detect malware in file types other than PDFs, or if you require dynamic analysis by executing the PDF in a sandboxed environment.
Stars
33
Forks
3
Language
Python
License
—
Category
Last pushed
Dec 03, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/bliutech/nlp-pdf-malware-detection"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jackaduma/SecBERT
pretrained BERT model for cyber security text, learned CyberSecurity Knowledge
koudounasalkis/PoliToHFI-SemEval2023-Task6
This repository contains the code for the paper "PoliToHFI at SemEval-2023 Task 6"
wellcometrust/grants_tagger
Tag grants with MeSH and other tags
harsh-ps-2003/CS787
Text2Med