davidsvy/Neural-Scam-Artist

Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.

32
/ 100
Emerging

This project helps researchers and developers working on email fraud detection by providing a specialized dataset. It scrapes scam emails from anti-fraud websites, processes them to remove duplicates and extraneous metadata, and then organizes them into a clean, ready-to-use dataset. The output is a high-quality collection of fraudulent email examples, ideal for training intelligent email assistants.

No commits in the last 6 months.

Use this if you need a pre-curated, deduplicated dataset of fraudulent emails to train or evaluate AI models for detecting scams.

Not ideal if you need a real-time email analysis tool or a solution for detecting general spam rather than specifically scam-related content.

email-fraud-detection anti-scam-research natural-language-processing text-data-preparation machine-learning-datasets
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

28

Forks

3

Language

Python

License

MIT

Last pushed

Oct 30, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/davidsvy/Neural-Scam-Artist"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.