davidsvy/Neural-Scam-Artist
Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
This project helps researchers and developers working on email fraud detection by providing a specialized dataset. It scrapes scam emails from anti-fraud websites, processes them to remove duplicates and extraneous metadata, and then organizes them into a clean, ready-to-use dataset. The output is a high-quality collection of fraudulent email examples, ideal for training intelligent email assistants.
No commits in the last 6 months.
Use this if you need a pre-curated, deduplicated dataset of fraudulent emails to train or evaluate AI models for detecting scams.
Not ideal if you need a real-time email analysis tool or a solution for detecting general spam rather than specifically scam-related content.
Stars
28
Forks
3
Language
Python
License
MIT
Category
Last pushed
Oct 30, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/davidsvy/Neural-Scam-Artist"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jklu-jaipur/Political-Biasness-Detection
Our ML model calculates the biasness of a political article based on linguistic features and...
yamanalab/why-darkpattern
[Proc of IEEE BigData 2023] Why is the User Interface a Dark Pattern? : Explainable...
sdarjunwadkar/Political-Idealogies-Prediction-in-News-Articles
Media diversity shapes perspectives, yet biased news distorts reality, fostering misinformation....
nerdimite/bert-web-app
Code for the FullStack AI Live Coding Series- Part 2 (CellStrat AI Lab)
MichiganNLP/deceptiondetection
Deception Detection project website