davidsvy/Neural-Scam-Artist

Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.

/ 100

Emerging

This project helps researchers and developers working on email fraud detection by providing a specialized dataset. It scrapes scam emails from anti-fraud websites, processes them to remove duplicates and extraneous metadata, and then organizes them into a clean, ready-to-use dataset. The output is a high-quality collection of fraudulent email examples, ideal for training intelligent email assistants.

No commits in the last 6 months.

Use this if you need a pre-curated, deduplicated dataset of fraudulent emails to train or evaluate AI models for detecting scams.

Not ideal if you need a real-time email analysis tool or a solution for detecting general spam rather than specifically scam-related content.

email-fraud-detection anti-scam-research natural-language-processing text-data-preparation machine-learning-datasets

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

jklu-jaipur/Political-Biasness-Detection

Our ML model calculates the biasness of a political article based on linguistic features and...

yamanalab/why-darkpattern

[Proc of IEEE BigData 2023] Why is the User Interface a Dark Pattern? : Explainable...

sdarjunwadkar/Political-Idealogies-Prediction-in-News-Articles

Media diversity shapes perspectives, yet biased news distorts reality, fostering misinformation....

nerdimite/bert-web-app

Code for the FullStack AI Live Coding Series- Part 2 (CellStrat AI Lab)

MichiganNLP/deceptiondetection

Deception Detection project website

Explore NLP Tools

All categories Trending NLP directory Insights