claws-lab/petgen

A PyTorch implementation of the ACM SIGKDD 2021 paper titled "PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models"

/ 100

Emerging

This project helps online platform moderators and trust & safety teams evaluate the resilience of their deep learning models designed to detect malicious user behavior. It takes a sequence of user posts and a model's existing classification of those posts to generate new, personalized text that malicious users might write to bypass detection. The output is generated text designed to fool the detection model while still appearing plausible.

No commits in the last 6 months.

Use this if you are responsible for maintaining the security and integrity of online platforms and need to proactively test your malicious user detection systems against sophisticated text-based adversarial attacks.

Not ideal if you are looking for a general-purpose text generation tool or a solution to directly filter or block malicious content.

online-trust-and-safety content-moderation fraud-detection user-behavior-analysis platform-security

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

nlpcloud/nlpcloud-js

NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis,...

langformers/langformers

🚀 Unified NLP Pipelines for Language Models

Hellisotherpeople/CX_DB8

a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the...

EQTPartners/TSDE

TSDE is a novel SSL framework for TSRL, the first of its kind, effectively harnessing a...

nlpcloud/nlpcloud-php

NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis,...

Explore Embedding Tools

All categories Trending Embeddings directory Insights