LukasStruppek/Plug-and-Play-Attacks

[ICML 2022 / ICLR 2024] Source code for our papers "Plug & Play Attacks: Towards Robust and Flexible Model Inversion Attacks" and "Be Careful What You Smooth For".

/ 100

Emerging

This project helps privacy and security researchers evaluate how much sensitive information a machine learning model might be leaking about its training data. It takes an existing image classifier and a pre-trained image generator (like StyleGAN2) as input, then produces synthetic images that reveal characteristic features from the model's training data. This tool is designed for researchers focused on model privacy, enabling them to test the robustness of their models against advanced inversion attacks and understand the implications of different training techniques like label smoothing.

No commits in the last 6 months.

Use this if you are a machine learning researcher or privacy engineer who needs to assess the privacy leakage of image classification models by generating class-representative samples that mimic the characteristics of private training data.

Not ideal if you are looking for a general-purpose image generation tool or a method to anonymize datasets directly, as this tool is specifically for auditing model privacy via inversion attacks.

model-privacy data-leakage adversarial-attacks AI-security facial-recognition-privacy

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

QData/TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model...

ebagdasa/backdoors101

Backdoors Framework for Deep Learning and Federated Learning. A light-weight tool to conduct...

THUYimingLi/backdoor-learning-resources

A list of backdoor learning resources

zhangzp9970/MIA

Unofficial pytorch implementation of paper: Model Inversion Attacks that Exploit Confidence...

VinAIResearch/Warping-based_Backdoor_Attack-release

WaNet - Imperceptible Warping-based Backdoor Attack (ICLR 2021)

Explore ML Frameworks

All categories Trending ML Framework directory Insights