cmu-sei/feud
AI Division, Reverse Engineering CNN Trojans
This project helps security researchers and AI assurance specialists understand and reverse-engineer poisoned CNN models. You input a compromised Convolutional Neural Network (CNN) and a set of salient images for the target class; it outputs a refined, human-interpretable description and image of the hidden 'trojan' trigger, helping you identify and mitigate malicious manipulations within AI systems.
No commits in the last 6 months.
Use this if you need to investigate and characterize a 'trojan' or adversarial patch embedded within a Convolutional Neural Network.
Not ideal if you are looking for a general-purpose CNN interpretability tool for benign models or a solution to remove trojans automatically.
Stars: 9
Forks: 1
Language: Python
License: —
Last pushed: Apr 09, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/cmu-sei/feud"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
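The same request can be sketched in Python using only the standard library. The endpoint URL comes from the curl command above; the `api_key` query-parameter name and the assumption that the endpoint returns JSON are guesses for illustration, not documented behavior.

```python
import json
import urllib.parse
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(owner, repo, api_key=None):
    """Build the quality-API URL for a repository.

    The `api_key` query parameter is a hypothetical name for the
    free-key mechanism mentioned above; check the API docs for the
    real parameter or header.
    """
    url = f"{BASE}/ml-frameworks/{owner}/{repo}"
    if api_key:
        url += "?" + urllib.parse.urlencode({"api_key": api_key})
    return url


def fetch_quality(owner, repo, api_key=None):
    """Fetch the endpoint and decode the body, assuming a JSON response."""
    with urllib.request.urlopen(build_url(owner, repo, api_key)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Same URL as the curl example above.
    print(build_url("cmu-sei", "feud"))
```

Without a key this stays within the 100 requests/day anonymous limit; pass `api_key` once you have a free key for the higher quota.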
Higher-rated alternatives
obss/sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
tensorflow/tcav
Code for the TCAV ML interpretability project
MAIF/shapash
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent...
TeamHG-Memex/eli5
A library for debugging/inspecting machine learning classifiers and explaining their predictions
csinva/imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling...