jind11/TextFooler
A Model for Natural Language Attack on Text Classification and Inference
This tool generates adversarial examples by swapping individual words for semantically similar substitutes, small edits that preserve the text's meaning for human readers yet can fool AI models for text classification or natural language inference. Given an existing text dataset and a trained target model, it outputs modified examples that push the model toward incorrect predictions. It is useful for anyone evaluating or stress-testing the robustness of AI-powered text analysis systems.
529 stars. No commits in the last 6 months.
Use this if you need to test the resilience and potential vulnerabilities of your AI text classification or natural language inference models against subtle textual manipulations.
Not ideal if you are looking to improve the general accuracy of your AI models or if your primary concern is with typical data noise rather than targeted adversarial attacks.
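The core recipe behind this kind of attack is simple: identify the words that most influence the model's prediction, then replace them with close synonyms until the label flips, while keeping the sentence fluent. Below is a minimal, hypothetical Python sketch of that word-substitution idea, using a toy classifier and a hand-written synonym table; it is not TextFooler's implementation, which ranks word importance by deletion and draws substitutes from counter-fitted word embeddings under part-of-speech and sentence-similarity constraints.

# Minimal sketch of a greedy word-substitution attack, assuming a generic
# predict_proba(text) -> {label: prob} classifier and a toy synonym table.
# Illustrative only; not TextFooler's actual algorithm.

from typing import Callable, Dict, List

# Hypothetical synonym lookup -- a real attack would draw candidates from
# an embedding space or thesaurus, not a hand-written dictionary.
SYNONYMS: Dict[str, List[str]] = {
    "terrible": ["dreadful", "awful"],
    "boring": ["dull", "tedious"],
    "great": ["fine", "decent"],
}

def greedy_word_substitution_attack(
    text: str,
    target_label: str,
    predict_proba: Callable[[str], Dict[str, float]],
) -> str:
    """Greedily swap words for synonyms whenever the swap lowers the
    classifier's confidence in `target_label`."""
    words = text.split()
    best_score = predict_proba(text)[target_label]
    for i, word in enumerate(words):
        for candidate in SYNONYMS.get(word.lower(), []):
            trial = words.copy()
            trial[i] = candidate
            score = predict_proba(" ".join(trial))[target_label]
            if score < best_score:  # keep the swap that hurts the model most
                best_score = score
                words = trial
    return " ".join(words)

if __name__ == "__main__":
    # Toy "model": a keyword counter standing in for a trained classifier.
    def toy_predict(text: str) -> Dict[str, float]:
        negative_hits = sum(w in {"terrible", "boring"} for w in text.lower().split())
        neg = min(1.0, 0.5 + 0.25 * negative_hits)
        return {"negative": neg, "positive": 1.0 - neg}

    original = "The movie was terrible and boring"
    adversarial = greedy_word_substitution_attack(original, "negative", toy_predict)
    print(adversarial)  # swapped words lower the toy model's 'negative' score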
Stars: 529
Forks: 87
Language: Python
License: MIT
Category: nlp
Last pushed: Dec 08, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jind11/TextFooler"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
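If you would rather fetch the same record from a script, here is a minimal Python sketch using only the standard library; it assumes the endpoint returns JSON, and the exact response fields are not documented here.

# Fetch the quality record for jind11/TextFooler from the public endpoint.
# Assumes a JSON response; prints whatever the service returns without
# assuming a particular schema.
import json
import urllib.request

URL = "https://pt-edge.onrender.com/api/v1/quality/nlp/jind11/TextFooler"

with urllib.request.urlopen(URL) as response:
    data = json.load(response)

print(json.dumps(data, indent=2))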
Higher-rated alternatives
thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack.
thunlp/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
thunlp/OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
thunlp/SememePSO-Attack
Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial...
osoleve/glitchlings
Enemies for your LLM