chong-z/nlp-second-order-attack

[NAACL 2021] Code for "Double Perturbation: On the Robustness of Robustness and Counterfactual Bias Evaluation"

/ 100

Experimental

This project helps evaluate the reliability of natural language processing (NLP) models. It takes an existing NLP model and a dataset of text, then systematically modifies the text to identify vulnerabilities and hidden biases in the model's predictions. The output helps machine learning engineers and researchers understand how robust their NLP models truly are when faced with slight variations in input data.

No commits in the last 6 months.

Use this if you need to deeply assess the stability and fairness of your NLP models by finding subtle weaknesses that standard evaluations might miss.

Not ideal if you are looking for a tool to simply train or deploy an NLP model without needing to rigorously test its adversarial robustness or counterfactual bias.

natural-language-processing model-evaluation ai-ethics text-analytics bias-detection

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

thunlp/OpenAttack

An Open-Source Package for Textual Adversarial Attack.

thunlp/TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

jind11/TextFooler

A Model for Natural Language Attack on Text Classification and Inference

thunlp/OpenBackdoor

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

thunlp/SememePSO-Attack

Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial...

Explore NLP Tools

All categories Trending NLP directory Insights