Bias Measurement Evaluation NLP Tools
Tools and datasets for detecting, measuring, and quantifying bias in NLP models and language systems. Includes benchmarks, metrics, and evaluation methods for assessing fairness across different demographic groups and intersectional categories. Does NOT include general bias mitigation techniques, debiasing methods without evaluation focus, or application-specific bias detection (e.g., hate speech or toxic comment detection).
There are 42 bias measurement evaluation tools tracked. 1 score above 50 (established tier). The highest-rated is dccuchile/wefe at 53/100 with 183 stars.
Get all 42 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=bias-measurement-evaluation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
dccuchile/wefe
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework... |
|
Established |
| 2 |
dreji18/Fairness-in-AI
Detecting Bias and ensuring Fairness in AI solutions |
|
Emerging |
| 3 |
amazon-science/bold
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in... |
|
Emerging |
| 4 |
dhfbk/variationist
Variationist: Exploring Multifaceted Variation and Bias in Written Language... |
|
Emerging |
| 5 |
soarsmu/BiasFinder
BiasFinder | IEEE TSE | Metamorphic Test Generation to Uncover Bias for... |
|
Emerging |
| 6 |
microsoft/SafeNLP
Safety Score for Pre-Trained Language Models |
|
Emerging |
| 7 |
CAMeL-Lab/gender-rewriting-shared-task
Evaluation code and data for the gender rewriting shared task |
|
Experimental |
| 8 |
jasonshaoshun/SAL
code for "Spectral Removal of Guarded Attribute Information" |
|
Experimental |
| 9 |
grecosalvatore/nlpguard
NLPGuard: A Framework for Mitigating the use of Protected Attributes in NLP |
|
Experimental |
| 10 |
princeton-nlp/MABEL
EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data"... |
|
Experimental |
| 11 |
darenr/gender-bias
Real-time Javascipt gender bias detector |
|
Experimental |
| 12 |
krangelie/bias-in-german-nlg
Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies... |
|
Experimental |
| 13 |
feyzaakyurek/bbnli
Bias Benchmark for Natural Language Inference. Code repo for the Findings of... |
|
Experimental |
| 14 |
candacelax/bias-in-vision-and-language
Code for paper "Measuring Social Biases in Grounded Vision and Language Embeddings" |
|
Experimental |
| 15 |
cs329yangzhong/WIKIBIAS
Code and data for EMNLP2021 paper: WIKIBIAS: Detecting Multi-Span Subjective... |
|
Experimental |
| 16 |
yipenglai/Wikipedia-Gender-Bias
Measure gender bias in English Wikipedia biographies through text analysis in R |
|
Experimental |
| 17 |
sathvikn/word_embedding_bias
Companion to my blog post: How Biases in Language get Perpetuated by Technology |
|
Experimental |
| 18 |
minnesotanlp/Quantifying-Annotation-Disagreement
Official implementation of Wan et al's paper "Everyone's Voice Matters:... |
|
Experimental |
| 19 |
PieTempesti98/biases_in_hiring_decisions
Review of the most studied biases in the hiring process made by Pietro... |
|
Experimental |
| 20 |
google-research-datasets/nlp-fairness-for-india
Contains data resources to replicate results from the paper... |
|
Experimental |
| 21 |
groovychoons/GlobalBias
The official repo for the GlobalBias dataset and associated paper: 'Who is... |
|
Experimental |
| 22 |
jasonshaoshun/AMSAL
code for "Erasure of Unaligned Attributes from Neural Representations" |
|
Experimental |
| 23 |
tinotavingeyi-droid/ubuntu-xai
An open-source research platform for evaluating AI bias, fairness, and... |
|
Experimental |
| 24 |
CAMeL-Lab/gender-rewriting
Code, models, and data for "User-Centric Gender Rewriting". NAACL 2022. |
|
Experimental |
| 25 |
martinsjaavik/llm-bias-norwegian
Master thesis on subtler biases |
|
Experimental |
| 26 |
feyzaakyurek/bias-textgen
Code for the paper "Challenges in Measuring Bias in Open-Ended Language... |
|
Experimental |
| 27 |
venkatasg/interpersonal-bias
Code and data for the paper ' How people talk about each other: Modeling... |
|
Experimental |
| 28 |
Ahmad-AlSubaie/CS499-DL-debaising
Repository for research done into the methods used to debias ML models.... |
|
Experimental |
| 29 |
VSteinborn/s_jsd-multilingual-bias
Code and data for the paper "An Information-Theoretic Approach and Dataset... |
|
Experimental |
| 30 |
iamshnoo/soc_bias
Reproduction for NAACL paper on Socially Aware Bias Measurements for Hindi |
|
Experimental |
| 31 |
VSteinborn/politeness-attacks
Code and data for the paper "Politeness Stereotypes and Attack Vectors:... |
|
Experimental |
| 32 |
asimokby/formality-bias-analysis
This repo contains the annotations and other artifacts of the paper titled:... |
|
Experimental |
| 33 |
erica-dessi/Modelli-linguistici-e-discriminazione-nascosta-il-bias-di-genere-nelle-professioni
La presente tesi esplora il fenomeno del bias di genere nei Large Language... |
|
Experimental |
| 34 |
iampeti/Thesis_Gender_Bias
📊 Investigate gender bias in clinical research through statistical analysis... |
|
Experimental |
| 35 |
hyoungjo/lipstick-on-a-pig
Debiasing methods on contextualised embeddings are ineffective - CS475 |
|
Experimental |
| 36 |
ShamikRoy/Moral-Role-Prediction
This repository contains the dataset and codes for the task of Morality... |
|
Experimental |
| 37 |
spidersouris/GeNRe
[ACL 2025 Findings] GeNRe: A French Gender-Neutral Rewriting System Using... |
|
Experimental |
| 38 |
B-VARUN-REDDY/FairwAI-Bias-Detection
Submission for the FairwAI Hospitality Intern Challenge. This project... |
|
Experimental |
| 39 |
thesofakillers/badder-seeds
Official repository for the paper "[Re] Badder Seeds: Reproducing the... |
|
Experimental |
| 40 |
sunyam/bias-literary-classification
Measuring the Effects of Bias in Training Data for Literary Classification |
|
Experimental |
| 41 |
koc-lab/legalbias
This repository contains the required codes for reproducing the results in... |
|
Experimental |
| 42 |
Carolinecasey17/Thesis_NLP_GenderBias_AustralianJobDescriptions
Scripts for Cognitive Science Masters Thesis - Investigating Implicit Gender... |
|
Experimental |