google-research-datasets/nlp-fairness-for-india
Contains data resources to replicate results from the paper “Re-contextualizing Fairness in NLP: The Case of India”.
This project provides data to examine how Natural Language Processing (NLP) models might reflect specific biases relevant to India. It takes in lists of identity terms (like 'Gujarati') and attributes (like 'entrepreneur'), along with human annotations about their stereotypical associations. The output helps researchers and practitioners understand and reproduce analysis of these biases within NLP corpora and models, especially for the Indian geo-cultural context.
No commits in the last 6 months.
Use this if you are an NLP researcher, data scientist, or ethicist focusing on fairness and bias in AI, particularly within the Indian linguistic and cultural landscape.
Not ideal if you are looking for a general-purpose dataset on global NLP fairness without a specific focus on India, or if you need to perform bias analysis on non-textual data.
Stars
12
Forks
—
Language
—
License
Apache-2.0
Category
Last pushed
Jul 04, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/google-research-datasets/nlp-fairness-for-india"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
dccuchile/wefe
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes...
dreji18/Fairness-in-AI
Detecting Bias and ensuring Fairness in AI solutions
amazon-science/bold
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language...
dhfbk/variationist
Variationist: Exploring Multifaceted Variation and Bias in Written Language Data (ACL 2024 demo track)
soarsmu/BiasFinder
BiasFinder | IEEE TSE | Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems