NLP Model Interpretability NLP Tools
Tools and frameworks for explaining, visualizing, and understanding the decisions of NLP and ML models through techniques like feature attribution, concept activation vectors, attention analysis, and model-agnostic explanations. Does NOT include general model evaluation, performance metrics, or bias detection frameworks without interpretability focus.
There are 36 nlp model interpretability tools tracked. 4 score above 50 (established tier). The highest-rated is rmovva/HypotheSAEs at 64/100 with 77 stars.
Get all 36 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=nlp-model-interpretability&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
rmovva/HypotheSAEs
HypotheSAEs: hypothesizing interpretable relationships in text datasets... |
|
Established |
| 2 |
interpretml/interpret-text
A library that incorporates state-of-the-art explainers for text-based... |
|
Established |
| 3 |
fdalvi/NeuroX
A Python library that encapsulates various methods for neuron interpretation... |
|
Established |
| 4 |
jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates... |
|
Established |
| 5 |
alexdyysp/ESIM-pytorch
中国高校计算机大赛--大数据挑战赛 |
|
Emerging |
| 6 |
MultiplEYE-COST/wg1-experiment-implementation
In this repository we keep the code for the implementation of the... |
|
Emerging |
| 7 |
NeuroLIAA/reading-et
Eye-tracking during reading of short stories |
|
Emerging |
| 8 |
adaamko/POTATO
XAI based human-in-the-loop framework for automatic rule-learning. |
|
Emerging |
| 9 |
RiccardoSpolaor/Verbal-Explanations-of-Spatio-Temporal-Graph-Neural-Networks-for-Traffic-Forecasting
An eXplainable AI system to elucidate short-term speed forecasts in traffic... |
|
Emerging |
| 10 |
octanove/expats
EXPATS: A Toolkit for Explainable Automated Text Scoring |
|
Emerging |
| 11 |
ymcui/mrc-model-analysis
Multilingual Multi-Aspect Explainability Analyses on Machine Reading... |
|
Emerging |
| 12 |
mohsenfayyaz/DecompX
DecompX: Explaining Transformers Decisions by Propagating Token... |
|
Emerging |
| 13 |
ravipatelxyz/nlp-ethics
In depth evaluation of the ETHICS utilitarianism task dataset. An assessment... |
|
Emerging |
| 14 |
jwliao1209/Explainable-NLP
2022 AI CUP Explainable Information Tagging Competition for Natural Language... |
|
Emerging |
| 15 |
robinvanschaik/interpret-flair
A small repository to test Captum Explainable AI with a trained Flair... |
|
Emerging |
| 16 |
hint-lab/doctrack
Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset... |
|
Experimental |
| 17 |
synapticore-io/ethics-model
A modern, modular PyTorch framework for ethical text analysis, manipulation... |
|
Experimental |
| 18 |
MichiganNLP/micromodels
Micromodels -- A framework for accurate, explainable, data efficient, and... |
|
Experimental |
| 19 |
fursovia/tcav_nlp
"Interpretability Beyond Feature Attribution: Quantitative Testing with... |
|
Experimental |
| 20 |
avijit-thawani/numeracy-literacy
Code for paper: "Numeracy Enhances the Literacy of Language Models" |
|
Experimental |
| 21 |
christianwarmuth/explainable-predictive-process-monitoring-with-text
On the Potential of Textual Data for Explainable Predictive Process... |
|
Experimental |
| 22 |
mainlp/gaze-guided-text-generation
Code and data for the paper "Controlling Reading Ease with Gaze-Guided Text... |
|
Experimental |
| 23 |
amcrisan/interactive-model-cards
An experimental project that examines whether interactivity can augment... |
|
Experimental |
| 24 |
MortadhaMannai/XAI_ConstrainedAttentionVerifier
Code for the NLDB 2023 paper. Work partially funded by grant... |
|
Experimental |
| 25 |
shresthasingh1501/legal_document_analysis
Legal document analysis using BERT and FlanT5 |
|
Experimental |
| 26 |
christophsk/classifier-lit
PAIR Code's Language Interpretability Tool (LIT) for Text Classification |
|
Experimental |
| 27 |
StonyBrookNLP/irene
[ACL 2021] IrEne: Interpretable Energy Prediction for Transformers |
|
Experimental |
| 28 |
pradippramanick/coexp-iros24
Code and data for the IROS 2024 paper - Multimodal Coherent Explanation... |
|
Experimental |
| 29 |
EBAnO-Ecosystem/Text-EBAnO-Express
T-EBAnO: Explaining deep learning black-box models for Natural Language Processing. |
|
Experimental |
| 30 |
shreyas-kowshik/nlp4if
Code for the runners up entry on the English subtask on the... |
|
Experimental |
| 31 |
danchern97/tda4la
This is an official repository for "Acceptability Judgements via Examining... |
|
Experimental |
| 32 |
meiyor/DeepGaze-Text-Embedding-Map
DeepGaze + Text-Embedding-Map project developed in Cardiff University -... |
|
Experimental |
| 33 |
DIME-XAI/dime-xai
Implementation of Dual Interpretable Model-agnostic Explanations for Rasa... |
|
Experimental |
| 34 |
yang-su2000/State-Space-Interpretability
Investigation of state space model interpretability using SHAP (SHapley... |
|
Experimental |
| 35 |
MortadhaMannai/XAIPalette-Exploring-Image-Structure-via-Color-Distillation-for-Explainable-AI
This repository contains the PyTorch implementation of XAIPalette, a novel... |
|
Experimental |
| 36 |
lennox55555/Legal-BERT-RLHF
This web app is part of a research project to identify and address biases in... |
|
Experimental |