Model Evaluation Diagnostics Transformer Models
Tools for systematically evaluating, diagnosing, and benchmarking transformer models across NLI, WSD, and other NLP tasks using standard test sets and evaluation frameworks. Does NOT include general model training, fine-tuning without evaluation focus, or language-specific model overviews.
This category tracks 57 model evaluation diagnostics projects. Only one scores above 50 (the Established tier). The highest-rated is LoicGrobol/zeldarose at 51/100 with 28 stars.
Get all 57 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=model-evaluation-diagnostics&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
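The same request can be made from a script. The following is a minimal Python sketch, assuming only what the curl example shows: the endpoint URL and the `domain`, `subcategory`, and `limit` query parameters. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented on this page, so the sketch returns the decoded body without inspecting it. The sample request uses `limit=20`; whether a larger value returns all 57 projects in one call is not stated here, so `limit` is left as a parameter.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="model-evaluation-diagnostics",
              limit=20):
    """Build the request URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and decode the body as JSON.

    Assumption: the body is UTF-8 encoded JSON. Its structure is not
    documented on this page, so it is returned unmodified.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects(build_url())` performs the network request and counts against the daily quota described above, so it is worth caching the result locally instead of re-fetching it.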
| # | Model | Description | Score | Tier |
|---|---|---|---|---|
| 1 | LoicGrobol/zeldarose | Train transformer-based models. | 51 | Established |
| 2 | CPJKU/wechsel | Code for WECHSEL: Effective initialization of subword embeddings for... | | Emerging |
| 3 | yuanzhoulvpi2017/zero_nlp | Chinese NLP solutions (large models, data, models, training, inference) | | Emerging |
| 4 | minggnim/nlp-models | A repository for training transformer based models | | Emerging |
| 5 | IntelLabs/nlp-architect | A model library for exploring state-of-the-art deep learning topologies and... | | Emerging |
| 6 | MahmoudWahdan/dialog-nlu | Tensorflow and Keras implementation of the state of the art researches in... | | Emerging |
| 7 | yuanzhoulvpi2017/quick_sentence_transformers | sentence-transformers to ONNX: makes SBERT model inference faster | | Emerging |
| 8 | ukairia777/tensorflow-nlp-tutorial | Uses TensorFlow to cover everything from text preprocessing to recent models such as topic models, BERT, GPT, and LLMs, including downstream... | | Emerging |
| 9 | soldni/pyterrier_sentence_transformers | Create PyTerrier compatible dense indices using any sentence_transformers model | | Emerging |
| 10 | HarderThenHarder/transformers_tasks | ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,... | | Emerging |
| 11 | g8a9/ferret | A python package for benchmarking interpretability techniques on Transformers. | | Emerging |
| 12 | sinanuozdemir/oreilly-bert-nlp | This repository contains code for the O'Reilly Live Online Training for BERT | | Emerging |
| 13 | Azure/nlp-samples | Japanese NLP sample codes | | Emerging |
| 14 | ManashJKonwar/NLP-Transformers | Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks | | Emerging |
| 15 | rajaswa/indic-syntax-evaluation | Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages | | Emerging |
| 16 | shunk031/allennlp-shiba-model | AllenNLP integration for Shiba: Japanese CANINE model | | Emerging |
| 17 | ropensci/pangoling | An R package for estimating the log-probabilities of words in a given... | | Emerging |
| 18 | prajjwal1/generalize_lm_nli | Code for the EMNLP 2021 workshop paper "Generalization in NLI: Ways... | | Emerging |
| 19 | VirtualRoyalty/gan-plus-nlp | Generative adversarial approach to most popular NLP tasks | | Emerging |
| 20 | matteomedioli/BERT-KG | Enriching Language Models Representations via Knowledge Graphs Regularisation | | Emerging |
| 21 | Nickil21/weakly-supervised-parsing | Official Code for our Findings of ACL 2022 paper: Co-training an... | | Emerging |
| 22 | stevezheng23/fewshot_nlp_pt | Few-shot NLP in PyTorch | | Emerging |
| 23 | CyberAgentAILab/japanese-nli-model | This repository provides the code for Japanese NLI model, a fine-tuned... | | Emerging |
| 24 | th789/mbr-for-nmt | Characterizing the performance of minimum Bayes risk (MBR) decoding for... | | Emerging |
| 25 | ai-forever/model-zoo | NLP model zoo for Russian | | Experimental |
| 26 | yucc2018/share | Some shared code practice examples. | | Experimental |
| 27 | Beomi/transformers-language-modeling | Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3 | | Experimental |
| 28 | proycon/deepfrog | An NLP-suite powered by deep learning | | Experimental |
| 29 | TRISTAN-ORF/RiboTIE | Scripts and instructions to apply RiboTIE on Ribo-seq data | | Experimental |
| 30 | ishan00/meta-learning-for-multi-task-multilingual | Official Repository for the paper titled "Meta-Learning for Effective... | | Experimental |
| 31 | DFKI-NLP/gevalm | Code and data for the paper "Evaluating German Transformer Language Models... | | Experimental |
| 32 | hppRC/simple-simcse-ja | Exploring Japanese SimCSE | | Experimental |
| 33 | zhestyatsky/MCL-WiC | Research on Multilingual and Cross-lingual Word-in-Context Disambiguation | | Experimental |
| 34 | SapienzaNLP/xl-wsd-code | Code to train and test Word Sense Disambiguation models based on different... | | Experimental |
| 35 | princeton-nlp/MultilingualAnalysis | Repository for the paper titled: "When is BERT Multilingual? Isolating... | | Experimental |
| 36 | RobinSmits/Dutch-NLP-Experiments | This repository contains a number of experiments with Multi Lingual... | | Experimental |
| 37 | brihijoshi/granular-similarity-COLING-2020 | Code for the paper "The Devil is in the Details: Evaluating Limitations of... | | Experimental |
| 38 | iamlxb3/UMAMGT | Code for the publication of LREC'22 | | Experimental |
| 39 | HannaAbiAkl/PSYCHIC | The official repository for the PSYCHIC model | | Experimental |
| 40 | TRISTAN-ORF/RiboTIE_article | Scripts run to produce the RiboTIE paper | | Experimental |
| 41 | skomban/seq-unscrambler | Unscrambles shuffled letters in a word sequence. | | Experimental |
| 42 | SambhawDrag/XLNet.jl | A Julia-based implementation of XLNet: A Generalized Autoregressive... | | Experimental |
| 43 | bglid/haitian-creole-nlu | Project designed to reimplement and build upon CreoleVal's Reading... | | Experimental |
| 44 | loubnabnl/canine-mednli | CANINE for Medical Natural Language Inference on MedNLI data, as part of the... | | Experimental |
| 45 | aarnetalman/nli-with-transformers | Fine-tune transformers with NLI data | | Experimental |
| 46 | mhdr3a/transformers-diagnostics | Model Evaluation using SuperGLUE Diagnostic Dataset | | Experimental |
| 47 | DudalaShrujana/nlp-transformers-toolkit | Modular NLP pipeline utilizing Hugging Face Transformers for Sentiment... | | Experimental |
| 48 | mhdr3a/transformers-snli | Model Evaluation using SNLI Development Set | | Experimental |
| 49 | tranquoctrinh/huggingface-transformers-examples | Fine-tuning (or training from scratch) the library models for language... | | Experimental |
| 50 | mhdr3a/transformers-hans | Adversarial evaluation of model performances [Updated] | | Experimental |
| 51 | mhdr3a/transformers-wanli | Model Evaluation using WANLI Test Set | | Experimental |
| 52 | MatteoFasulo/Transformers-Sentence-Reconstruction | Sentence Reconstruction using Transformer Model | | Experimental |
| 53 | Evfidiw/LMs_NLU | Exploring different language models on text classification tasks. | | Experimental |
| 54 | chloeskt/nlp_ensae | Final project of the Machine Learning for Natural Language Processing at... | | Experimental |
| 55 | Mrpatekful/dialogue-graph | Enriching pre-trained language models with knowledge graphs for dialogue generation. | | Experimental |
| 56 | FaresGh1997/Contexto_3lang | Implementation of Contexto (guess the word game) for three different... | | Experimental |
| 57 | LeonardoEmili/neural-wsd | Neural WSD with Transformers and candidate masking | | Experimental |