Model Evaluation Diagnostics Transformer Models

Tools for systematically evaluating, diagnosing, and benchmarking transformer models across NLI, WSD, and other NLP tasks using standard test sets and evaluation frameworks. Does NOT include general model training, fine-tuning without evaluation focus, or language-specific model overviews.

There are 57 model evaluation diagnostics models tracked. 1 score above 50 (established tier). The highest-rated is LoicGrobol/zeldarose at 51/100 with 28 stars.

Get all 57 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=model-evaluation-diagnostics&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 LoicGrobol/zeldarose

Train transformer-based models.

51
Established
2 CPJKU/wechsel

Code for WECHSEL: Effective initialization of subword embeddings for...

49
Emerging
3 yuanzhoulvpi2017/zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

49
Emerging
4 minggnim/nlp-models

A repository for training transformer based models

49
Emerging
5 IntelLabs/nlp-architect

A model library for exploring state-of-the-art deep learning topologies and...

49
Emerging
6 MahmoudWahdan/dialog-nlu

Tensorflow and Keras implementation of the state of the art researches in...

46
Emerging
7 yuanzhoulvpi2017/quick_sentence_transformers

sentence-transformers to onnx 让sbert模型推理效率更快

46
Emerging
8 ukairia777/tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림...

45
Emerging
9 soldni/pyterrier_sentence_transformers

Create PyTerrier compatible dense indices using any sentence_transformers model

43
Emerging
10 HarderThenHarder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,...

41
Emerging
11 g8a9/ferret

A python package for benchmarking interpretability techniques on Transformers.

39
Emerging
12 sinanuozdemir/oreilly-bert-nlp

This repository contains code for the O'Reilly Live Online Training for BERT

36
Emerging
13 Azure/nlp-samples

Japanese NLP sample codes

36
Emerging
14 ManashJKonwar/NLP-Transformers

Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks

34
Emerging
15 rajaswa/indic-syntax-evaluation

Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages

32
Emerging
16 shunk031/allennlp-shiba-model

AllenNLP integration for Shiba: Japanese CANINE model

32
Emerging
17 ropensci/pangoling

An R package for estimating the log-probabilities of words in a given...

31
Emerging
18 prajjwal1/generalize_lm_nli

Code for the paper EMNLP 2021 workshop paper "Generalization in NLI: Ways...

31
Emerging
19 VirtualRoyalty/gan-plus-nlp

Generative adversarial approach to most popular NLP tasks

31
Emerging
20 matteomedioli/BERT-KG

Enriching Language Models Representations via Knowledge Graphs Regularisation

31
Emerging
21 Nickil21/weakly-supervised-parsing

Official Code for our Findings of ACL 2022 paper: Co-training an...

31
Emerging
22 stevezheng23/fewshot_nlp_pt

Few-shot NLP in PyTorch

31
Emerging
23 CyberAgentAILab/japanese-nli-model

This repository provides the code for Japanese NLI model, a fine-tuned...

30
Emerging
24 th789/mbr-for-nmt

Characterizing the performance of minimum Bayes risk (MBR) decoding for...

30
Emerging
25 ai-forever/model-zoo

NLP model zoo for Russian

29
Experimental
26 yucc2018/share

一些代码实践分享。

28
Experimental
27 Beomi/transformers-language-modeling

Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3

28
Experimental
28 proycon/deepfrog

An NLP-suite powered by deep learning

28
Experimental
29 TRISTAN-ORF/RiboTIE

Scripts and instructions to apply RiboTIE on Ribo-seq data

27
Experimental
30 ishan00/meta-learning-for-multi-task-multilingual

Official Repository for the paper titled "Meta-Learning for Effective...

26
Experimental
31 DFKI-NLP/gevalm

Code and data for the paper "Evaluating German Transformer Language Models...

25
Experimental
32 hppRC/simple-simcse-ja

Exploring Japanese SimCSE

24
Experimental
33 zhestyatsky/MCL-WiC

Research on Multilingual and Cross-lingual Word-in-Context Disambiguation

23
Experimental
34 SapienzaNLP/xl-wsd-code

Code to train and test Word Sense Disambiguation models based on different...

22
Experimental
35 princeton-nlp/MultilingualAnalysis

Repository for the paper titled: "When is BERT Multilingual? Isolating...

21
Experimental
36 RobinSmits/Dutch-NLP-Experiments

This repository contains a number of experiments with Multi Lingual...

20
Experimental
37 brihijoshi/granular-similarity-COLING-2020

Code for the paper "The Devil is in the Details: Evaluating Limitations of...

20
Experimental
38 iamlxb3/UMAMGT

Code for the publication of LREC'22

19
Experimental
39 HannaAbiAkl/PSYCHIC

The official repository for the PSYCHIC model

19
Experimental
40 TRISTAN-ORF/RiboTIE_article

Scripts run to produce the RiboTIE paper

19
Experimental
41 skomban/seq-unscrambler

Unscrambles shuffled letters in a word sequence.

18
Experimental
42 SambhawDrag/XLNet.jl

A Julia-based implementation of XLNet: A Generalized Autoregressive...

17
Experimental
43 bglid/haitian-creole-nlu

Project designed to reimplement and build upon CreoleVal's Reading...

17
Experimental
44 loubnabnl/canine-mednli

CANINE for Medical Natural Language Inference on MedNLI data, as part of the...

17
Experimental
45 aarnetalman/nli-with-transformers

Fine-tune transformers with NLI data

15
Experimental
46 mhdr3a/transformers-diagnostics

Model Evaluation using SuperGLUE Diagnostic Dataset

13
Experimental
47 DudalaShrujana/nlp-transformers-toolkit

ModularNLP pipeline utilizing Hugging Face Transformers for Sentiment...

13
Experimental
48 mhdr3a/transformers-snli

Model Evaluation using SNLI Development Set

11
Experimental
49 tranquoctrinh/huggingface-transformers-examples

Fine-tuning (or training from scratch) the library models for language...

11
Experimental
50 mhdr3a/transformers-hans

Adversarial evaluation of model performances [Updated]

11
Experimental
51 mhdr3a/transformers-wanli

Model Evaluation using WANLI Test Set

11
Experimental
52 MatteoFasulo/Transformers-Sentence-Reconstruction

Sentence Reconstruction using Transformer Model

11
Experimental
53 Evfidiw/LMs_NLU

Exploring different language models on text classification tasks.

11
Experimental
54 chloeskt/nlp_ensae

Final project of the Machine Learning for Natural Language Processing at...

11
Experimental
55 Mrpatekful/dialogue-graph

Enriching pre-trained language models with knowledge graphs for dialogue generation.

11
Experimental
56 FaresGh1997/Contexto_3lang

implementation of Contexto (guess the word game) for three diffrent...

10
Experimental
57 LeonardoEmili/neural-wsd

Neural WSD with Transformers and candidate masking

10
Experimental