THU-KEG/COPEN
The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".
This project provides a benchmark to evaluate how well Pre-trained Language Models (PLMs) understand concepts, not just words. It takes a PLM and a set of conceptual tasks (such as judging conceptual similarity or properties) as input, and its output helps researchers understand whether the PLM grasps human-like conceptual knowledge. It is aimed at AI researchers and language model developers who want to analyze and improve their models' cognitive abilities.
No commits in the last 6 months.
Use this if you are an AI researcher or developer working on Pre-trained Language Models and need a standardized way to test their conceptual understanding, beyond basic linguistic tasks.
Not ideal if you are looking for a tool to directly apply language models for downstream applications like text generation, summarization, or translation.
Stars: 21
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Mar 09, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/THU-KEG/COPEN"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
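The curl command above can also be issued from Python. The sketch below is a minimal example using only the standard library; the response is assumed to be JSON, and the `Authorization: Bearer` header used for the optional API key is an assumption, since the service's authentication scheme is not documented here.

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality/nlp"


def build_url(owner: str, repo: str) -> str:
    """Construct the quality-data endpoint URL for a GitHub repository."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, api_key: str = "") -> dict:
    """Fetch repository quality data (assumed to be a JSON object).

    Passing an API key raises the daily rate limit; the header name
    used here is an assumption, not documented by the service.
    """
    req = urllib.request.Request(build_url(owner, repo))
    if api_key:
        req.add_header("Authorization", f"Bearer {api_key}")  # header name assumed
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Equivalent of: curl "https://pt-edge.onrender.com/api/v1/quality/nlp/THU-KEG/COPEN"
    print(build_url("THU-KEG", "COPEN"))
```

`fetch_quality` performs a live network request, so it may fail when the service is unreachable or the daily quota is exhausted.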
Higher-rated alternatives
ymcui/cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
thunlp/MultiRD
Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"
princeton-nlp/DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval...
IndexFziQ/KMRC-Papers
A list of recent papers regarding knowledge-based machine reading comprehension.
danqi/rc-cnn-dailymail
CNN/Daily Mail Reading Comprehension Task