hitz-zentroa/This-is-not-a-Dataset

We introduce a large, semi-automatically generated dataset of ~400,000 descriptive sentences about commonsense knowledge, each of which can be true or false. Negation is present, in different forms, in about 2/3 of the corpus. We use the dataset to evaluate LLMs.

Experimental

This project provides a dataset of nearly 400,000 sentences about everyday knowledge, about two-thirds of which contain negation (e.g., "a cat is not a dog"). It helps researchers and AI developers evaluate how well large language models (LLMs) understand and process negation. Given an LLM, the benchmark measures how reliably the model judges whether statements, negated or not, are true or false.
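As a rough illustration of this kind of evaluation, the sketch below scores a true/false classifier separately on negated and affirmative sentences. It does not use the project's own code or data: the records, the field names (`sentence`, `label`, `negated`), and the `always_true` baseline are all hypothetical stand-ins, and a real run would replace the baseline with a call to the model under test.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable

# Hypothetical records. The field names ("sentence", "label", "negated") are
# assumptions made for this sketch, not the dataset's actual schema.
EXAMPLES = [
    {"sentence": "A cat is an animal.", "label": True, "negated": False},
    {"sentence": "A cat is not a dog.", "label": True, "negated": True},
    {"sentence": "A cat is not an animal.", "label": False, "negated": True},
    {"sentence": "A cat is a dog.", "label": False, "negated": False},
]


def accuracy_by_negation(
    examples: Iterable[dict], predict: Callable[[str], bool]
) -> Dict[str, float]:
    """Return accuracy for the 'negated' and 'affirmative' groups separately."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for ex in examples:
        group = "negated" if ex["negated"] else "affirmative"
        total[group] += 1
        if predict(ex["sentence"]) == ex["label"]:
            correct[group] += 1
    return {group: correct[group] / total[group] for group in total}


def always_true(sentence: str) -> bool:
    """Trivial baseline standing in for an LLM call (hypothetical)."""
    return True


scores = accuracy_by_negation(EXAMPLES, always_true)
print(scores)
```

Reporting the two groups separately makes it visible when a model does well on affirmative statements but fails once negation is introduced, which a single pooled accuracy figure would hide.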

No commits in the last 6 months.

Use this if you are an AI researcher or developer building or evaluating large language models and need a robust benchmark for negation understanding.

Not ideal if you are looking for a dataset for general language understanding or for training models on tasks unrelated to logical negation in commonsense knowledge.

AI evaluation natural-language-processing language-model-benchmarking computational-linguistics commonsense-reasoning

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 6 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started...

arcee-ai/mergekit

Tools for merging pretrained large language models.

changyeyu/LLM-RL-Visualized

🌟100+ original LLM / RL principle diagrams📚, a major contribution from the author of 《大模型算法》 (Large Model Algorithms)!💥 (100+ LLM/RL Algorithm Maps)

mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM
