hitz-zentroa/This-is-not-a-Dataset
We introduce a large semi-automatically generated dataset of ~400,000 descriptive sentences about commonsense knowledge that can be true or false in which negation is present in about 2/3 of the corpus in different forms that we use to evaluate LLMs
This project provides a large dataset of nearly 400,000 sentences about everyday knowledge that include negation (e.g., "a cat is not a dog"). It helps researchers and AI developers evaluate how well large language models (LLMs) understand and process negation. You input an LLM, and it outputs an evaluation of the LLM's ability to correctly determine if negated statements are true or false.
No commits in the last 6 months.
Use this if you are an AI researcher or developer building or evaluating large language models and need a robust benchmark for negation understanding.
Not ideal if you are looking for a dataset for general language understanding or for training models on tasks unrelated to logical negation in commonsense knowledge.
Stars
13
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
May 13, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/hitz-zentroa/This-is-not-a-Dataset"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started...
arcee-ai/mergekit
Tools for merging pretrained large language models.
changyeyu/LLM-RL-Visualized
๐100+ ๅๅ LLM / RL ๅ็ๅพ๐๏ผใๅคงๆจกๅ็ฎๆณใไฝ่ ๅทจ็ฎ๏ผ๐ฅ๏ผ100+ LLM/RL Algorithm Maps ๏ผ
mindspore-lab/step_into_llm
MindSpore online courses: Step into LLM