dmlls/cannot-dataset

CANNOT: Compilation of ANnotated, Negation-Oriented Text-pairs

12
/ 100
Experimental

This project provides a specialized dataset to help you evaluate how well language models understand negation in text. It takes in pairs of sentences, some of which contain negated meanings (like "will" vs. "won't") and some that are paraphrases without negation, and outputs a clear label indicating if negation is present. This is designed for researchers and practitioners who develop or test natural language processing (NLP) systems, especially those focused on text generation and understanding.

No commits in the last 6 months.

Use this if you need to specifically test and improve the ability of your natural language processing models to correctly interpret and handle negated statements.

Not ideal if your primary goal is general sentiment analysis or semantic similarity without a specific focus on negation.

natural-language-processing text-analysis language-model-evaluation computational-linguistics negation-detection
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 0 / 25

How are scores calculated?

Stars

8

Forks

Language

Python

License

Last pushed

Jul 25, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dmlls/cannot-dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.