dmlls/cannot-dataset
CANNOT: Compilation of ANnotated, Negation-Oriented Text-pairs
This project provides a specialized dataset to help you evaluate how well language models understand negation in text. It takes in pairs of sentences, some of which contain negated meanings (like "will" vs. "won't") and some that are paraphrases without negation, and outputs a clear label indicating if negation is present. This is designed for researchers and practitioners who develop or test natural language processing (NLP) systems, especially those focused on text generation and understanding.
No commits in the last 6 months.
Use this if you need to specifically test and improve the ability of your natural language processing models to correctly interpret and handle negated statements.
Not ideal if your primary goal is general sentiment analysis or semantic similarity without a specific focus on negation.
Stars
8
Forks
—
Language
Python
License
—
Category
Last pushed
Jul 25, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dmlls/cannot-dataset"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jenojp/negspacy
spaCy pipeline object for negating concepts in text
Mascerade/supervised-product-matching
⚖️ Neural network for product matching, aka classifying whether two product titles represent the...
dmlls/negate
negate_sentence(A Python module that doesn't negate sentences.)
sugatagh/E-commerce-Text-Classification
Proper categorization of e-commerce products enhances the user experience and achieves better...
MaggieMeow/neko
Product categorization using sequence-to-sequence models and others.