surrey-nlp/S3D
This repository contains our sarcasm annotated datasets along with notebooks to use our fine-tuned language models for our EMNLP 2022 Workshop Paper: "Utilizing Weak Supervision to Create S3D: A Sarcasm Annotated Dataset"
This project offers specialized datasets and pre-trained models to help you automatically detect sarcasm in social media text. It provides sets of Twitter posts, each labeled as sarcastic or not, that you can use as input. The output is a tool to improve the accuracy of understanding social media conversations. This is for researchers, data scientists, or analysts working with social media data.
No commits in the last 6 months.
Use this if you need high-quality, pre-labeled social media text data (specifically tweets) to train or evaluate models for identifying sarcasm.
Not ideal if you need to detect sarcasm in languages other than English or in very different text formats like formal documents or long-form articles.
Stars
7
Forks
1
Language
Jupyter Notebook
License
CC-BY-SA-4.0
Category
Last pushed
Jan 21, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/surrey-nlp/S3D"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Hironsan/HateSonar
Hate Speech Detection Library for Python.
t-davidson/hate-speech-and-offensive-language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive...
franciellevargas/HateBR
HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for...
rishabhmisra/News-Headlines-Dataset-For-Sarcasm-Detection
High quality dataset for the task of Sarcasm Detection
b4k0/CBDA
Cyber Bullying Detection Application (CBDA)