luciamariaalvarezcrespo/GalMisoCorpus2023
:bookmark_tabs: Galician corpus for misogyny detection
This project provides a unique collection of social media posts (tweets and toots) in Galician, specifically curated to identify misogynistic content. Researchers and computational linguists studying online hate speech can use this corpus as input to train and evaluate models, with the output being a system capable of detecting misogyny in Galician text. It is designed for academics and experts focused on digital humanities and natural language processing in less-resourced languages.
No commits in the last 6 months.
Use this if you are a researcher focused on detecting hate speech or misogyny in the Galician language on social media platforms.
Not ideal if you are looking for a pre-built, ready-to-deploy tool for live content moderation or if your primary interest is in languages other than Galician.
Stars
17
Forks
—
Language
Python
License
MPL-2.0
Category
Last pushed
Jul 17, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/luciamariaalvarezcrespo/GalMisoCorpus2023"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DerwenAI/pytextrank
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
Tiiiger/bert_score
BERT score for text generation
BrikerMan/Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for...
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. ...
yohasebe/wp2txt
A command-line tool to extract plain text from Wikipedia dumps with category and section filtering