luciamariaalvarezcrespo/GalMisoCorpus2023

:bookmark_tabs: Galician corpus for misogyny detection

/ 100

Experimental

This project provides a unique collection of social media posts (tweets and toots) in Galician, specifically curated to identify misogynistic content. Researchers and computational linguists studying online hate speech can use this corpus as input to train and evaluate models, with the output being a system capable of detecting misogyny in Galician text. It is designed for academics and experts focused on digital humanities and natural language processing in less-resourced languages.

No commits in the last 6 months.

Use this if you are a researcher focused on detecting hate speech or misogyny in the Galician language on social media platforms.

Not ideal if you are looking for a pre-built, ready-to-deploy tool for live content moderation or if your primary interest is in languages other than Galician.

social-media-analysis computational-linguistics hate-speech-detection Galician-language digital-humanities

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MPL-2.0

Higher-rated alternatives

DerwenAI/pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

Tiiiger/bert_score

BERT score for text generation

BrikerMan/Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for...

asyml/texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. ...

yohasebe/wp2txt

A command-line tool to extract plain text from Wikipedia dumps with category and section filtering

Explore NLP Tools

All categories Trending NLP directory Insights