rezacsedu/Bengali-Hate-Speech-Dataset

Dataset for identifying potential hates (e.g., political, religious, personal, gender abusive, geopolitical, etc.) for under-resourced Bengali language.

41
/ 100
Emerging

This dataset helps researchers and computational linguists analyze and detect hate speech specifically in the Bengali language. It takes raw Bengali text from various sources like news, social media, and books, and provides annotated samples categorized into different types of hate speech. Researchers, especially those focused on natural language processing for under-resourced languages, would use this to train and evaluate hate speech detection models.

No commits in the last 6 months.

Use this if you are developing or evaluating automated systems to identify and classify hate speech in Bengali language content.

Not ideal if you need a dataset for general Bengali text classification or if you are not working on hate speech detection.

computational-linguistics hate-speech-detection natural-language-processing bengali-language-research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

29

Forks

16

Language

License

MIT

Last pushed

Apr 14, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/rezacsedu/Bengali-Hate-Speech-Dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.