dimitreOliveira/Jigsaw-Multilingual-Toxic-Comment-Classification

:3rd_place_medal: (Bronze medal - 100th place - Top 7%) Repository for the "Jigsaw Multilingual Toxic Comment Classification" Kaggle competition.

/ 100

Experimental

This project helps online communities and content moderators automatically identify toxic comments in multiple languages to foster healthier discussions. It takes in user-generated text from online forums or social media, regardless of the language (English, Portuguese, Russian, French, Italian, Spanish), and determines if the comment is toxic or not. This is for anyone managing online platforms who needs to filter out disrespectful or offensive contributions.

No commits in the last 6 months.

Use this if you need to implement a system for automatically detecting and flagging toxic content across multilingual online platforms, helping to maintain a positive environment.

Not ideal if you need to classify nuanced forms of content beyond general toxicity or if your primary focus is on a single language.

content-moderation online-community-management brand-safety social-media-management digital-trust-and-safety

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

unitaryai/detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built...

kensk8er/chicksexer

A Python package for gender classification.

Infinitode/ValX

ValX is an open-source Python package for text cleaning tasks, including profanity detection and...

dimitreOliveira/Jigsaw-Multilingual-Toxic-Comment-Classification

Higher-rated alternatives

Explore NLP Tools