gokriznastic/20-newsgroups_text-classification

"20 newsgroups" dataset - Text Classification using Multinomial Naive Bayes in Python.

36
/ 100
Emerging

This project helps classify text documents into predefined categories, much like sorting incoming emails or articles into relevant folders. You feed it a collection of text documents, and it outputs which category each document belongs to. This is useful for anyone who needs to automatically organize or understand large volumes of text, such as researchers analyzing public discussions or content managers categorizing news articles.

No commits in the last 6 months.

Use this if you need a basic, clear example of how to automatically sort text documents into established categories using common machine learning techniques.

Not ideal if you need to classify documents into categories that are not already known or if you are looking for advanced, state-of-the-art text classification models.

document-categorization news-analysis content-sorting information-organization text-mining
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 20 / 25

How are scores calculated?

Stars

42

Forks

32

Language

Jupyter Notebook

License

Last pushed

Nov 02, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/gokriznastic/20-newsgroups_text-classification"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.