gokriznastic/20-newsgroups_text-classification
"20 newsgroups" dataset - Text Classification using Multinomial Naive Bayes in Python.
This project helps classify text documents into predefined categories, much like sorting incoming emails or articles into relevant folders. You feed it a collection of text documents, and it outputs which category each document belongs to. This is useful for anyone who needs to automatically organize or understand large volumes of text, such as researchers analyzing public discussions or content managers categorizing news articles.
No commits in the last 6 months.
Use this if you need a basic, clear example of how to automatically sort text documents into established categories using common machine learning techniques.
Not ideal if you need to classify documents into categories that are not already known or if you are looking for advanced, state-of-the-art text classification models.
Stars
42
Forks
32
Language
Jupyter Notebook
License
—
Category
Last pushed
Nov 02, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/gokriznastic/20-newsgroups_text-classification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
angelosalatino/cso-classifier
Python library that classifies content from scientific papers with the topics of the Computer...
newsgac/platform
Platform for machine learning experiments developed in the project NEWSGAC
giuseppebonaccorso/Reuters-21578-Classification
Text classification with Reuters-21578 datasets using Gensim Word2Vec and Keras LSTM
aqibsaeed/Research-Paper-Categorization
Research paper classification using machine learning and NLP
tblock/10kGNAD
Ten Thousand German News Articles Dataset for Topic Classification