News Category Classification NLP Tools

Projects for classifying news articles into predefined categories (business, sports, politics, entertainment, tech, etc.) using NLP and machine learning. This category does not include sentiment analysis, fake news detection, topic modeling, or document classification for non-news domains.

There are 109 news category classification tools tracked. One scores above 50 (Established tier). The highest-rated is angelosalatino/cso-classifier at 63/100 with 95 stars.

Get all 109 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=news-category-classification&limit=20"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
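The same request can be made from a script. The following is a minimal Python sketch using only the standard library; it rebuilds the URL from the curl example and decodes the response as JSON. The function names `build_url` and `fetch_projects` are illustrative, not part of the API. The structure of the returned JSON is not documented on this page, so the sketch returns the decoded payload as-is. The `limit=20` default is copied from the curl example; whether a larger value returns all 109 projects in one call is an assumption to verify.

```python
import json
import urllib.parse
import urllib.request

# Endpoint and query parameters copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the request URL; `limit` defaults to the value in the curl example."""
    params = {
        "domain": "nlp",
        "subcategory": "news-category-classification",
        "limit": limit,
    }
    return BASE_URL + "?" + urllib.parse.urlencode(params)


def fetch_projects(limit=20, timeout=30):
    """Send the GET request and return the decoded JSON payload unchanged.

    The payload's shape is not described on this page, so nothing is
    assumed about its keys here.
    """
    with urllib.request.urlopen(build_url(limit), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage (performs a network call, so it is left commented out):
# data = fetch_projects()
# print(json.dumps(data, indent=2)[:500])
```

Without a key, each call counts against the 100 requests/day allowance mentioned above, so it is worth caching the result locally when experimenting.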

# Tool Description Score Tier
1 angelosalatino/cso-classifier

Python library that classifies content from scientific papers with the...

63
Established
2 newsgac/platform

Platform for machine learning experiments developed in the project NEWSGAC

46
Emerging
3 giuseppebonaccorso/Reuters-21578-Classification

Text classification with Reuters-21578 datasets using Gensim Word2Vec and Keras LSTM

45
Emerging
4 aqibsaeed/Research-Paper-Categorization

Research paper classification using machine learning and NLP

43
Emerging
5 tblock/10kGNAD

Ten Thousand German News Articles Dataset for Topic Classification

42
Emerging
6 timthedev07/news-articles-classification

Classification model for news articles with deep learning.

41
Emerging
7 mdh266/TextClassificationApp

Building and Deploying A Serverless Text Classification Web App

41
Emerging
8 NirantK/Hinglish

Hinglish Text Classification

40
Emerging
9 yassersouri/classify-text

"20 Newsgroups" text classification with python

40
Emerging
10 sndsabin/Nepali-News-Classifier

Text Classification of Nepali Language Document. This Mini Project was done...

39
Emerging
11 vectorsss/news_classification

Convolutional neural network and web crawler for automatically scraping and classifying NetEase News

39
Emerging
12 elleros/text-classification-challenge

Supervised text classification

38
Emerging
13 FantacherJOY/Arabic-text-classification

Arabic text documents classified using SVM, k-NN and Naive Bayes classifiers.

38
Emerging
14 uhh-lt/BlurbGenreCollection-HMC

Hierarchical multi-label text classification of the BlurbGenreCollection...

38
Emerging
15 yutkin/News-Aggregator

Classification and aggregation of Russian news articles. University coursework.

37
Emerging
16 saidziani/Arabic-News-Article-Classification

Automatic categorization of documents, consists in assigning a category to a...

37
Emerging
17 tomeliud/Natural-Language-Processings-NLP-on-Swahili-News-Classification-

Develop a multi-class classification model to classify news content...

36
Emerging
18 jdvala/lazytext

LazyText is inspired by the idea of lazypredict, a library which helps build...

36
Emerging
19 gokriznastic/20-newsgroups_text-classification

"20 newsgroups" dataset - Text Classification using Multinomial Naive Bayes...

36
Emerging
20 aditya00kumar/document-classification

This project is an attempt to provide a generic pipeline for document...

34
Emerging
21 PetropoulakisPanagiotis/text-classification

Text Classification using scikit-learn. Classify BBC articles.

33
Emerging
22 searchisko/project-classifier-poc

Searchisko: A semantic search service over categorised content.

33
Emerging
23 urigoren/nlp_classification_workshop

NLP Classification Workshop

33
Emerging
24 jarif87/movie-tv-genre-detector

Building a Multilabel Movie and TV Show Classifier with Blurr

32
Emerging
25 sayakpaul/Generating-categories-from-arXiv-paper-titles

This project takes the arXiv dataset and builds an automatic tag classifier...

32
Emerging
26 Fresearch99/patent_classification_project

Python collection of programs to classify and identify patents that are...

32
Emerging
27 saurabh1907/document-classification-ml-nlp

Document classification into four defined categories (World, Sports,...

31
Emerging
28 sudharsan13296/Document-Classification-using-LSA

Document classification using Latent semantic analysis in python

31
Emerging
29 shashankvmaiya/Movie-Genre-Multi-Label-Text-Classification

Multi Label Text Classification of movie genres using its plot

31
Emerging
30 Davisy/Swahili-News-Classifier-Streamlit-App

A simple Streamlit App to classify Swahili news into different categories.

31
Emerging
31 rootally/News-Category-Classification-with-BERT

Identify the type of news based on headlines and short descriptions

31
Emerging
32 yihong1120/Reuters-News-Classification-and-Analysis

Train a model to categorize news articles, scrape and translate articles,...

31
Emerging
33 deepklarity/poster2plot

Generate movie/t.v show plot from a poster

31
Emerging
34 msi1427/MultiLabel-Book-Genre-Classifier

A multi-label text classifier that can classify 141 different book genres...

31
Emerging
35 yogeshnile/Movies-Genre-Classification

In this repo I have created a Movies Genre Classification project in machine...

30
Emerging
36 diem-ai/text-classification

The project aims to apply Naive Bayes on TF-IDF and Word2Vec models. Use...

30
Emerging
37 Davisy/Swahili-News-Classifier-API

An API developed by using the FastAPI library and uses an NLP model to...

30
Emerging
38 cssturkiye/migration-tr

Migration-TR: Turkish migration discourse dataset with 6M tweets...

30
Emerging
39 shibuiwilliam/aozora_classification

This project aims to classify Japanese sentences by how similar they are to some...

29
Experimental
40 ViralLab/TurkishBERTweet

TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis

29
Experimental
41 rcmckee/Document-Classification

Patent Classification with Machine Learning

29
Experimental
42 SpringerNLP/Chapter3

Chapter 3: Text and Speech Basics

29
Experimental
43 camoverride/language-classifier

Tutorial for building an end-to-end machine learning application! 😊

28
Experimental
44 tomkeith/Multi-label-classification-with-NLP

Genre Genie - Movie Genre Predictor (multi-label classification with NLP)

28
Experimental
45 cawfree/reuters-dataset

🗞️ A tool for downloading and parsing Reuters-21578. These are a collection...

28
Experimental
46 mohamedELBAHA/Text-classification-of-BBC-news-dataset

Application of NLP, word embedding, LSTM, PCA, TSNE.

28
Experimental
47 xiangyi-njust/FWS

Code for the paper: Automatic Recognition and Classification of Future...

27
Experimental
48 dkhd/text-classification

Code examples and presentation slides from Machine Learning ID #2 Meetup - Yogyakarta

27
Experimental
49 Wonuabimbola/movie-genre-prediction

Multiclass Text Classification problem where I used movie plot summaries to...

27
Experimental
50 savasy/TurkishTextClassification

Text classification for the Turkish language

26
Experimental
51 shivam5992/classification_pipeline

A complete document classification pipeline using Apache Spark in Scala

26
Experimental
52 Balakishan77/News-Articles-Classification

This dataset contains around 125k news headlines from the year 2013 to 2018...

26
Experimental
53 sarufi-io/Swahili-sentiment-Analysis-using-transformers

The special repository to demonstrate how you can use transformers for...

25
Experimental
54 Reslan-Tinawi/20-newsgroups-Text-Classification

Implementing text classification algorithms using the 20 newsgroups...

24
Experimental
55 0AlphaZero0/Text-Classification-Data-Citation-EuropePMC

Text classification on Citations in Scientific Papers

24
Experimental
56 tknishh/Text-Classification-Ag-News

The objective of this task is to build a text classification model using the...

23
Experimental
57 J-Mourad/MNAD

The dataset contains +1 Million Moroccan news articles that are classified...

23
Experimental
58 sbera7/Dialogue-act-classification

Dialogue Act classification

23
Experimental
59 Alvant/20-newsgroups-secrets

Some hidden knowledge found in the 20 Newsgroups dataset

22
Experimental
60 robolab-pavia/slrkit_NLP_Nocera

Classification of the NLP dataset - Nocera

22
Experimental
61 TomMakesThings/Movie-Genre-Predictor

Natural language processing (NLP) group project to create a multi-label...

22
Experimental
62 G0rav/Automatic_News_Tagging

Automatic News Categories Segmentation using Machine Learning and NLP

22
Experimental
63 alfagama/arxiv_predicting_paper_category

Predicting categories of scientific papers with advanced machine learning...

22
Experimental
64 hakancangunerli/turkish_binary_classifier

This is a logistic regression binary classification system for classifying...

21
Experimental
65 parsa-abbasi/Persian-News-Classifier

Implementing a news classifier on Persian news dataset

21
Experimental
66 trinhdoduyhungss/classification_sentence_knn_simple

It's an example of a sentence classification project using KNN algorithm.

20
Experimental
67 avestura/PersianNews

📰 My final project for NLP course

20
Experimental
68 ulkudilemre/Turkish-text-author-classification

Author classification of Turkish texts, with 80% accuracy using a linear-kernel SVM

19
Experimental
69 apdullahyayik/Tr-topicter

🔍 A simple topic detector.

19
Experimental
70 ezosa/topic-labelling

Multilingual topic labelling of news topics

19
Experimental
71 X-XENDROME-X/News-Classification-Transformer

Advanced news topic classification system using DistilBERT transformer...

18
Experimental
72 KsLimon/Multilabel-imdb-movie-genre-classifier

A multi-label movie description classifier that can classify 21 different...

18
Experimental
73 AdelAdool/News-Category-Classifier

News Category Classification using AG News dataset. Implements text...

18
Experimental
74 StefanHeng/Personalized-Productivity-Dataset

Personalized Text Classification dataset with transient labels inspired by...

17
Experimental
75 Arian-Rahman/Otaku-Tag

Multi-label category classifier for Anime and Manga

17
Experimental
76 ahmedbendebba1/News-classification-webapp

Train and deploy a News Classifier using Logistic Regression - Serverless container

17
Experimental
77 mbfakourii/NLP-persian-category

NLP persian category with digimag and persian_news Dataset

17
Experimental
78 ardauzunoglu/edebizeka

The periods, movements,... that fall within the boundaries of modern Turkish literature

14
Experimental
79 fatihbarutcu1/turkish-text-classifier

Turkish tweet classifier with 90% accuracy — scikit-learn & Streamlit

14
Experimental
80 TanmoyGG/Dhaka_Tribune-Scraping-and-Classification-XGBoost

An end-to-end R pipeline for scraping, processing, and classifying Dhaka...

14
Experimental
81 chrislemke/nlp-text-classifier

NLP for classifying text. Using word Word2Vec word embedding and a neural...

13
Experimental
82 kurtisdavid/animeSynopsisClassifier

Multilabel classification of synopses from all the anime listed on MyAnimeList.net

12
Experimental
83 NitzanBarzilay/KnessetTopicClassification

A project to create a classification model that classifies quotes from Knesset meeting protocols into eight topics.

12
Experimental
84 Lilykos/classy

A visual analytics environment for supervised text classification and model...

12
Experimental
85 adarshmalviya/News-Classifier

The News Classifier web app was built using concepts of NLP (Natural Language...

12
Experimental
86 meysamraz/bbc_persian_news_classification

Persian News Classification In this notebook, I tried to categorize BBC...

11
Experimental
87 Cody-Lange/Milestone-2-Text-Difficulty-Classifier

Binary text difficulty classification with tf-idf, word2vec, and other...

11
Experimental
88 p-karisani/FirstPHM

A classifier for extracting personal reports from user-generated data

11
Experimental
89 sujik18/EACL-2026-Abjad-NLP

Files for Shared Task 4: Medical Text Classification in Arabic...

11
Experimental
90 PaKicek/News-Classifier

A classifier which collects news via RSS and trains ML models to classify...

11
Experimental
91 Ryohskay/peshitta_cj

Attempt to classify OT Peshitta (a [Classical] Syriac Translation of the Old...

11
Experimental
92 Rahul-404/bbc-news-sorting

📰 BBC News Article Classifier: A project that categorizes BBC News articles...

11
Experimental
93 Simran0192004/Movie_Genre_Classification

A Machine Learning model that can predict the genre of a movie based on its...

11
Experimental
94 Sensente/News_Text_Classification_Demo

This is a demo project for news text classification, part of this work has...

11
Experimental
95 CU-S25-MSDSSB-DSCI-01-DataScienceLab/Movie-Project

Data Science Lab Project Repository run by Matvei Trifanov, Lu Zerong and...

11
Experimental
96 SajjadHossain43/app-multi-label-game-genre-classification-imdb

This project focuses on classifying video games into multiple genres using...

11
Experimental
97 dilohn/movie-revenue-predictor

Predict movie revenue based on movie summary and poster only

11
Experimental
98 BeaEsparcia/spanish-news-classification

News classification project using Natural Language Processing (NLP) and...

11
Experimental
99 youssefdlfk/movie-genre-classification

Multi-label text classification of movie genres based on their synopsis

11
Experimental
100 manik000-zx/text_classifier_ai

AI-powered text classification system built with Python and Streamlit....

11
Experimental
101 esra-polat/turkish-nlp-ngrams-collacations-classifiers-ner-sentiment

Collocations, n-grams, deep learning classifiers, named entity recognition,...

11
Experimental
102 TheNova22/PlotCreator

A movie plot generating python notebook

10
Experimental
103 FabriDeCastelli/HLT-News-Category

42-way classification of news

10
Experimental
104 filippofinke/tutti-category-classification

📒 Attempt to classify ad listings on tutti.ch

10
Experimental
105 MohammedAly22/BBCNewsClassifier

This project is an NLP (Natural Language Processing) application that...

10
Experimental
106 hongyuntw/News-classification-And-Speech-IR

Text classification

10
Experimental
107 SohailaDiab/Hespress-Stories

An Arabic NLP Project.

10
Experimental
108 SimonaMnv/ArachneClassifier

Greek articles crawling and classification

10
Experimental
109 nabeelshan78/document-classification-nlp

Automated document classification system using PyTorch & TorchText. Loads...

10
Experimental