Traditional ML Text Classification NLP Tools

Tools implementing classical machine learning approaches (scikit-learn, naive bayes, SVM, TF-IDF vectorization) for text classification tasks. Does NOT include deep learning methods, neural networks, or transformer-based models like BERT.
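As a rough illustration of the kind of classical approach this category covers, here is a minimal sketch of a multinomial naive Bayes classifier over bag-of-words counts, written in plain Python. It is not taken from any tool listed on this page. The `NaiveBayes` class, the `tokenize` helper, and the toy training data are all hypothetical names and values chosen for the example; real projects would more likely use scikit-learn's vectorizers and estimators.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Hypothetical helper: lowercase and split on whitespace.
    # Real tools typically use richer tokenization and TF-IDF weighting.
    return text.lower().split()


class NaiveBayes:
    """Illustrative multinomial naive Bayes with additive (Laplace) smoothing."""

    def fit(self, texts, labels, alpha=1.0):
        self.alpha = alpha
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # token counts per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            for word in tokenize(text):
                if word in self.vocab:  # ignore tokens never seen in training
                    score += math.log((counts[word] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy data, invented for this example only.
texts = ["cheap pills buy now", "win money now",
         "meeting at noon", "project meeting notes"]
labels = ["spam", "spam", "ham", "ham"]
model = NaiveBayes().fit(texts, labels)
```

Calling `model.predict("buy cheap money")` on this toy data selects the `spam` class, since those tokens only occur in the spam examples.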

51 traditional ML text classification tools are tracked. Two score above 50 (Established tier). The highest-rated is giacbrd/ShallowLearn at 52/100 with 198 stars.

Get all 51 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=traditional-ml-text-classification&limit=20"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
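The same request can be made from a script. The sketch below uses only the Python standard library and reuses the URL and query parameters from the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented on this page, so the code returns the parsed payload without assuming any field names. Note that the example request passes `limit=20`; whether a larger value is accepted in order to cover all 51 projects is an assumption that would need checking against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl example; everything else is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp",
              subcategory="traditional-ml-text-classification",
              limit=20):
    # Assemble the query string with the same parameters as the curl command.
    query = urlencode({"domain": domain,
                       "subcategory": subcategory,
                       "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=10):
    # Performs a network request; the response schema is not assumed here,
    # so the parsed JSON is returned as-is for the caller to inspect.
    with urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.load(resp)
```

A caller could run `data = fetch_projects()` and inspect `data` interactively before relying on any particular keys. Unauthenticated use is subject to the 100 requests/day limit stated above.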

# | Tool | Description | Score | Tier
--- | --- | --- | --- | ---
1 | giacbrd/ShallowLearn | An experiment about re-implementing supervised learning models based on... | 52 | Established
2 | javedsha/text-classification | Machine Learning and NLP: Text Classification using Python, scikit-learn and NLTK | 51 | Established
3 | Wluper/edm | Python package for understanding the difficulty of text classification... | 48 | Emerging
4 | fendouai/Awesome-Text-Classification | Awesome-Text-Classification: projects, papers, tutorials | 46 | Emerging
5 | chicago-justice-project/article-tagging | Natural Language Processing of Chicago news articles | 46 | Emerging
6 | opennlp/Large-Scale-Text-Classification | Large-scale benchmarking of state-of-the-art text vectorizers | 42 | Emerging
7 | eannefawcett/lexile-determination-v2 | NLP to classify a text's Lexile level | 41 | Emerging
8 | FelixHo/Text-Classification-Benchmark | Text classification benchmark | 40 | Emerging
9 | Pritishsingla/nbsvm_sklearn | Scikit-learn style wrapper for the NBSVM algorithm for text classification | 39 | Emerging
10 | scotthlee/document-classification | Simple command-line scripts for document classification | 38 | Emerging
11 | amankedia/Question-Classification | Classifying questions from UIUC's CogComp QC Dataset | 37 | Emerging
12 | akshaybhatia10/Book-Genre-Classification | Classification of books based on titles without prior knowledge of context or author | 36 | Emerging
13 | uosdmlab/playdata-zeppelin-notebook | Zeppelin fire news article classification example | 35 | Emerging
14 | cahya-wirawan/text-classification | Text Classification engine using several algorithms in machine learning | 35 | Emerging
15 | lpq29743/text_classification | Text classification models (including traditional machine learning methods... | 35 | Emerging
16 | cair/TextUnderstandingTsetlinMachine | Using the Tsetlin Machine to learn human-interpretable rules for... | 34 | Emerging
17 | p-karisani/self_pretraining | A classification model | 33 | Emerging
18 | dwulff/text2sdg | Detect UN Sustainable Development Goals in Text | 30 | Emerging
19 | UNDP-Data/dsc-sdgi-corpus | Model benchmarks on SDGi Corpus, a multilingual dataset for text... | 29 | Experimental
20 | madhurimamandal/Text-classification-into-difficulty-levels | Text classification into various reading difficulty levels using various... | 28 | Experimental
21 | muggin/string-kernels | Implementation of the "Text Classification using String Kernels" publication... | 28 | Experimental
22 | kushagra2103/Auto-Tagging-System | The project is based on a multi-label classification problem in NLP. | 28 | Experimental
23 | hlbao/classification_in_CSS | The tutorial on scraping, processing, and classification of text-based... | 27 | Experimental
24 | Authentic10/daxmod | A Python toolbox for text classification | 27 | Experimental
25 | swapkh91/Question-Classification | Classify questions as what, when, who, affirmation, or unknown | 25 | Experimental
26 | andreaschandra/feature-selection-papers | A list of papers on feature selection for text classification | 24 | Experimental
27 | RenzeLou/Text-to-Text-Classification | Investigating the NLP classification tasks, text-to-text vs. text-to-indices. | 23 | Experimental
28 | maty-bohacek/xgboost-vs-gpt4 | Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text... | 23 | Experimental
29 | tim5go/awesome-one-class-classification | A curated list of awesome resources dedicated to One Class Classification. | 23 | Experimental
30 | MJahangeerQureshi/Text-Classification | A collection of text classification methodologies | 23 | Experimental
31 | ARUNAGIRINATHAN-K/Text-Classifier-using-NLP_Techniques | Text classification using NLP techniques. | 22 | Experimental
32 | ZhengZixiang/TCPapers | Worth-reading papers and related resources on text classification.... | 22 | Experimental
33 | Qinzhizhou/NLP-Classfication | Two NLP assignments for sentiment and topic classification | 21 | Experimental
34 | RussianNLP/TAPE | TAPE benchmark | 20 | Experimental
35 | atom-zh/SA_Classification | Multi-label classification of short texts of Science Association; machine learning, neural networks... | 19 | Experimental
36 | YipingNUS/contextual-eval-dataset | Evaluation Dataset for "Bootstrapping Large-Scale Fine-Grained Contextual... | 19 | Experimental
37 | stepthom/autotext | Run AutoML tools on a dataframe with text | 18 | Experimental
38 | SupreethRao99/eXtreme-Classification | eXtreme Classification, classifying gigabytes of natural language text in minutes | 17 | Experimental
39 | unlbslk/python-easy-text-classification | An easy machine learning Python text classification script with scikit-learn | 17 | Experimental
40 | mohammad-oghli/Text-Classification | Classify text into different categories using a trained ML model | 17 | Experimental
41 | BM-K/Question-Difficulty-Estimation | Question Difficulty Estimation | 12 | Experimental
42 | Mariam-Badr-MB/Book-s-genre | Discover hidden micro-genres in books using unsupervised machine learning.... | 12 | Experimental
43 | qiaochen/TextClsApp | A web app for text classification | 12 | Experimental
44 | TextXD/introduction-to-text-classification | Introduction to text classification | 11 | Experimental
45 | davidetaraborrelli/textkd-p1-clean-prep | A very simple baseline for text preprocessing + linear classification with... | 11 | Experimental
46 | manyuzhang1996/Building-SDG-Classifiers-on-Investment-Document-with-Machine-Learning-to-Assist-MultiLabeling | Machine Learning & NLP Project | 11 | Experimental
47 | nadeeshafdo/text-classifier | A simple text classifier for classifying ideas into categories. | 11 | Experimental
48 | carlosparaciari/abstract-classification-embedding | A small project investigating, building on a previous study, to search for... | 11 | Experimental
49 | vera-bernhard/SwissTextSDG | Entry for the shared task at SwissText 2024 - Automatic Classification of... | 10 | Experimental
50 | ertiaM/Text-Classification-of-Course-Designs | Repository for the text classification of course designs | 10 | Experimental
51 | Sahel-Eskandar/Text-Mining-Classification-Analysis | Text mining, text classification, applied ML algorithms | 10 | Experimental