kk7nc/Text_Classification

Text Classification Algorithms: A Survey

51
/ 100
Established

This project provides practical guidance and code examples for preparing text data for classification. It helps anyone working with unstructured text by demonstrating how to clean and standardize documents, turning raw text into a format suitable for analysis. The user persona is a data analyst, researcher, or anyone needing to pre-process text for machine learning or statistical tasks.

1,832 stars. No commits in the last 6 months.

Use this if you need to understand and apply techniques like tokenization, stop word removal, stemming, and lemmatization to clean and prepare text for classification or other NLP tasks.

Not ideal if you are looking for a complete, out-of-the-box text classification model or a platform for deploying NLP solutions, as it focuses on the pre-processing steps rather than the full classification pipeline.

text analytics natural language processing data preparation content analysis information retrieval
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

1,832

Forks

543

Language

Python

License

MIT

Last pushed

Apr 01, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/kk7nc/Text_Classification"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.