kk7nc/Text_Classification
Text Classification Algorithms: A Survey
This project provides practical guidance and code examples for preparing text data for classification. It helps anyone working with unstructured text by demonstrating how to clean and standardize documents, turning raw text into a format suitable for analysis. The user persona is a data analyst, researcher, or anyone needing to pre-process text for machine learning or statistical tasks.
1,832 stars. No commits in the last 6 months.
Use this if you need to understand and apply techniques like tokenization, stop word removal, stemming, and lemmatization to clean and prepare text for classification or other NLP tasks.
Not ideal if you are looking for a complete, out-of-the-box text classification model or a platform for deploying NLP solutions, as it focuses on the pre-processing steps rather than the full classification pipeline.
Stars
1,832
Forks
543
Language
Python
License
MIT
Category
Last pushed
Apr 01, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/kk7nc/Text_Classification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
castorini/hedwig
PyTorch deep learning models for document classification
AnubhavGupta3377/Text-Classification-Models-Pytorch
Implementation of State-of-the-art Text Classification Models in Pytorch
inspirehep/magpie
Deep neural network framework for multi-label text classification
InseeFrLab/torchTextClassifiers
A unified framework for text classification in PyTorch.
davidberenstein1957/classy-classification
This repository contains an easy and intuitive approach to few-shot classification using...