Shubha23/Text-processing-NLP

This notebook contains entire text preprocessing pipeline for NLP problems. The ready-to-use functions require NLTK and SKlearn package installations. It also contains some prominent text classification models.

37
/ 100
Emerging

This project helps data scientists and NLP practitioners quickly prepare text data for analysis. It takes raw text datasets, like those from customer feedback or social media, and transforms them through a series of preprocessing steps. The output is clean, structured text ready for building machine learning models for tasks such as classification.

Use this if you need a pre-built, standardized pipeline to clean and prepare text data for NLP applications without writing all the boilerplate code from scratch.

Not ideal if your dataset is purely numerical, or if you are working on non-text-based classification, regression, or clustering problems.

text-preprocessing NLP-pipeline data-cleaning text-classification machine-learning-preparation
No License No Package No Dependents
Maintenance 6 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 17 / 25

How are scores calculated?

Stars

15

Forks

8

Language

Jupyter Notebook

License

Last pushed

Dec 20, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Shubha23/Text-processing-NLP"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.