jfilter/clean-text

🧹 Python package for text cleaning

53
/ 100
Established

This tool helps anyone working with user-generated content, like social media posts or scraped web data, to clean up messy text. It takes raw, potentially garbled input with strange characters, broken formatting, and unwanted elements, and transforms it into a normalized, readable format. This is ideal for data analysts, researchers, and content managers who need consistent text for further analysis or presentation.

1,004 stars.

Use this if you need to reliably clean and standardize unstructured text data from various online sources before analysis or processing.

Not ideal if your text data is already perfectly clean or if you need highly specialized, domain-specific linguistic parsing beyond general normalization.

data-cleaning text-preprocessing social-media-analysis web-scraping content-moderation
No Package No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 17 / 25

How are scores calculated?

Stars

1,004

Forks

81

Language

Python

License

Last pushed

Jan 28, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jfilter/clean-text"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.