rhnfzl/SqueakyCleanText
Text preprocessing and PII anonymisation for NLP/ML. ONNX NER ensemble, language detection, stopword removal. Built for statistical ML and language models.
Available on PyPI.
Stars
7
Forks
—
Language
Python
License
MIT
Category
Last pushed
Feb 28, 2026
Commits (30d)
0
Dependencies
12
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/rhnfzl/SqueakyCleanText"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
microsoft/presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It...
rushilpatel21/Redactify
Redactify is an efficient data redaction tool that secures sensitive text using advanced NLP and...
zulqarnainalipk/PII-Data-Detection
🔐 NLP-powered pipeline for detecting and removing Personally Identifiable Information (PII) from...
4n33sh/REDACT
REDACT is an info-sec tool that automates redaction with minimal user interaction. It utilizes...