HC200ok/manual-data-masking
A lightweight javascript library for manual data masking
This tool helps you manually identify and obscure sensitive information within text, like customer comments or reviews. You input raw text, mark the sensitive portions, and then receive both a list of the masked data with its category (e.g., "Phone Number") and a new version of the text with the sensitive parts hidden. This is designed for data annotators, content moderators, or anyone preparing text datasets where privacy or compliance requires masking specific details.
No commits in the last 6 months. Available on npm.
Use this if you need to visually select and categorize sensitive data points within text for privacy, compliance, or to create training datasets for automated masking tools.
Not ideal if you need an automated solution for large volumes of text or if you are looking for a backend-only data processing library.
Stars
21
Forks
1
Language
JavaScript
License
MIT
Category
Last pushed
Jul 19, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/HC200ok/manual-data-masking"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DataFog/datafog-python
Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines...
vmenger/deduce
Deduce: de-identification method for Dutch medical text
aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports
seanpedrick-case/doc_redaction
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface....
martincjespersen/DaAnonymization
Simple customizable pipeline tool for anonymizing Danish text.