icescentral/MASK_public
Masking identifiable information from health related documents.
This tool helps healthcare researchers and privacy officers de-identify sensitive patient information from medical documents. You provide raw health-related text files, and it produces redacted versions where names, dates, and other protected health information (PHI) are masked. It's designed for professionals who need to share or analyze clinical data while strictly adhering to patient privacy regulations.
No commits in the last 6 months.
Use this if you need to automatically remove or mask personal health information from a collection of clinical text documents to ensure patient privacy.
Not ideal if you need a solution for non-textual data, such as images or structured database entries, or if you don't require high-throughput batch processing.
Stars
21
Forks
9
Language
Python
License
Apache-2.0
Category
Last pushed
Jun 23, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/icescentral/MASK_public"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DataFog/datafog-python
Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines...
vmenger/deduce
Deduce: de-identification method for Dutch medical text
aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports
seanpedrick-case/doc_redaction
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface....
martincjespersen/DaAnonymization
Simple customizable pipeline tool for anonymizing Danish text.