jftuga/deidentification
Deidentify people's names and gender specific pronouns
This tool helps compliance officers, researchers, and data analysts protect privacy by automatically removing sensitive personal information from text. It takes any text document, email, or report and replaces names and gender-specific pronouns with generic placeholders or custom terms. The result is a de-identified document that maintains readability while safeguarding individual identities.
No commits in the last 6 months.
Use this if you need to share or analyze text data without revealing personal names or gender, such as in legal documents, research transcripts, or customer feedback.
Not ideal if you need to de-identify complex PII beyond names and pronouns, or if you require preservation of gender information in your de-identified text.
Stars
44
Forks
4
Language
Python
License
MIT
Category
Last pushed
May 03, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jftuga/deidentification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DataFog/datafog-python
Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines...
vmenger/deduce
Deduce: de-identification method for Dutch medical text
aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports
seanpedrick-case/doc_redaction
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface....
martincjespersen/DaAnonymization
Simple customizable pipeline tool for anonymizing Danish text.