aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports
This tool helps healthcare professionals and researchers anonymize sensitive patient data within clinical reports. It takes raw clinical text and identifies elements like names, dates, addresses, and hospital identifiers, then removes or replaces them. This ensures patient privacy when sharing data for research or analysis.
Use this if you need to quickly and accurately remove personally identifying information from large volumes of clinical documents to comply with privacy regulations or enable data sharing.
Not ideal if your documents are not clinical reports or if you need to pseudonymize very specialized types of identifying data not covered by the listed categories.
Stars
67
Forks
9
Language
Python
License
—
Category
Last pushed
Feb 05, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/aphp/eds-pseudo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DataFog/datafog-python
Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines...
vmenger/deduce
Deduce: de-identification method for Dutch medical text
seanpedrick-case/doc_redaction
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface....
martincjespersen/DaAnonymization
Simple customizable pipeline tool for anonymizing Danish text.
thoughtbot/top_secret
Filter sensitive information from free text before sending it to external services or APIs, such...