zacharykzhao/CA4P-483
NLP dataset: Chinese Android Privacy Policy Dataset
This dataset provides a collection of Chinese Android application privacy policies, meticulously annotated for fine-grained details related to data privacy. It takes raw Chinese privacy policy text and outputs labeled segments, highlighting compliance information. This is valuable for researchers and legal professionals who analyze mobile app privacy policies.
No commits in the last 6 months.
Use this if you need a specialized dataset to train models or conduct research on identifying privacy policy compliance and specific data handling practices in Chinese Android applications.
Not ideal if you are looking for a dataset of English privacy policies or if your focus is not specifically on mobile application privacy or compliance analysis.
Stars
23
Forks
3
Language
HTML
License
—
Category
Last pushed
Mar 14, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/zacharykzhao/CA4P-483"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DataFog/datafog-python
Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines...
vmenger/deduce
Deduce: de-identification method for Dutch medical text
aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports
seanpedrick-case/doc_redaction
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface....
martincjespersen/DaAnonymization
Simple customizable pipeline tool for anonymizing Danish text.