fraunhoferportugal/pymdma

pymdma

48
/ 100
Emerging

This tool helps researchers and practitioners evaluate the quality of their datasets, whether they are real or synthetically generated. It takes in datasets across various formats like images, tables, or time series, and outputs metrics that describe the data's integrity and usefulness. The ideal users are scientists, data analysts, or machine learning engineers working with diverse data.

Available on PyPI.

Use this if you need to rigorously audit the quality of real-world datasets or assess how well synthetic data replicates real data's characteristics across different data types.

Not ideal if you are looking for a simple, no-code solution for basic data validation or if your primary concern is only data privacy compliance without needing quality metrics.

data-auditing synthetic-data machine-learning-quality dataset-evaluation data-integrity
Maintenance 10 / 25
Adoption 8 / 25
Maturity 25 / 25
Community 5 / 25

How are scores calculated?

Stars

57

Forks

2

Language

Python

License

LGPL-3.0

Last pushed

Feb 06, 2026

Commits (30d)

0

Dependencies

9

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/fraunhoferportugal/pymdma"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.