datacleaner/DataCleaner

The premier open source Data Quality solution

Score: 67 / 100 (Established)

This tool helps businesses, analysts, and data professionals ensure their data is accurate and reliable. You input raw, messy datasets, and it helps you identify inconsistencies, correct errors, and enrich information to produce clean, high-quality data. It's used by anyone who needs to trust their data for reporting, analysis, or operational processes.

647 stars. Actively maintained with 5 commits in the last 30 days.

Use this if you need a versatile solution for ad-hoc data analysis, recurring data cleansing tasks, or managing master data effectively.

Not ideal if you need rapid feature development (the project saw 5 commits in the last 30 days) or a very large, fast-growing community.

Tags: data-quality-management, data-cleansing, data-profiling, master-data-management, data-governance
No package, no dependents
Maintenance 16 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25


Stars: 647
Forks: 183
Language: Java
License: LGPL-3.0
Last pushed: Mar 14, 2026
Commits (30d): 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/datacleaner/DataCleaner"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
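
The curl request above can also be made from a script. The following is a minimal Python sketch using only the standard library. It assumes two things the page does not state: that the endpoint returns JSON, and that the URL path follows a category/owner/repository pattern (inferred from the single example URL). The function names `build_quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the example URL on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, as inferred
    from the one example URL shown above.
    """
    # Percent-encode each segment so unusual characters stay inside it.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record without an API key.

    Assumption: the response body is JSON. Its field names are not
    documented on this page, so the parsed value is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("data-engineering", "datacleaner", "DataCleaner")` issues the same request as the curl command. Each call counts against the 100 requests/day keyless limit, so caching the result locally is advisable when polling.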