J535D165/data-matching-software
A list of free data matching and record linkage software.
This is a curated list of free and open-source tools designed to help you clean and consolidate your data. It addresses the common problem of having slightly different entries that actually refer to the same person, product, or event across various datasets, or even within a single dataset. The list helps you choose software that takes your messy, inconsistent records and identifies which ones are duplicates or belong together, giving you a cleaner, more reliable dataset. Data analysts, researchers, and operations managers who work with customer lists, inventory, or sensor data would find this resource useful.
401 stars. No commits in the last 6 months.
Use this if you need to find and merge records that might have typos, different spellings, or inconsistent formats across multiple lists or within a single dataset.
Not ideal if you want a commercial, proprietary data matching solution, or a tool that requires no technical setup or coding.
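To make the matching problem concrete, here is a minimal sketch of fuzzy record matching using only Python's standard library (`difflib.SequenceMatcher`). It is not taken from any tool on the list; the helper names (`normalize`, `similarity`, `find_matches`), the sample names, and the 0.85 threshold are all illustrative assumptions.

```python
from difflib import SequenceMatcher


def normalize(value: str) -> str:
    """Lowercase and collapse whitespace so formatting differences are ignored."""
    return " ".join(value.lower().split())


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio between 0.0 and 1.0 for two normalized strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def find_matches(left, right, threshold=0.85):
    """Compare every pair and keep those scoring at or above the threshold.

    The threshold is an assumed value for illustration; real projects tune it.
    """
    matches = []
    for l_value in left:
        for r_value in right:
            score = similarity(l_value, r_value)
            if score >= threshold:
                matches.append((l_value, r_value, round(score, 2)))
    return matches


# Hypothetical sample data: two customer lists with a typo and extra whitespace.
customers_a = ["Jonathan Smith", "Maria  Garcia", "Wei Chen"]
customers_b = ["jonathon smith", "Maria Garcia", "Robert Brown"]

matches = find_matches(customers_a, customers_b)
for l_value, r_value, score in matches:
    print(f"{l_value!r} ~ {r_value!r} (score {score})")
```

Comparing every pair scales quadratically, which is one reason dedicated record linkage tools typically add blocking or indexing steps that a sketch like this leaves out.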
Stars: 401
Forks: 42
Language: not listed
License: not listed
Category: not listed
Last pushed: Feb 21, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/J535D165/data-matching-software"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
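The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions the page does not confirm: that the URL path follows a category/owner/repository pattern, and that the endpoint returns JSON. The function names `build_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming a category/owner/repo path layout."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response body.

    Assumes the body is JSON; the page does not document the response format.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


url = build_url("ml-frameworks", "J535D165", "data-matching-software")
print(url)
```

Calling `fetch_quality("ml-frameworks", "J535D165", "data-matching-software")` performs the actual network request. Without a key it counts toward the 100 requests/day limit, so cache results rather than polling.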
Higher-rated alternatives
voxel51/fiftyone
Refine high-quality datasets and visual AI models
academic/awesome-datascience
An awesome Data Science repository to learn and apply for real world problems.
sacridini/Awesome-Geospatial
Long list of geospatial tools and resources
r0f1/datascience
Curated list of Python resources for data science.
nhivp/Awesome-Embedded
A curated list of awesome embedded programming.