AI-team-UoA/pyJedAI

An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.

55
/ 100
Established

This tool helps data analysts and scientists identify and merge duplicate records across various datasets, a process known as Entity Resolution. You input structured or semi-structured data from different sources, and it outputs a cleaned, unified dataset where redundant entries are linked or resolved. It's designed for anyone dealing with inconsistent or fragmented data who needs a single, accurate view of entities like customers, products, or locations.

Available on PyPI.

Use this if you need to find and consolidate duplicate entries or link related records across multiple datasets without requiring pre-labeled examples.

Not ideal if you primarily work with Java, as a dedicated Java-based version of JedAI with a web application is also available.

data-matching record-linkage data-harmonization customer-360 data-quality
Maintenance 6 / 25
Adoption 9 / 25
Maturity 25 / 25
Community 15 / 25

How are scores calculated?

Stars

89

Forks

12

Language

Python

License

Apache-2.0

Last pushed

Nov 03, 2025

Commits (30d)

0

Dependencies

17

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/AI-team-UoA/pyJedAI"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.