AI-team-UoA/pyJedAI
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
This tool helps data analysts and scientists identify and merge duplicate records across various datasets, a process known as Entity Resolution. You input structured or semi-structured data from different sources, and it outputs a cleaned, unified dataset where redundant entries are linked or resolved. It's designed for anyone dealing with inconsistent or fragmented data who needs a single, accurate view of entities like customers, products, or locations.
Available on PyPI.
Use this if you need to find and consolidate duplicate entries or link related records across multiple datasets without requiring pre-labeled examples.
Not ideal if you primarily work with Java, as a dedicated Java-based version of JedAI with a web application is also available.
Stars
89
Forks
12
Language
Python
License
Apache-2.0
Category
Last pushed
Nov 03, 2025
Commits (30d)
0
Dependencies
17
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/AI-team-UoA/pyJedAI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
pykt-team/pykt-toolkit
pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models
microsoft/archai
Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.
google-research/morph-net
Fast & Simple Resource-Constrained Learning of Deep Network Structure
IDEALLab/EngiBench
Benchmarks for automated engineering design
automl/neps
Neural Pipeline Search (NePS): Helps deep learning experts find the best neural pipeline.