arashdn/dtt
DTT: A Deep Learning Framework to Transform Tabular Data for Joinability by Leveraging Large Language Models
This project helps data professionals clean and standardize messy tabular data so it can be accurately combined, or 'joined,' with other datasets. It takes a source table and a target table, applies a deep learning model to transform the source data's format, and outputs a revised source table ready for joining. It's designed for data analysts, data scientists, or database administrators who deal with inconsistent data formats.
No commits in the last 6 months.
Use this if you frequently encounter situations where you need to join multiple tables but their text-based columns (like names, addresses, or product descriptions) don't match perfectly due to variations in spelling, formatting, or abbreviations.
Not ideal if your data cleaning needs are limited to numerical data transformations or simple string operations that can be handled with standard scripting or spreadsheet functions.
Stars
10
Forks
—
Language
Python
License
—
Category
Last pushed
Jun 13, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/arashdn/dtt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.