HKUSTDial/megatran
[VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation"
This tool helps data professionals or anyone needing to clean or reformat data by automatically generating robust code. You provide simple, natural language requests describing how you want to transform your data, and it outputs high-quality, executable code. It's designed for users who regularly handle messy or inconsistently formatted datasets and need reliable transformations.
No commits in the last 6 months.
Use this if you frequently need to transform datasets and want to automate code generation for tasks like formatting, extraction, or standardization, reducing manual coding and errors.
Not ideal if you prefer to write all your data transformation code manually or have very simple, one-off data cleaning needs that don't warrant an automated code generation approach.
Stars
11
Forks
—
Language
Python
License
MIT
Category
Last pushed
Aug 21, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/HKUSTDial/megatran"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jncraton/languagemodels
Explore large language models in 512MB of RAM
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haizelabs/verdict
Inference-time scaling for LLMs-as-a-judge.
albertan017/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models
bytedance/Sa2VA
Official Repo For Pixel-LLM Codebase