data-prompt-query/dpq
dpq is an open-source python library that makes prompt-based data transformations and feature engineering easy
This library helps data scientists and machine learning engineers easily prepare and transform text data using generative AI. It takes raw text inputs, like customer reviews or product descriptions, and produces structured outputs such as sentiment labels or extracted entities, which can then be used for analysis or model training. This is for professionals who need to quickly create new features or categorize large datasets without extensive manual labeling.
No commits in the last 6 months.
Use this if you need to rapidly categorize, extract information, or transform large volumes of text data using AI prompts, even without deep natural language processing expertise.
Not ideal if you require absolute cost or speed guarantees, or if your organization's policies restrict the use of external generative AI APIs for data processing.
Stars
25
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Apr 28, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/data-prompt-query/dpq"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NVIDIA-NeMo/Curator
Scalable data pre processing and curation toolkit for LLMs
MigoXLab/dingo
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
data-prep-kit/data-prep-kit
Open source project for data preparation for GenAI applications
TheDataStation/pneuma
LLM-Powered Data Discovery System for Tabular Data
cleanlab/cleanlab-studio
Client interface to Cleanlab Studio