DataKitchen/dataops-testgen
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring
This tool helps data professionals like data engineers and data analysts ensure the quality of their datasets. It automatically profiles new data, generates validation tests, and continuously monitors existing data for anomalies. The result is fewer data errors reaching end-users and clearer communication with data suppliers.
Available on PyPI.
Use this if you need to quickly identify and address data quality issues in new or refreshed datasets, saving time and improving data reliability.
Not ideal if your primary need is complex data transformation or advanced data modeling, as this tool focuses specifically on data quality testing and monitoring.
Stars
72
Forks
5
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 28, 2026
Monthly downloads
320
Commits (30d)
0
Dependencies
50
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/DataKitchen/dataops-testgen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.