sodadata/soda-core

Data Contracts engine for the modern data stack. https://www.soda.io

70
/ 100
Verified

This tool helps data professionals ensure the accuracy and reliability of their datasets. It allows you to define "data contracts" in a human-readable YAML format, specifying expected schema and data quality rules for tables in your data warehouse. You input these contract definitions and connect to your data sources (like Snowflake or BigQuery), and the tool automatically verifies if your actual data adheres to these quality standards, alerting you to any discrepancies.

2,310 stars. Actively maintained with 31 commits in the last 30 days.

Use this if you need a systematic way to define and automatically validate the quality and structure of data moving through your data pipelines or residing in your data warehouses.

Not ideal if you are looking for a visual, no-code interface for data quality monitoring, as this tool primarily uses YAML configurations and a command-line interface.

data-quality data-governance data-pipeline-validation data-warehouse-management data-reliability
No Package No Dependents
Maintenance 23 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25

How are scores calculated?

Stars

2,310

Forks

259

Language

Python

License

Last pushed

Mar 18, 2026

Commits (30d)

31

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/sodadata/soda-core"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.