trustedshops-public/schema2pyarrow
Converts AsyncAPI and JSON Schema definitions to PyArrow schemas
This tool converts standard schema definitions (JSON Schema or AsyncAPI) into PyArrow schemas. It takes your existing schema files as input and outputs a PyArrow schema, which you can then use to load and validate data. Data engineers can use it to enforce consistent data structures in their pipelines.
Available on PyPI.
Use this if you need to ensure that data entering your systems strictly adheres to a predefined structure, preventing errors and improving data quality through a 'contract-first' approach.
Not ideal if your primary goal is real-time data streaming or if you are not working with PyArrow for data processing.
Stars: 12
Forks: —
Language: Python
License: MIT
Category:
Last pushed: Mar 16, 2026
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/trustedshops-public/schema2pyarrow"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.