moj-analytical-services/etl_manager

A Python package to create a database on the platform using our MoJ data warehousing framework

44 / 100 (Emerging)

This tool helps data engineers and analysts define the structure of their datasets stored in Amazon S3 so that they can be queried with SQL through Amazon Athena. You provide descriptions of your data files (such as CSV or Parquet files) and their columns, and it sets up the necessary metadata. The result is a data catalog defined in AWS Glue that Athena can query directly.
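To make the idea of "descriptions of your data files and their columns" concrete, here is a minimal sketch of how a column description could be turned into a Glue-style table definition. It does not use etl_manager's own API; the helper `build_table_input`, the `FORMATS` mapping, the bucket name, and the column list are all hypothetical and exist only for illustration.

```python
# Illustrative sketch only: this is NOT etl_manager's API. It builds the kind of
# table definition that AWS Glue stores, starting from a plain column description.

# Hive SerDe and I/O format class names for two common file formats.
FORMATS = {
    "csv": {
        "input": "org.apache.hadoop.mapred.TextInputFormat",
        "output": "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat",
        "serde": "org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe",
        "serde_params": {"field.delim": ","},
    },
    "parquet": {
        "input": "org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat",
        "output": "org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat",
        "serde": "org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe",
        "serde_params": {},
    },
}


def build_table_input(name, location, columns, data_format="csv"):
    """Return a dict shaped like the Glue `TableInput` structure (hypothetical helper)."""
    fmt = FORMATS[data_format]
    return {
        "Name": name,
        "TableType": "EXTERNAL_TABLE",
        "Parameters": {"classification": data_format},
        "StorageDescriptor": {
            "Columns": [
                {
                    "Name": col["name"],
                    "Type": col["type"],
                    "Comment": col.get("description", ""),
                }
                for col in columns
            ],
            "Location": location,
            "InputFormat": fmt["input"],
            "OutputFormat": fmt["output"],
            "SerdeInfo": {
                "SerializationLibrary": fmt["serde"],
                "Parameters": fmt["serde_params"],
            },
        },
    }


# Example description of a Parquet dataset (bucket and columns are made up).
employees = build_table_input(
    name="employees",
    location="s3://example-bucket/database/employees/",
    columns=[
        {"name": "employee_id", "type": "int", "description": "Unique identifier"},
        {"name": "employee_name", "type": "string"},
    ],
    data_format="parquet",
)
```

A dict of this shape is what boto3's Glue client accepts as the `TableInput` argument of `create_table`; a tool like this one spares you from writing such definitions by hand.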

Use this if you need to create and manage schemas for your analytical datasets in AWS S3 and make them accessible for SQL queries using Amazon Athena, without manually configuring AWS Glue.

Not ideal if you do not use AWS S3 and Athena for your data storage and querying, or if you need robust data validation and conflict checking for column properties like patterns and enums.

data-warehousing cloud-analytics data-cataloging ETL-management data-governance
No License, No Package, No Dependents
Maintenance 13 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 17 / 25

Stars: 21
Forks: 11
Language: Python
License: None
Last pushed: Mar 16, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/moj-analytical-services/etl_manager"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
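The same request can be made from Python using only the standard library. This is a sketch under two assumptions that the page does not confirm: that the endpoint returns JSON, and that the path follows a category/owner/repository pattern, which is inferred from the single URL shown above. The function names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the path pattern is inferred from one example."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch and decode the response, assuming the API returns JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a network request, so it is left commented out):
# data = fetch_quality("data-engineering", "moj-analytical-services", "etl_manager")
# print(data)
```

Keeping URL construction separate from the network call makes the path logic easy to check without spending any of the daily request allowance.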