AbsaOSS/cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
This tool helps data engineers and analysts integrate legacy mainframe data with modern analytics platforms. It takes raw COBOL/EBCDIC binary files, along with their COBOL copybooks (schemas), and transforms them into queryable Spark DataFrames or streams. This allows organizations to incorporate crucial historical or operational data into their data lakes and business intelligence systems, without needing deep COBOL expertise.
159 stars.
Use this if you need to extract and analyze data stored in traditional COBOL binary files from mainframes using Apache Spark.
Not ideal if your data is already in modern, structured formats like CSV, Parquet, or JSON, or if you don't use Apache Spark for your data processing.
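As a rough illustration of the decoding work described above, here is a minimal Python sketch. It is not Cobrix's implementation and does not use its API. It assumes a hypothetical two-field copybook layout (NAME PIC X(5) followed by AMOUNT PIC S9(5)V99 COMP-3) and the cp037 EBCDIC code page; real mainframe files may use other code pages and record layouts.

```python
from decimal import Decimal


def unpack_comp3(raw: bytes, scale: int) -> Decimal:
    """Decode a packed-decimal (COMP-3) field.

    Each byte holds two 4-bit nibbles. Every nibble is a decimal digit
    except the last one, which carries the sign.
    """
    nibbles = []
    for byte in raw:
        nibbles.append(byte >> 4)
        nibbles.append(byte & 0x0F)
    sign_nibble = nibbles.pop()  # 0xB or 0xD marks a negative value
    digits = int("".join(str(n) for n in nibbles))
    value = Decimal(digits).scaleb(-scale)  # apply the implied decimal point
    return -value if sign_nibble in (0x0B, 0x0D) else value


def decode_record(record: bytes) -> dict:
    """Decode one fixed-length record using the hypothetical layout:
    bytes 0-4 -> NAME   PIC X(5)           (EBCDIC text)
    bytes 5-8 -> AMOUNT PIC S9(5)V99 COMP-3 (packed decimal, 2 decimals)
    """
    name = record[0:5].decode("cp037").rstrip()
    amount = unpack_comp3(record[5:9], scale=2)
    return {"NAME": name, "AMOUNT": amount}


# Sample record: "ALICE" in EBCDIC, then 12345.67 as packed decimal.
record = bytes.fromhex("C1D3C9C3C5" "1234567C")
row = decode_record(record)
print(row)
```

A tool like this one takes the field offsets, types, and scales from the copybook instead of hard-coding them, which is why deep COBOL expertise is not required of the user.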
Stars: 159
Forks: 89
Language: Scala
License: Apache-2.0
Category:
Last pushed: Mar 13, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/AbsaOSS/cobrix"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
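The same request can be made from Python using only the standard library. This is a sketch under assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the URL pattern is generalized from the single example above, and the response is returned as raw text because the payload format is not documented here.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example; the category/owner/repo pattern
# is an assumption generalized from that one URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repository such as "AbsaOSS/cobrix"."""
    # quote() leaves "/" unescaped by default, so "owner/repo" stays intact.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0) -> str:
    """Return the raw response body as text (UTF-8 is assumed)."""
    with urlopen(quality_url(category, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality("data-engineering", "AbsaOSS/cobrix")` issues the same request as the curl command, and each call counts against the daily request limit.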
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.