anwitars/grab
High-performance, declarative stream processor for delimited text data.
This tool helps operations engineers and data analysts clean up messy, delimited text data from logs or reports and turn it into structured JSON. You feed it raw text streams (like CSV files or command output), tell it which columns matter and what to call them, and it outputs perfectly formatted JSON records. It's designed for anyone who struggles with transforming plain text into a structured format for further analysis or integration.
Use this if you regularly process large volumes of inconsistent text data and need a reliable, high-performance way to extract specific fields and convert them into structured JSON for tools like 'jq' or other data pipelines.
Not ideal if your data is already in a structured format (like JSON, XML, or database records) or if you only need simple text searches without any data reformatting.
Stars
8
Forks
—
Language
Rust
License
—
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/anwitars/grab"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.