lotus-data/lotus

AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that's as simple as writing Pandas code

58
/ 100
Established

This tool helps data professionals quickly and accurately process diverse datasets, including unstructured text and images, using large language models (LLMs). You provide your raw data and a natural language instruction describing the desired transformation, and it outputs the processed data, such as filtered lists or extracted information. It's designed for data scientists, analysts, and engineers who work with complex, varied data sources.

1,561 stars. Actively maintained with 4 commits in the last 30 days.

Use this if you need to perform advanced data cleaning, classification, or extraction tasks on large and varied datasets using AI, and want a simple, Pandas-like way to express these operations.

Not ideal if your data processing needs are purely numerical or strictly structured, and do not benefit from natural language-driven AI insights.

data-processing unstructured-data-analysis LLM-workflows data-science information-extraction
No Package No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

1,561

Forks

139

Language

Python

License

Apache-2.0

Last pushed

Feb 19, 2026

Commits (30d)

4

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/lotus-data/lotus"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.