lotus-data/lotus
AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing that is as simple as writing Pandas code.
This tool helps data professionals quickly and accurately process diverse datasets, including unstructured text and images, using large language models (LLMs). You provide your raw data and a natural language instruction describing the desired transformation, and it outputs the processed data, such as filtered lists or extracted information. It's designed for data scientists, analysts, and engineers who work with complex, varied data sources.
1,561 stars. Actively maintained with 4 commits in the last 30 days.
Use this if you need to perform advanced data cleaning, classification, or extraction tasks on large and varied datasets using AI, and want a simple, Pandas-like way to express these operations.
Not ideal if your data processing needs are purely numerical or strictly structured, and do not benefit from natural language-driven AI insights.
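The workflow described above (raw data plus a natural-language instruction in, filtered rows out) can be illustrated with a minimal sketch. This is not LOTUS's API: `semantic_filter` and `keyword_judge` are hypothetical names invented for illustration, and the keyword check is only a stand-in for a real LLM call. Consult the LOTUS documentation for its actual operators and signatures.

```python
from typing import Callable

import pandas as pd


def semantic_filter(
    df: pd.DataFrame, instruction: str, judge: Callable[[str], bool]
) -> pd.DataFrame:
    """Keep the rows for which `judge` accepts the filled-in instruction.

    Hypothetical helper, not part of LOTUS. `instruction` is a template
    whose {column} placeholders are filled from each row.
    """
    prompts = [instruction.format(**row) for row in df.to_dict(orient="records")]
    mask = pd.Series([judge(p) for p in prompts], index=df.index)
    return df.loc[mask].reset_index(drop=True)


def keyword_judge(prompt: str) -> bool:
    """Stand-in for an LLM: a real system would send `prompt` to a model."""
    text = prompt.lower()
    return "refund" in text or "charged" in text


reviews = pd.DataFrame(
    {
        "review": [
            "I was charged twice and want my money back.",
            "Great battery life.",
            "The refund took three weeks to arrive.",
        ]
    }
)

result = semantic_filter(
    reviews,
    "Does this text mention a billing problem? Text: {review}",
    keyword_judge,
)
print(result)
```

Passing the judge in as a callable keeps the DataFrame logic separate from the model, so the keyword stub could be swapped for an LLM-backed function without changing the filter.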
Stars: 1,561
Forks: 139
Language: Python
License: Apache-2.0
Category:
Last pushed: Feb 19, 2026
Commits (30d): 4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/lotus-data/lotus"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
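The same request can be made from Python using only the standard library. The sketch below makes two assumptions not confirmed by this page: that the URL path follows a category/owner/repo pattern (inferred from the single curl example) and that the response body is JSON. The function names are hypothetical, and how an API key is supplied is not stated here, so the keyless form is shown.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming a category/owner/repo path layout."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Reproduces the URL used by the curl command; no network call is made here.
print(quality_url("embeddings", "lotus-data", "lotus"))
```

Calling `fetch_quality("embeddings", "lotus-data", "lotus")` would perform the actual request and count against the daily limit.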
Related tools
airweave-ai/airweave
Open-source context retrieval layer for AI agents
superduper-io/superduper
Superduper: End-to-end framework for building custom AI applications and agents.
supabase/headless-vector-search
Supabase Toolkit to perform vector similarity search on your knowledge base embeddings.
similigh/simili-bot
AI-powered GitHub issue intelligence - semantic duplicate detection, cross-repo search, and...
grumpyp/aixplora
AIxplora is an open-source tool which lets you query all kinds of files not limited to any length...