TheDataStation/pneuma

LLM-Powered Data Discovery System for Tabular Data

50
/ 100
Established

This tool helps data analysts and researchers quickly find relevant datasets within a large collection of tabular data. You provide a natural language question, and it sifts through your registered tables to return the most pertinent ones, considering both the content and descriptive context. It's designed for anyone who regularly needs to locate specific datasets for analysis or reporting.

No commits in the last 6 months. Available on PyPI.

Use this if you have many tabular datasets and frequently struggle to find the right one for a specific question or analysis.

Not ideal if you only work with a few small datasets or need a tool for data transformation and cleaning rather than discovery.

data-discovery data-management dataset-search business-intelligence research-data
Stale 6m
Maintenance 2 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 17 / 25

How are scores calculated?

Stars

24

Forks

8

Language

Python

License

MIT

Last pushed

Jul 14, 2025

Commits (30d)

0

Dependencies

15

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/TheDataStation/pneuma"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.