jkkummerfeld/text2sql-data

A collection of datasets that pair questions with SQL queries.

Score: 50 / 100 (Established)

This collection provides carefully curated datasets for those developing systems that translate everyday questions into SQL database queries. It includes natural language sentences, their corresponding SQL queries, database schemas, and the actual database content across various domains like academic advising, flight information, and restaurant searches. This resource is for computational linguists, machine learning engineers, and researchers working on natural language understanding and database interaction.
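To make the pairing concrete, here is a minimal sketch of one question, its SQL query, a schema, and database content, using Python's built-in `sqlite3` module. The `Example` class, the `restaurant` table, its rows, and the `same_result` helper are all invented for illustration. They do not reflect the repository's actual file format or data.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class Example:
    """One question/SQL pair (hypothetical shape, not the repository's file format)."""
    question: str
    sql: str


def build_toy_database() -> sqlite3.Connection:
    """Create an in-memory database with an invented restaurant table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE restaurant (name TEXT, city TEXT, cuisine TEXT)")
    conn.executemany(
        "INSERT INTO restaurant VALUES (?, ?, ?)",
        [
            ("Luigi's", "Ann Arbor", "italian"),
            ("Noodle Bar", "Ann Arbor", "thai"),
            ("Trattoria", "Detroit", "italian"),
        ],
    )
    return conn


def same_result(conn: sqlite3.Connection, gold_sql: str, predicted_sql: str) -> bool:
    """Compare two queries by the rows they return, ignoring row order."""
    gold = sorted(conn.execute(gold_sql).fetchall())
    predicted = sorted(conn.execute(predicted_sql).fetchall())
    return gold == predicted


# Illustrative pair: the question a user might ask and the query that answers it.
EXAMPLE = Example(
    question="Which Italian restaurants are in Ann Arbor?",
    sql="SELECT name FROM restaurant "
        "WHERE city = 'Ann Arbor' AND cuisine = 'italian'",
)
```

Comparing returned rows instead of query strings is one common way to evaluate a text-to-SQL system, since differently written queries can be equivalent. It can also give false positives when two different queries happen to agree on a small database.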

585 stars. No commits in the last 6 months.

Use this if you are building or evaluating a system that allows users to ask questions in plain English and automatically get answers from a database without writing SQL.

Not ideal if you are looking for a tool to query databases directly or if your primary interest is in general natural language processing tasks not tied to database interaction.

Tags: natural-language-to-SQL, semantic-parsing, database-querying, conversational-AI, question-answering-systems
Stale (6 months), no package, no dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25


Stars: 585
Forks: 116
Language: Python
License:
Last pushed: Mar 03, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jkkummerfeld/text2sql-data"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
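The same request can be made from Python with only the standard library. The sketch below rests on two assumptions: that the URL path follows a category/owner/repository pattern (inferred from the single example above), and that the endpoint returns JSON. The function names `quality_url` and `fetch_quality` are hypothetical, and the shape of the response is not documented here, so the result is returned unparsed beyond JSON decoding.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow a category/owner/repository pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL, percent-encoding each path segment."""
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record and decode it as JSON (assumed response format)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

For example, `fetch_quality("nlp", "jkkummerfeld", "text2sql-data")` requests the same URL as the curl command. With the keyless limit of 100 requests/day, caching the result locally is a sensible precaution.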