sqlflash/Awesome-Text2SQL-Dataset

a collection for SQL dataset, from Text2SQL.

29
/ 100
Experimental

This collection of datasets is for anyone building or evaluating systems that turn everyday questions into database queries (Text-to-SQL). It provides various datasets, from financial payments to geospatial queries, that help train and test how well a system can understand natural language and translate it into the correct SQL. Data scientists, machine learning engineers, and researchers can use this to improve systems that allow people to interact with databases using plain language.

Use this if you are developing or testing models that convert natural language questions into SQL queries, especially for specialized domains or complex interactions.

Not ideal if you are a business user looking for a ready-to-use tool to query your database with natural language, as this is a resource for building such tools.

natural-language-processing database-interaction machine-learning-datasets query-generation AI-model-evaluation
No License No Package No Dependents
Maintenance 10 / 25
Adoption 5 / 25
Maturity 7 / 25
Community 7 / 25

How are scores calculated?

Stars

10

Forks

1

Language

License

Last pushed

Feb 24, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/sqlflash/Awesome-Text2SQL-Dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.