amao0o0/awesome-AI-Math-Datasets

A collection of recent open-source math datasets for training and evaluating Math LLMs

24 / 100 (Experimental)

This collection gathers open-source mathematical datasets for training and evaluating AI models that solve math problems. The datasets draw on mathematical content such as web pages, textbooks, problem sets, and solutions, refined into a form that can be used to improve a model's ability to understand and generate mathematical reasoning. It is intended for AI researchers and developers working on advanced mathematical AI models.

Use this if you are developing or fine-tuning Large Language Models (LLMs) and need specialized mathematical datasets for pre-training, supervised fine-tuning, or evaluation.

Not ideal if you are a student looking for math practice problems, or a teacher seeking to generate curriculum content directly.

mathematical reasoning, AI model training, natural language processing, machine learning research, computational mathematics
No License, No Package, No Dependents
Maintenance 6 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 4 / 25
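
The four category scores above add up to the overall score shown at the top. The sketch below only checks that arithmetic. It assumes the overall score is the plain sum of the four categories, which is an inference from the numbers on this page and not a documented formula.

```python
# Category scores as listed above, each out of 25.
category_scores = {"Maintenance": 6, "Adoption": 6, "Maturity": 8, "Community": 4}
MAX_PER_CATEGORY = 25

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(category_scores.values())
maximum = MAX_PER_CATEGORY * len(category_scores)

print(f"{total} / {maximum}")  # prints "24 / 100"
```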


Stars: 24
Forks: 1
Language: not listed
License: none
Last pushed: Dec 08, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/amao0o0/awesome-AI-Math-Datasets"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
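
For scripted access, the same request can be made from Python with the standard library. This is a minimal sketch, not an official client: the helper names `build_url` and `fetch_quality` are hypothetical, the URL layout is copied from the curl command above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(owner: str, repo: str, category: str = "ml-frameworks") -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(owner: str, repo: str, category: str = "ml-frameworks",
                  timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the endpoint returns JSON. No response fields are assumed.
    """
    url = build_url(owner, repo, category)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("amao0o0", "awesome-AI-Math-Datasets")` would issue the request shown in the curl command. With no key, each call counts against the 100 requests/day limit, so it is worth caching the result locally.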