amao0o0/awesome-AI-Math-Datasets
A collection of recent open-source math datasets for training and evaluating Math LLMs
This collection provides high-quality mathematical datasets for training and evaluating AI models designed to solve math problems. It takes in various forms of mathematical content like web pages, textbooks, problem sets, and solutions. The output is a refined dataset that can be used to improve an AI's ability to understand and generate mathematical reasoning. This is for AI researchers and developers working on advanced mathematical AI models.
Use this if you are developing or fine-tuning Large Language Models (LLMs) and need specialized mathematical datasets for pre-training, supervised fine-tuning, or evaluation.
Not ideal if you are a student looking for math practice problems, or a teacher seeking to generate curriculum content directly.
Stars
24
Forks
1
Language
—
License
—
Category
Last pushed
Dec 08, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/amao0o0/awesome-AI-Math-Datasets"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
safe-graph/graph-fraud-detection-papers
A curated list of Graph/Transformer-based fraud, anomaly, and outlier detection papers & resources
isLinXu/paper-list
autoupdate paper list
mala-lab/Awesome-Anomaly-Detection-Foundation-Models
A curated list of papers & resources on anomaly detection foundation models using large language...
shuolucs/Awesome-Out-Of-Distribution-Detection
[ACM CSUR 2025] Out-of-Distribution Detection: A Task-Oriented Survey of Recent Advances
xueyongfu11/awesome-deep-learning-resource
个人总结的大模型、自然语言处理NLP、多模态、计算机视觉CV等方向paper的阅读笔记;收集到或者使用到的一些NLP、CV等领域的优秀开源仓库;其他:如数据集、评测leaderboard等