iiis-ai/TemplateMath

[ICLR 2025 DATA-FM] Training and Evaluating Language Models with Template-based Data Generation (https://arxiv.org/abs/2411.18104)

25 / 100
Experimental

This project helps AI researchers and machine learning engineers train and evaluate large language models (LLMs) on mathematical reasoning. It provides a dataset of over 7.4 million synthetically generated grade-school math problems, each with a natural language explanation and a programmatically verified code solution. Researchers can use this data to train or fine-tune models and to check their answers against the verified solutions.
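To make "template-based generation with a programmatically verified code solution" concrete, here is a minimal sketch in Python. It is not taken from the TemplateMath repository: the template text, the field names (`question`, `solution_code`, `explanation`, `answer`), and the helper functions are all hypothetical and only illustrate the general idea.

```python
import random

# Hypothetical template; the wording, names, and items are illustrative only.
TEMPLATE = (
    "{name} has {a} {item}s and buys {b} more. "
    "How many {item}s does {name} have now?"
)


def generate_problem(rng):
    """Fill the template with random values and attach a code solution."""
    name = rng.choice(["Ava", "Ben", "Chen"])
    item = rng.choice(["apple", "pencil", "marble"])
    a = rng.randint(1, 50)
    b = rng.randint(1, 50)
    return {
        "question": TEMPLATE.format(name=name, a=a, b=b, item=item),
        # The code solution is stored as source text so it can be re-executed.
        "solution_code": f"result = {a} + {b}",
        "explanation": (
            f"{name} starts with {a} {item}s and gains {b} more, "
            f"so the total is {a} + {b} = {a + b}."
        ),
        "answer": a + b,
    }


def verify(problem):
    """Run the stored code solution and compare it with the recorded answer."""
    namespace = {}
    exec(problem["solution_code"], {}, namespace)
    return namespace["result"] == problem["answer"]


rng = random.Random(0)  # fixed seed so the generated sample is reproducible
sample = generate_problem(rng)
print(sample["question"])
print("verified:", verify(sample))
```

Storing the solution as executable source is what lets a generator discard any sample whose code does not reproduce the recorded answer. A real pipeline would use many templates and a sandboxed executor rather than a bare `exec`.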

Use this if you are an AI researcher or machine learning engineer looking for a large-scale, programmatically verified dataset to train or fine-tune language models on grade-school mathematical reasoning tasks.

Not ideal if you are looking for real-world, human-generated math problems or if your primary focus is on non-mathematical language tasks.

Tags: AI model training, large language models, mathematical reasoning, dataset generation, machine learning research
No License, No Package, No Dependents
Maintenance 6 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 6 / 25

Stars: 13
Forks: 1
Language: Python
License: None
Last pushed: Nov 11, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/iiis-ai/TemplateMath"

Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
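The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions that the listing does not confirm: that the path segments after `/quality/` are a category, an owner, and a repository name (inferred from the URL above), and that the endpoint returns a JSON body. The function names are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Assemble the endpoint URL; the segment meanings are an assumption."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record. Assumes the response body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("transformers", "iiis-ai", "TemplateMath"))
```

With a keyless limit of 100 requests/day, it is worth caching the result locally rather than refetching on every run.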