InternLM/OREAL

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

/ 100

Emerging

This project offers models specifically trained to solve complex mathematical reasoning problems. It takes mathematical problems as input and generates detailed, step-by-step solutions that lead to a correct final answer. Researchers and developers working on advanced AI systems that require strong mathematical problem-solving capabilities would use this.

193 stars. No commits in the last 6 months.

Use this if you are a researcher or AI developer aiming to enhance large language models' ability to accurately solve challenging math problems through advanced reinforcement learning techniques.

Not ideal if you are a non-technical user looking for a simple math problem solver or a developer without significant GPU resources and expertise in training large language models.

mathematical-reasoning AI-research large-language-models deep-learning problem-solving

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 8 / 25

How are scores calculated?

Stars

193

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

cvs-health/uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM...

PRIME-RL/TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

sapientinc/HRM

Hierarchical Reasoning Model Official Release

tigerchen52/query_level_uncertainty

query-level uncertainty in LLMs

reasoning-survey/Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

Explore Transformer Models

All categories Trending Transformer directory Insights