WooooDyy/MathCritique

Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

Experimental

This project helps improve how Large Language Models (LLMs) solve complex problems, especially in mathematics, coding, and science. It takes an LLM's initial attempt at a complex problem and then uses a 'critique' model to give step-by-step feedback. This feedback helps refine the original LLM's reasoning and produce more accurate and diverse solutions. It's for researchers and developers working on enhancing AI models for advanced reasoning tasks.
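The attempt, critique, and refine loop described above can be sketched in a few lines. This is a minimal illustration, not the repository's code: `critique_and_refine`, `toy_actor`, `toy_critic`, and the `max_rounds` parameter are all hypothetical names, and the two toy functions are hard-coded stand-ins for real model calls.

```python
from typing import Callable, List, Optional

# Hypothetical type aliases: a solution is a list of reasoning steps.
Steps = List[str]
Actor = Callable[[str, Optional[str]], Steps]   # (problem, feedback) -> steps
Critic = Callable[[str, Steps], Optional[str]]  # (problem, steps) -> feedback, or None if no issue


def critique_and_refine(problem: str, actor: Actor, critic: Critic,
                        max_rounds: int = 2) -> Steps:
    """Draft a solution, then let the critic request revisions up to max_rounds times."""
    steps = actor(problem, None)  # initial attempt, no feedback yet
    for _ in range(max_rounds):
        feedback = critic(problem, steps)
        if feedback is None:      # critic found nothing to fix
            break
        steps = actor(problem, feedback)  # refine using the step-level feedback
    return steps


def toy_actor(problem: str, feedback: Optional[str]) -> Steps:
    """Stand-in for an LLM (assumption): wrong on the first try, right after feedback."""
    if feedback is None:
        return ["2 + 3 = 5", "5 * 4 = 20"]
    return ["3 * 4 = 12", "2 + 12 = 14"]


def toy_critic(problem: str, steps: Steps) -> Optional[str]:
    """Stand-in for a critique model (assumption): flags a first step that skips multiplication."""
    if "*" in problem and "*" not in steps[0]:
        return "Step 1 is wrong: multiplication should be applied before addition."
    return None


final_steps = critique_and_refine("2 + 3 * 4", toy_actor, toy_critic)
print(final_steps)
```

In a real setup both callables would wrap model inference, and the stopping rule and number of rounds would be design choices rather than the fixed values assumed here.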

No commits in the last 6 months.

Use this if you are a researcher or AI developer who wants to improve the accuracy and reasoning of your language models on challenging problems by adding systematic, step-level feedback.

Not ideal if you are a casual user looking for a plug-and-play solution for basic text generation, as this project is focused on deep model refinement for complex reasoning.

Tags: AI-research, mathematical-reasoning, LLM-fine-tuning, AI-model-evaluation, complex-problem-solving

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 3 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

ExtensityAI/symbolicai

A neurosymbolic perspective on LLMs

TIGER-AI-Lab/MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding...

deep-symbolic-mathematics/LLM-SR

[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on Scientific Equation...

microsoft/interwhen

A framework for verifiable reasoning with language models.

zhudotexe/fanoutqa

Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language...
