LLM Reasoning Research Transformer Models
57 LLM reasoning research models are tracked. One scores above 70 (Verified tier). The highest-rated is cvs-health/uqlm at 73/100 with 1,121 stars. One of the top 10 is actively maintained.
Get all 57 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-reasoning-research&limit=20"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
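The same request can be made from Python. The following is a minimal sketch using only the standard library; the endpoint and query parameters are copied from the curl command above, while the function names `build_url` and `fetch_projects` are hypothetical helpers introduced for illustration. The response schema is not documented here, so the sketch returns the decoded JSON without assuming any field names. Note that the example URL uses `limit=20`; whether a larger `limit` returns all 57 projects in one call is an assumption that should be checked against the API.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers", subcategory="llm-reasoning-research", limit=20):
    """Build the request URL with the same query parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the listing and return the decoded JSON as-is.

    No assumption is made about the structure of the response.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)


# Example usage (performs a network request and counts against the daily quota):
#   data = fetch_projects()
#   print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to inspect the exact request before spending any of the 100 unauthenticated requests per day.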
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | cvs-health/uqlm<br>UQLM: Uncertainty Quantification for Language Models, is a Python package... | | Verified |
| 2 | PRIME-RL/TTRL<br>[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning | | Established |
| 3 | sapientinc/HRM<br>Hierarchical Reasoning Model Official Release | | Emerging |
| 4 | tigerchen52/query_level_uncertainty<br>query-level uncertainty in LLMs | | Emerging |
| 5 | reasoning-survey/Awesome-Reasoning-Foundation-Models<br>✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models | | Emerging |
| 6 | HKUDS/LightReasoner<br>"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?" | | Emerging |
| 7 | spcl/x1<br>Official Implementation of "Reasoning Language Models: A Blueprint" | | Emerging |
| 8 | hao-ai-lab/Dynasor<br>[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model... | | Emerging |
| 9 | sail-sg/understand-r1-zero<br>Understanding R1-Zero-Like Training: A Critical Perspective | | Emerging |
| 10 | Eclipsess/Awesome-Efficient-Reasoning-LLMs<br>[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large... | | Emerging |
| 11 | TIGER-AI-Lab/Pixel-Reasoner<br>Pixel-Level Reasoning Model trained with RL [NeuIPS25] | | Emerging |
| 12 | lqzxt/Time-R1<br>Time-R1 is a two-stage reinforcement fine-tuning framework that trains large... | | Emerging |
| 13 | mbzuai-oryx/Awesome-LLM-Post-training<br>Awesome Reasoning LLM Tutorial/Survey/Guide | | Emerging |
| 14 | TIGER-AI-Lab/VL-Rethinker<br>The official code of "VL-Rethinker: Incentivizing Self-Reflection of... | | Emerging |
| 15 | iiis-ai/cumulative-reasoning<br>[TMLR] Cumulative Reasoning With Large Language Models... | | Emerging |
| 16 | AlexanderVNikitin/kernel-language-entropy<br>Code for Fine-grained Uncertainty Quantification for LLMs from Semantic... | | Emerging |
| 17 | yongchao98/R1-Code-Interpreter<br>R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and... | | Emerging |
| 18 | Alsace08/Chain-of-Embedding<br>[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding... | | Emerging |
| 19 | jqtangust/Robust-R1<br>🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware... | | Emerging |
| 20 | andrewliao11/LongPerceptualThoughts<br>[COLM'25] The official implementation of "LongPerceptualThoughts: Distilling... | | Emerging |
| 21 | TIGER-AI-Lab/General-Reasoner<br>General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] | | Emerging |
| 22 | InternLM/OREAL<br>Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning | | Emerging |
| 23 | rkinas/reasoning_models_how_to<br>This repository serves as a collection of research notes and resources on... | | Emerging |
| 24 | Lanerra/reasoning-bank-slm<br>An experiment that applies Google Research's `ReasoningBank` technique to... | | Emerging |
| 25 | SalesforceAIResearch/Elastic-Reasoning<br>Make reasoning models scalable | | Emerging |
| 26 | The-Martyr/CausalMM<br>[ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal... | | Emerging |
| 27 | Tebmer/Rereading-LLM-Reasoning<br>EMNLP 2024 "Re-reading improves reasoning in large language models". Simply... | | Emerging |
| 28 | Qwen-Applications/CLIPO<br>CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR | | Emerging |
| 29 | cui-shaobo/defeasibility-in-causality<br>exploring the defeasibility inside causality | | Emerging |
| 30 | JunyiYe/FaultyMathProblem<br>From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity... | | Emerging |
| 31 | sdpkjc/SATQuest<br>🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs | | Emerging |
| 32 | ulab-uiuc/Time-R1<br>Time-R1: Framework and resources for endowing LLMs with comprehensive... | | Emerging |
| 33 | StringNLPLAB/MGS<br>Repository for the paper "Advancing General-Purpose Reasoning Models with... | | Emerging |
| 34 | WooooDyy/LLM-Reverse-Curriculum-RL<br>Implementation of the ICML 2024 paper "Training Large Language Models for... | | Experimental |
| 35 | PRIME-RL/Entropy-Mechanism-of-RL<br>The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. | | Experimental |
| 36 | czg1225/VeriThinker<br>[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient | | Experimental |
| 37 | sparkle-reasoning/sparkle<br>[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs... | | Experimental |
| 38 | Hyun-Ryu/clover<br>Official code for "Divide and Translate: Compositional First-Order Logic... | | Experimental |
| 39 | sastpg/RFTT<br>RFTT: Reasoning with Reinforced Functional Token Tuning | | Experimental |
| 40 | Eric2i/LLM-MindMap<br>EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning... | | Experimental |
| 41 | msmrexe/neurosymbolic-vqa-program-generator<br>A comprehensive implementation of a Neurosymbolic framework for Visual... | | Experimental |
| 42 | Siesher/Generator_for_reasoning<br>🧠 Reasoning data generator for LLM training | | Experimental |
| 43 | safouaneelg/zeroshot-reasoning<br>Ollama structured output for visual zeroshot reasoning | | Experimental |
| 44 | 231sm/Eval_Multi-Step_Reasoning<br>Comprehensive Evaluation On Answer Calibration For Multi-Step Reasoning | | Experimental |
| 45 | zhaochen0110/Cotempqa<br>Code and data for "Living in the Moment: Can Large Language Models Grasp... | | Experimental |
| 46 | hewei2001/ReachQA<br>[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs | | Experimental |
| 47 | Ruiyang-061X/Awesome-MLLM-Uncertainty<br>✨A curated list of papers on the uncertainty in multi-modal large language... | | Experimental |
| 48 | sastpg/CoVo<br>Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for... | | Experimental |
| 49 | genglinliu/UnknownBench<br>Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions... | | Experimental |
| 50 | Zhaoyi-Li21/creme<br>[ACL 2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs" | | Experimental |
| 51 | nourdesoukizz/Reasoning-Rationalizing<br>we investigate whether models can maintain correct reasoning when exposed to... | | Experimental |
| 52 | basicv8vc/LLM-Tool-Integrated-Reasoning-TIR-Papers<br>A curated collection of research papers on LLM Tool-Integrated Reasoning... | | Experimental |
| 53 | OthoXIII/theoreme-innommables<br>Theorem of the Unnameable [⧉/⧉ₛ] — Epistemological framework for binary... | | Experimental |
| 54 | ParthaPRay/neuro-symbolic_abductive_reasoning_ollama_fault_diagnosis<br>This repo presents codes that allows user to run localized Ollama based... | | Experimental |
| 55 | jeffasante/latent-reasoning-transformer<br>Implemented a recurrent-depth LLM (PyTorch) based on arXiv:2502.05171.... | | Experimental |
| 56 | YuxiangMai/RefRea<br>[AAAI 2026] RefRea: Reference-Guided Reasoning with Meta-Cognition for... | | Experimental |
| 57 | hellokayas/MM-PoE<br>Implementation of Process of Elimination for Multiple Choice Reasoning in... | | Experimental |