Llm Reasoning Research Transformer Models

There are 57 llm reasoning research models tracked. 1 score above 70 (verified tier). The highest-rated is cvs-health/uqlm at 73/100 with 1,121 stars. 1 of the top 10 are actively maintained.

Get all 57 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-reasoning-research&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 cvs-health/uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package...

73
Verified
2 PRIME-RL/TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

52
Established
3 sapientinc/HRM

Hierarchical Reasoning Model Official Release

49
Emerging
4 tigerchen52/query_level_uncertainty

query-level uncertainty in LLMs

47
Emerging
5 reasoning-survey/Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

45
Emerging
6 HKUDS/LightReasoner

"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"

44
Emerging
7 spcl/x1

Official Implementation of "Reasoning Language Models: A Blueprint"

44
Emerging
8 hao-ai-lab/Dynasor

[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model...

44
Emerging
9 sail-sg/understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

42
Emerging
10 Eclipsess/Awesome-Efficient-Reasoning-LLMs

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large...

41
Emerging
11 TIGER-AI-Lab/Pixel-Reasoner

Pixel-Level Reasoning Model trained with RL [NeuIPS25]

40
Emerging
12 lqzxt/Time-R1

Time-R1 is a two-stage reinforcement fine-tuning framework that trains large...

39
Emerging
13 mbzuai-oryx/Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

38
Emerging
14 TIGER-AI-Lab/VL-Rethinker

The official code of "VL-Rethinker: Incentivizing Self-Reflection of...

37
Emerging
15 iiis-ai/cumulative-reasoning

[TMLR] Cumulative Reasoning With Large Language Models...

37
Emerging
16 AlexanderVNikitin/kernel-language-entropy

Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...

37
Emerging
17 yongchao98/R1-Code-Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...

36
Emerging
18 Alsace08/Chain-of-Embedding

[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...

36
Emerging
19 jqtangust/Robust-R1

🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware...

36
Emerging
20 andrewliao11/LongPerceptualThoughts

[COLM'25] The official implementation of "LongPerceptualThoughts: Distilling...

35
Emerging
21 TIGER-AI-Lab/General-Reasoner

General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]

35
Emerging
22 InternLM/OREAL

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

34
Emerging
23 rkinas/reasoning_models_how_to

This repository serves as a collection of research notes and resources on...

33
Emerging
24 Lanerra/reasoning-bank-slm

An experiment that applies Google Research's `ReasoningBank` technique to...

33
Emerging
25 SalesforceAIResearch/Elastic-Reasoning

Make reasoning models scalable

32
Emerging
26 The-Martyr/CausalMM

[ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal...

32
Emerging
27 Tebmer/Rereading-LLM-Reasoning

EMNLP 2024 "Re-reading improves reasoning in large language models". Simply...

32
Emerging
28 Qwen-Applications/CLIPO

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

31
Emerging
29 cui-shaobo/defeasibility-in-causality

exploring the defeasibility inside causality

31
Emerging
30 JunyiYe/FaultyMathProblem

From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity...

31
Emerging
31 sdpkjc/SATQuest

🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

30
Emerging
32 ulab-uiuc/Time-R1

Time-R1: Framework and resources for endowing LLMs with comprehensive...

30
Emerging
33 StringNLPLAB/MGS

Repository for the paper "Advancing General-Purpose Reasoning Models with...

30
Emerging
34 WooooDyy/LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for...

29
Experimental
35 PRIME-RL/Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

29
Experimental
36 czg1225/VeriThinker

[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient

28
Experimental
37 sparkle-reasoning/sparkle

[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs...

27
Experimental
38 Hyun-Ryu/clover

Official code for "Divide and Translate: Compositional First-Order Logic...

27
Experimental
39 sastpg/RFTT

RFTT: Reasoning with Reinforced Functional Token Tuning

25
Experimental
40 Eric2i/LLM-MindMap

EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning...

24
Experimental
41 msmrexe/neurosymbolic-vqa-program-generator

A comprehensive implementation of a Neurosymbolic framework for Visual...

21
Experimental
42 Siesher/Generator_for_reasoning

🧠 Reasoning data generator for LLM training

21
Experimental
43 safouaneelg/zeroshot-reasoning

Ollama structured output for visual zeroshot reasoning

20
Experimental
44 231sm/Eval_Multi-Step_Reasoning

Comprehensive Evaluation On Answer Calibration For Multi-Step Reasoning

19
Experimental
45 zhaochen0110/Cotempqa

Code and data for "Living in the Moment: Can Large Language Models Grasp...

19
Experimental
46 hewei2001/ReachQA

[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs

18
Experimental
47 Ruiyang-061X/Awesome-MLLM-Uncertainty

✨A curated list of papers on the uncertainty in multi-modal large language...

16
Experimental
48 sastpg/CoVo

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for...

15
Experimental
49 genglinliu/UnknownBench

Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions...

13
Experimental
50 Zhaoyi-Li21/creme

[ACL 2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"

13
Experimental
51 nourdesoukizz/Reasoning-Rationalizing

we investigate whether models can maintain correct reasoning when exposed to...

13
Experimental
52 basicv8vc/LLM-Tool-Integrated-Reasoning-TIR-Papers

A curated collection of research papers on LLM Tool-Integrated Reasoning...

13
Experimental
53 OthoXIII/theoreme-innommables

Theorem of the Unnameable [⧉/⧉ₛ] — Epistemological framework for binary...

13
Experimental
54 ParthaPRay/neuro-symbolic_abductive_reasoning_ollama_fault_diagnosis

This repo presents codes that allows user to run localized Ollama based...

13
Experimental
55 jeffasante/latent-reasoning-transformer

Implemented a recurrent-depth LLM (PyTorch) based on arXiv:2502.05171....

12
Experimental
56 YuxiangMai/RefRea

[AAAI 2026] RefRea: Reference-Guided Reasoning with Meta-Cognition for...

12
Experimental
57 hellokayas/MM-PoE

Implementation of Process of Elimination for Multiple Choice Reasoning in...

11
Experimental