Llm Reasoning Research Transformer Models

There are 57 llm reasoning research models tracked. 1 score above 70 (verified tier). The highest-rated is cvs-health/uqlm at 73/100 with 1,121 stars. 1 of the top 10 are actively maintained.

Get all 57 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-reasoning-research&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Model	Score	Tier	Stars	Language
1	cvs-health/uqlm UQLM: Uncertainty Quantification for Language Models, is a Python package...	73	Verified	1,121	Python
2	PRIME-RL/TTRL [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning	52	Established	1,014	Python
3	sapientinc/HRM Hierarchical Reasoning Model Official Release	49	Emerging	12,358	Python
4	tigerchen52/query_level_uncertainty query-level uncertainty in LLMs	47	Emerging	9	Python
5	reasoning-survey/Awesome-Reasoning-Foundation-Models ✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models	45	Emerging	652	—
6	HKUDS/LightReasoner "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"	44	Emerging	594	Python
7	spcl/x1 Official Implementation of "Reasoning Language Models: A Blueprint"	44	Emerging	94	Python
8	hao-ai-lab/Dynasor [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model...	44	Emerging	224	Python
9	sail-sg/understand-r1-zero Understanding R1-Zero-Like Training: A Critical Perspective	42	Emerging	1,224	Python
10	Eclipsess/Awesome-Efficient-Reasoning-LLMs [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large...	41	Emerging	752	—
11	TIGER-AI-Lab/Pixel-Reasoner Pixel-Level Reasoning Model trained with RL [NeuIPS25]	40	Emerging	282	Python
12	lqzxt/Time-R1 Time-R1 is a two-stage reinforcement fine-tuning framework that trains large...	39	Emerging	94	Python
13	mbzuai-oryx/Awesome-LLM-Post-training Awesome Reasoning LLM Tutorial/Survey/Guide	38	Emerging	2,321	Python
14	TIGER-AI-Lab/VL-Rethinker The official code of "VL-Rethinker: Incentivizing Self-Reflection of...	37	Emerging	184	Python
15	iiis-ai/cumulative-reasoning [TMLR] Cumulative Reasoning With Large Language Models...	37	Emerging	308	Python
16	AlexanderVNikitin/kernel-language-entropy Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...	37	Emerging	36	Python
17	yongchao98/R1-Code-Interpreter R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...	36	Emerging	31	Python
18	Alsace08/Chain-of-Embedding [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...	36	Emerging	95	Python
19	jqtangust/Robust-R1 🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware...	36	Emerging	520	Python
20	andrewliao11/LongPerceptualThoughts [COLM'25] The official implementation of "LongPerceptualThoughts: Distilling...	35	Emerging	11	Python
21	TIGER-AI-Lab/General-Reasoner General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]	35	Emerging	222	Python
22	InternLM/OREAL Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning	34	Emerging	193	Python
23	rkinas/reasoning_models_how_to This repository serves as a collection of research notes and resources on...	33	Emerging	132	Python
24	Lanerra/reasoning-bank-slm An experiment that applies Google Research's `ReasoningBank` technique to...	33	Emerging	99	Python
25	SalesforceAIResearch/Elastic-Reasoning Make reasoning models scalable	32	Emerging	47	Jupyter Notebook
26	The-Martyr/CausalMM [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal...	32	Emerging	61	Python
27	Tebmer/Rereading-LLM-Reasoning EMNLP 2024 "Re-reading improves reasoning in large language models". Simply...	32	Emerging	29	Python
28	Qwen-Applications/CLIPO CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR	31	Emerging	10	Python
29	cui-shaobo/defeasibility-in-causality exploring the defeasibility inside causality	31	Emerging	4	Python
30	JunyiYe/FaultyMathProblem From Blind Solvers to Logical Thinkers: Benchmarking LLMs’ Logical Integrity...	31	Emerging	4	—
31	sdpkjc/SATQuest 🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs	30	Emerging	5	Python
32	ulab-uiuc/Time-R1 Time-R1: Framework and resources for endowing LLMs with comprehensive...	30	Emerging	66	Python
33	StringNLPLAB/MGS Repository for the paper "Advancing General-Purpose Reasoning Models with...	30	Emerging	19	Python
34	WooooDyy/LLM-Reverse-Curriculum-RL Implementation of the ICML 2024 paper "Training Large Language Models for...	29	Experimental	116	Python
35	PRIME-RL/Entropy-Mechanism-of-RL The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.	29	Experimental	421	Python
36	czg1225/VeriThinker [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient	28	Experimental	65	Python
37	sparkle-reasoning/sparkle [NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs...	27	Experimental	16	Python
38	Hyun-Ryu/clover Official code for "Divide and Translate: Compositional First-Order Logic...	27	Experimental	27	Python
39	sastpg/RFTT RFTT: Reasoning with Reinforced Functional Token Tuning	25	Experimental	29	Python
40	Eric2i/LLM-MindMap EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning...	24	Experimental	12	Python
41	msmrexe/neurosymbolic-vqa-program-generator A comprehensive implementation of a Neurosymbolic framework for Visual...	21	Experimental	2	Python
42	Siesher/Generator_for_reasoning 🧠 Reasoning data generator for LLM training	21	Experimental	1	Jupyter Notebook
43	safouaneelg/zeroshot-reasoning Ollama structured output for visual zeroshot reasoning	20	Experimental	4	HTML
44	231sm/Eval_Multi-Step_Reasoning Comprehensive Evaluation On Answer Calibration For Multi-Step Reasoning	19	Experimental	4	Python
45	zhaochen0110/Cotempqa Code and data for "Living in the Moment: Can Large Language Models Grasp...	19	Experimental	32	Python
46	hewei2001/ReachQA [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs	18	Experimental	59	Python
47	Ruiyang-061X/Awesome-MLLM-Uncertainty ✨A curated list of papers on the uncertainty in multi-modal large language...	16	Experimental	59	—
48	sastpg/CoVo Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for...	15	Experimental	22	Python
49	genglinliu/UnknownBench Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions...	13	Experimental	14	Jupyter Notebook
50	Zhaoyi-Li21/creme [ACL 2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"	13	Experimental	13	Python
51	nourdesoukizz/Reasoning-Rationalizing we investigate whether models can maintain correct reasoning when exposed to...	13	Experimental	—	Jupyter Notebook
52	basicv8vc/LLM-Tool-Integrated-Reasoning-TIR-Papers A curated collection of research papers on LLM Tool-Integrated Reasoning...	13	Experimental	6	—
53	OthoXIII/theoreme-innommables Theorem of the Unnameable [⧉/⧉ₛ] — Epistemological framework for binary...	13	Experimental	—	Python
54	ParthaPRay/neuro-symbolic_abductive_reasoning_ollama_fault_diagnosis This repo presents codes that allows user to run localized Ollama based...	13	Experimental	—	Python
55	jeffasante/latent-reasoning-transformer Implemented a recurrent-depth LLM (PyTorch) based on arXiv:2502.05171....	12	Experimental	1	Jupyter Notebook
56	YuxiangMai/RefRea [AAAI 2026] RefRea: Reference-Guided Reasoning with Meta-Cognition for...	12	Experimental	1	—
57	hellokayas/MM-PoE Implementation of Process of Elimination for Multiple Choice Reasoning in...	11	Experimental	—	Python

Comparisons in this category

uqlm and kernel-language-entropy (73 vs 37) uqlm and query_level_uncertainty (73 vs 47)