yongchao98/R1-Code-Interpreter
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
This project helps AI researchers and developers train Large Language Models (LLMs) to perform complex reasoning tasks by autonomously generating and executing code. It provides a framework and pre-trained models that take a reasoning or planning problem as input and output a step-by-step solution, potentially involving code execution for self-correction. The ideal user is an AI model developer or researcher looking to enhance LLMs' ability to reason and solve problems programmatically.
Use this if you are an AI researcher or developer aiming to improve an LLM's capacity for multi-step reasoning and problem-solving through code generation and execution.
Not ideal if you are an end-user without deep machine learning expertise simply looking for an off-the-shelf tool to solve a specific business problem.
Stars: 31
Forks: 4
Language: Python
License: —
Category:
Last pushed: Feb 09, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/yongchao98/R1-Code-Interpreter"
Open to everyone: 100 requests/day with no key needed. Get a free key to raise the limit to 1,000/day.
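The same request can be made from Python instead of curl. The sketch below wraps the endpoint shown above in a small helper; note that the response schema and the way an API key is attached (header name, auth scheme) are assumptions, since the page does not document them.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"

def fetch_quality(repo, api_key=None):
    """Fetch quality data for a GitHub repo (e.g. "owner/name").

    The api_key handling is hypothetical: the header name and scheme
    are not documented on this page, so check the API docs before use.
    """
    req = urllib.request.Request(f"{API_BASE}/{repo}")
    if api_key:
        # Assumed header; the real key-passing mechanism may differ.
        req.add_header("Authorization", f"Bearer {api_key}")
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)

# Example (requires network access):
# data = fetch_quality("yongchao98/R1-Code-Interpreter")
# print(data)
```

Without a key this stays within the 100 requests/day anonymous limit; pass a key to use the higher quota.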
Higher-rated alternatives
cvs-health/uqlm
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM...
PRIME-RL/TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
sapientinc/HRM
Hierarchical Reasoning Model Official Release
tigerchen52/query_level_uncertainty
query-level uncertainty in LLMs
reasoning-survey/Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models