rStar-RL/LoongRL

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts (ICLR 2026 Oral)

20 / 100

Experimental

This project helps large language models (LLMs) understand and answer questions over very long texts and solve complex math problems more effectively. It takes existing LLMs and training data as input and produces models that reason better over long documents or difficult mathematical problems. It is aimed at scientists, data engineers, and machine learning researchers working on advanced LLM development.

Use this if you need to train or fine-tune large language models to excel at understanding information spread across very long documents or to solve advanced mathematical reasoning tasks with high accuracy.

Not ideal if you are looking for an out-of-the-box LLM application or if your primary need is for basic, short-context text generation rather than deep reasoning over long inputs.

large-language-models long-context-reasoning mathematical-reasoning model-fine-tuning AI-development

No License No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 5 / 25

Community 0 / 25

Stars

Forks

—

Language: Python

License: —

Higher-rated alternatives

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...

Denys88/rl_games

RL implementations

pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

yandexdataschool/Practical_RL

A course in reinforcement learning in the wild
