GithubX-F/ProxMO-RL

Proximity-based Multi-turn Optimization (ProxMO) - Official Implementation


Emerging

ProxMO helps AI developers train large language model (LLM) agents for multi-step tasks. It takes raw agent interaction data as input and produces optimized models, with the stated aim of faster learning and better performance on complex, multi-turn challenges. It is aimed at machine learning engineers and researchers building sophisticated AI agents.

Use this if you are developing LLM agents for tasks that require multiple steps and need a robust way to assign credit for actions taken across a long sequence of interactions.

Not ideal if you are working with single-turn LLM prompts or if you don't have experience with reinforcement learning for agent training.

LLM-agent-training reinforcement-learning multi-turn-AI AI-development agentic-AI

No Package, No Dependents

Maintenance 10 / 25

Adoption 8 / 25

Maturity 11 / 25

Community 5 / 25


Language: Python

License: Apache-2.0

Higher-rated alternatives

langfengQ/verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is...

sotopia-lab/sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

zhudotexe/redel

ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive...

TIGER-AI-Lab/verl-tool

A version of verl to support diverse tool use

AMAP-ML/Tree-GRPO

[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning
