GithubX-F/ProxMO-RL
Proximity-based Multi-turn Optimization (ProxMO) - Official Implementation
ProxMO helps AI developers train large language model (LLM) agents for multi-step tasks more effectively. It takes in raw agent interaction data and outputs optimized models that learn faster and perform better, especially on complex, multi-turn challenges. This is ideal for machine learning engineers and researchers building sophisticated AI agents.
Use this if you are developing LLM agents for tasks that require multiple steps and need a robust way to assign credit for actions taken across a long sequence of interactions.
Not ideal if you are working with single-turn LLM prompts or if you don't have experience with reinforcement learning for agent training.
Stars
56
Forks
2
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 10, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/GithubX-F/ProxMO-RL"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
langfengQ/verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is...
sotopia-lab/sotopia
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
zhudotexe/redel
ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive...
TIGER-AI-Lab/verl-tool
A version of verl to support diverse tool use
AMAP-ML/Tree-GRPO
[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning