WangJingyao07/Awesome-GRPO
Codebase of GRPO: Implementations and Resources of GRPO and Its Variants
This repository provides a collection of code implementations and research papers for GRPO and its variations, which are advanced techniques for fine-tuning large language models (LLMs) using reinforcement learning. It helps AI researchers and practitioners improve LLM performance by offering a centralized hub for different optimization strategies. You input an LLM and training data, and the output is a more finely tuned, capable LLM.
276 stars.
Use this if you are an AI researcher or machine learning engineer looking to experiment with and apply advanced reinforcement learning techniques (specifically GRPO and its variants) to fine-tune large language models for better performance.
Not ideal if you are looking for a simple, out-of-the-box solution for general LLM usage without delving into advanced fine-tuning methodologies or if you are not familiar with reinforcement learning concepts for LLMs.
Stars
276
Forks
31
Language
Python
License
—
Category
Last pushed
Dec 06, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/WangJingyao07/Awesome-GRPO"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
agentscope-ai/Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement...
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO &...
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
hyunwoongko/nanoRLHF
nanoRLHF: from-scratch journey into how LLMs and RLHF really work.