JianxXiong/AAPO

Implementation of AAPO (Arxiv: 2505.14264v2) paper

/ 100

Experimental

This project offers an advanced reinforcement learning algorithm (AAPO) designed to significantly improve how large language models (LLMs) solve complex mathematical problems. It takes an existing LLM and training data for mathematical reasoning tasks, and outputs a fine-tuned LLM with enhanced accuracy and problem-solving capabilities. Researchers and practitioners working on AI models that require strong logical and mathematical abilities would find this beneficial.

Use this if you need to train or fine-tune large language models to achieve superior performance on mathematical reasoning benchmarks.

Not ideal if you are looking for a pre-trained, ready-to-use LLM without needing to engage in the training and evaluation process.

AI model training mathematical reasoning large language models machine learning research AI development

No Package No Dependents

Maintenance 6 / 25

Adoption 6 / 25

Maturity 15 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started...

arcee-ai/mergekit

Tools for merging pretrained large language models.

changyeyu/LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）

mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM

Explore Transformer Models

All categories Trending Transformer directory Insights