StringNLPLAB/MGS
Repository for the paper "Advancing General-Purpose Reasoning Models with Modular Gradient Surgery"
This project helps AI researchers and practitioners improve Large Language Models (LLMs) by balancing multiple training objectives like mathematical reasoning, general conversation, and instruction following. It takes an existing LLM and training data from diverse sources (e.g., math problems, chat logs), and outputs a more versatile LLM capable of strong performance across these different domains. This tool is for those who are fine-tuning or training LLMs for multi-skill applications.
Use this if you need to train a single LLM that performs well across distinct tasks such as complex mathematical reasoning, general chat, and accurately following instructions, without sacrificing performance in any one area.
Not ideal if you are looking for a pre-trained, ready-to-use LLM for a single, highly specialized task, or if you don't have the technical expertise to fine-tune advanced models.
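The paper's "modular gradient surgery" is not spelled out on this page, but the general idea behind gradient-surgery methods is to detect when per-task gradients conflict (negative dot product) and project the conflicting component away before merging them into a single update. Below is a minimal PCGrad-style sketch of that idea, assuming flattened per-task gradient vectors; the function name and the projection details are illustrative, not the paper's exact algorithm.

```python
import numpy as np

def gradient_surgery(grads):
    """Merge per-task gradients, projecting out conflicting components.

    PCGrad-style sketch (hypothetical, not the paper's exact method):
    whenever task i's gradient conflicts with task j's (dot product < 0),
    remove the component of g_i along g_j before summing.
    """
    projected = [g.astype(float).copy() for g in grads]
    for i, g_i in enumerate(projected):
        for j, g_j in enumerate(grads):
            if i == j:
                continue
            dot = float(np.dot(g_i, g_j))
            if dot < 0:  # the two objectives pull in opposing directions
                # subtract g_i's projection onto g_j (in place)
                g_i -= dot / (float(np.dot(g_j, g_j)) + 1e-12) * g_j
    return sum(projected)

# Two conflicting task gradients: the merged update no longer
# opposes either task's original direction.
g_math = np.array([1.0, 0.0])
g_chat = np.array([-1.0, 1.0])
merged = gradient_surgery([g_math, g_chat])
```

With the two example gradients above, `np.dot(merged, g_math)` and `np.dot(merged, g_chat)` are both non-negative, which is the property these methods aim for: no single task's loss is pushed uphill by the combined step.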
Stars: 19
Forks: —
Language: Python
License: MIT
Category: —
Last pushed: Mar 15, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/StringNLPLAB/MGS"
Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
Higher-rated alternatives
cvs-health/uqlm
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM...
PRIME-RL/TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
sapientinc/HRM
Hierarchical Reasoning Model Official Release
tigerchen52/query_level_uncertainty
query-level uncertainty in LLMs
reasoning-survey/Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models