relign-ai/relign
post train language models on multi-step reasoning with reinforcement learning
This library helps AI researchers and machine learning engineers develop and refine language models specifically for complex, multi-step reasoning tasks. It takes a pre-trained language model and applies reinforcement learning techniques to improve its ability to solve problems like advanced math or scientific queries. The output is a more capable, reasoning-enhanced language model that can tackle intricate challenges.
No commits in the last 6 months.
Use this if you are a researcher or engineer focused on advancing the reasoning capabilities of large language models through reinforcement learning.
Not ideal if you are looking for a pre-built, ready-to-deploy language model for general use, or if you do not have expertise in machine learning development.
Stars
20
Forks
2
Language
Python
License
MIT
Category
Last pushed
Mar 13, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/relign-ai/relign"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
galilai-group/stable-pretraining
Reliable, minimal and scalable library for pretraining foundation and world models
CognitiveAISystems/MAPF-GPT
[AAAI-2025] This repository contains MAPF-GPT, a deep learning-based model for solving MAPF...
UKPLab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled...
larslorch/avici
Amortized Inference for Causal Structure Learning, NeurIPS 2022
svdrecbd/mhc-mlx
MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by DeepSeek-AI.