Meaquadddd/DPO-Shift
DPO-Shift: Shifting the Distribution of Direct Preference Optimization
This project offers a method to improve how Large Language Models (LLMs) are fine-tuned using preference data. It takes an existing SFT (Supervised Fine-Tuned) model and preference datasets, then applies a new training strategy to produce a DPO-Shifted model that generates more favored responses. This is for machine learning engineers and researchers who are building and optimizing LLMs.
No commits in the last 6 months.
Use this if you are fine-tuning an LLM with Direct Preference Optimization (DPO) and want to address the issue where the probability of the model's preferred (chosen) responses decreases during training.
Not ideal if you are looking for a ready-to-use LLM without needing to engage in the fine-tuning process, or if you are not familiar with DPO and LLM training pipelines.
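To make the use case above concrete, here is a minimal sketch of a DPO-Shift-style loss for a single preference pair. Based on our reading of the DPO-Shift paper, the method scales the rejected-response log-ratio in the standard DPO objective by a factor f(λ) ≤ 1; the function name, argument names, and the choice of a constant f(λ) = λ here are illustrative assumptions, not the repository's actual API:

```python
import math

def dpo_shift_loss(logp_w, logp_l, ref_logp_w, ref_logp_l,
                   beta=0.1, lam=0.75):
    """Sketch of a DPO-Shift-style loss for one preference pair.

    Standard DPO minimizes -log sigmoid(beta * (r_w - r_l)), where
    r = log pi_theta(y|x) - log pi_ref(y|x) is the implicit reward.
    DPO-Shift scales the rejected-response term by f(lambda) in (0, 1],
    trading some reward margin for keeping the chosen response's
    probability from collapsing during training.
    Here f(lambda) = lam is a constant (an assumption for illustration);
    lam = 1 recovers the vanilla DPO loss.
    """
    r_w = logp_w - ref_logp_w            # implicit reward, chosen response
    r_l = logp_l - ref_logp_l            # implicit reward, rejected response
    margin = beta * (r_w - lam * r_l)    # lambda shifts the rejected term
    return -math.log(1.0 / (1.0 + math.exp(-margin)))  # -log sigmoid(margin)
```

Setting `lam=1.0` reproduces standard DPO behavior, so the parameter can be tuned away from 1 only where likelihood displacement is actually observed.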
Stars
59
Forks
6
Language
Python
License
—
Category
—
Last pushed
Mar 05, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Meaquadddd/DPO-Shift"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
stair-lab/mlhp
Machine Learning from Human Preferences
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
uclaml/SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
general-preference/general-preference-model
[ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment...
sail-sg/dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards