Preference Alignment for LLMs through Aligning Latent Variables


zhangjf-nlp/LatentDPO


This repository contains the source code for Latent DPO (also referred to as LPO in the source code), a computation-efficient preference alignment method introduced in "Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment" (COLING 2025). Compared with LoRA-based DPO and P-Tuning-based DPO, Latent DPO reduces the additional training time required to align each new preference by 80% to 90%.
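The sketch below is not taken from this repository; it is a minimal PyTorch sketch of the standard DPO objective that all of these variants optimize, assuming per-example sequence log-probabilities have already been computed. The function name, signature, and beta value are illustrative only; Latent DPO's specific contribution, aligning a latent preference representation rather than the generation weights, is described in the paper and source code and is not reproduced here.

```python
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO objective over per-example sequence log-probabilities.

    Each argument is a 1-D tensor holding the summed token log-probs of the
    chosen / rejected response under the trainable policy or the frozen
    reference model.
    """
    policy_logratios = policy_chosen_logps - policy_rejected_logps
    ref_logratios = ref_chosen_logps - ref_rejected_logps
    # -log sigmoid(beta * (policy log-ratio - reference log-ratio))
    losses = -F.logsigmoid(beta * (policy_logratios - ref_logratios))
    return losses.mean()
```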

The repository includes all of our model-training code, covering DPO, LoRA-based DPO, P-Tuning-based DPO, and Latent DPO. Our experimental environment is based mainly on PyTorch + Transformers + DeepSpeed; the main packages are listed in requirements.txt.
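As a rough illustration of one building block that all of the above methods share, the sketch below computes a response's sequence log-probability with the Transformers API. It is not the repository's code: the function name and the prompt/response splitting are simplifying assumptions, and the actual training scripts additionally handle batching, padding, and DeepSpeed integration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def sequence_logprob(model, tokenizer, prompt, response):
    """Summed log-probability of `response` given `prompt` under `model`."""
    prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
    full_ids = tokenizer(prompt + response, return_tensors="pt").input_ids
    logits = model(full_ids).logits                     # [1, seq_len, vocab]
    logprobs = torch.log_softmax(logits[:, :-1], dim=-1)
    targets = full_ids[:, 1:]                           # next-token targets
    token_logprobs = logprobs.gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    # Keep only response tokens; assumes the prompt tokenization is a prefix
    # of the prompt+response tokenization (a simplification).
    return token_logprobs[:, prompt_ids.shape[1] - 1:].sum()

# Usage (model name is only an example):
# tok = AutoTokenizer.from_pretrained("gpt2")
# lm = AutoModelForCausalLM.from_pretrained("gpt2")
# logp = sequence_logprob(lm, tok, "Question: ...\nAnswer:", " ...")
```

For the frozen reference model, the forward pass would additionally be wrapped in torch.no_grad().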

A detailed introduction is coming soon.

Figure: overview of the Latent DPO method (method_figure).
