cs.AI, cs.LG

Towards Disentangled Preference Optimization Dynamics Beyond Likelihood Displacement

arXiv:2604.18239v1 Announce Type: new
Abstract: Preference optimization is widely used to align large language models (LLMs) with human preferences. However, many margin-based objectives suppress the chosen response along with the rejected one, a phen…
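As a rough illustration of the claim that a margin-based objective can suppress the chosen response together with the rejected one, the sketch below uses a DPO-style logistic margin loss. That choice is an assumption, since the abstract does not name a specific objective. The function name `margin_loss`, its parameters, and all log-probability values are hypothetical and chosen only to show the effect.

```python
import math


def margin_loss(logp_chosen, logp_rejected, ref_chosen=0.0, ref_rejected=0.0, beta=1.0):
    """DPO-style loss, -log(sigmoid(beta * margin)) (illustrative assumption).

    The margin is the chosen-minus-rejected gap in log-probability,
    each measured relative to a reference model.
    """
    margin = (logp_chosen - ref_chosen) - (logp_rejected - ref_rejected)
    x = beta * margin
    # Numerically stable softplus(-x), which equals -log(sigmoid(x)).
    return max(-x, 0.0) + math.log1p(math.exp(-abs(x)))


# Hypothetical sequence log-probabilities before and after an update.
# The chosen response gets LESS likely (-10 -> -11), but the rejected one
# drops faster (-12 -> -15), so the margin grows from 2 to 4.
before = margin_loss(logp_chosen=-10.0, logp_rejected=-12.0)
after = margin_loss(logp_chosen=-11.0, logp_rejected=-15.0)

print(f"loss before: {before:.4f}")
print(f"loss after:  {after:.4f}")
print("loss decreased while chosen log-prob decreased:", after < before)
```

Because the loss sees only the difference between the two log-probabilities, shifting both down by the same amount leaves it unchanged, and nothing in the objective itself prevents the chosen likelihood from falling as long as the rejected likelihood falls faster.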