cs.AI, cs.LG

Meta-Aligner: Bidirectional Preference-Policy Optimization for Multi-Objective LLMs Alignment

arXiv:2604.24178v1 Announce Type: cross
Abstract: Multi-Objective Alignment aims to align Large Language Models (LLMs) with diverse and often conflicting human values by optimizing multiple objectives simultaneously. Existing methods predominantly rel…