Aaron Broukhim, Nadir Weibel, Eshin Jolly

Same Words, Different Judgments: How Preferences Vary Across Modalities

Aaron Broukhim, Nadir Weibel, Eshin Jolly / May 8, 2026

arXiv:2602.22710v2 Announce Type: replace-cross
Abstract: Preference-based reinforcement learning (PbRL) is the dominant framework for aligning AI systems to human preferences. However, evaluation protocols for such data were designed for text and hav…

Author name: Aaron Broukhim, Nadir Weibel, Eshin Jolly

Same Words, Different Judgments: How Preferences Vary Across Modalities