cs.CL

P-Check: Advancing Personalized Reward Model via Learning to Generate Dynamic Checklist

arXiv:2601.02986v2 Announce Type: replace
Abstract: Recent approaches to personalized reward modeling have primarily focused on leveraging user interaction history to align model judgments with individual preferences. However, existing approaches larg…