cs.AI, cs.CL

CAMEL: Confidence-Gated Reflection for Reward Modeling

arXiv:2602.20670v2 Announce Type: replace
Abstract: Reward models play a fundamental role in aligning large language models with human preferences. Existing methods predominantly follow two paradigms: scalar discriminative preference models, which are…