cs.CV

Bias at the End of the Score

arXiv:2604.13305v1 Announce Type: new
Abstract: Reward models (RMs) are inherently non-neutral value functions designed and trained to encode specific objectives, such as human preferences or text-image alignment. RMs have become crucial components of…

Scroll to Top