Disha Singha - Provide.ai

Uncertainty-Aware Reward Discounting for Mitigating Reward Hacking

Disha Singha / April 30, 2026

arXiv:2604.26360v1 Announce Type: cross
Abstract: Reinforcement learning (RL) systems typically optimize scalar reward functions that assume precise and reliable evaluation of outcomes. However, real-world objectives–especially those derived from hum…

Author name: Disha Singha

Uncertainty-Aware Reward Discounting for Mitigating Reward Hacking