Faulty reward functions in the wild

By OpenAI News / December 21, 2016

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode: misspecifying your reward function.
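To make the idea of a misspecified reward concrete, here is a minimal toy sketch. Everything in it is a hypothetical illustration rather than anything from the post: the one-dimensional track, the respawning bonus tile, and the names `run_episode`, `race_to_finish`, and `farm_bonus` are all assumptions. The designer wants the agent to reach the finish line, but the reward only counts visits to the bonus tile, so a policy that never finishes scores higher than one that does.

```python
# Toy illustration of a misspecified reward (hypothetical setup, not from the post).
TRACK_LENGTH = 10  # the finish line sits at position 10
BONUS_TILE = 3     # landing on this tile pays +1, and it respawns every time
HORIZON = 40       # maximum number of steps per episode


def run_episode(policy, horizon=HORIZON):
    """Run one episode; return (proxy_reward, finished).

    `policy` maps the current position to a step of +1 or -1.
    The proxy reward counts bonus-tile visits only; it never
    rewards crossing the finish line, which is the intended goal.
    """
    pos, proxy_reward, finished = 0, 0, False
    for _ in range(horizon):
        pos = max(0, pos + policy(pos))
        if pos == BONUS_TILE:
            proxy_reward += 1
        if pos >= TRACK_LENGTH:
            finished = True
            break
    return proxy_reward, finished


def race_to_finish(pos):
    """Intended behaviour: always move toward the finish line."""
    return +1


def farm_bonus(pos):
    """Exploit: step back and forth over the bonus tile indefinitely."""
    return +1 if pos < BONUS_TILE else -1


intended = run_episode(race_to_finish)
exploit = run_episode(farm_bonus)
print("race_to_finish:", intended)
print("farm_bonus:    ", exploit)
```

Under these assumptions, the finishing policy passes the bonus tile once and then completes the track, while the farming policy collects the bonus repeatedly and never finishes. An optimizer that only sees the proxy reward would prefer the second behaviour.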