Henrik Marklund, Alex Infanger, Benjamin Van Roy

Consequentialist Objectives and Catastrophe

April 27, 2026

arXiv:2603.15017v3
Abstract: Because human preferences are too complex to codify, AIs operate with misspecified objectives. Optimizing such objectives often produces undesirable outcomes; this phenomenon is known as reward hacki…
