ai-alignment-and-safety, artificial-intelligence, morality, psychology, reinforcement-learning

The Moral Ceiling of Reinforcement Learning

Seventy years ago, psychology classified reward-and-punishment reasoning as the most primitive form of moral development. It remains the most sophisticated behavioural framework in AI. The next breakthrough may come from formalising what lies beyond it…