SVL: Goal-Conditioned Reinforcement Learning as Survival Learning
arXiv:2604.17551v1 Announce Type: new
Abstract: Standard approaches to goal-conditioned reinforcement learning (GCRL) that rely on temporal-difference learning can be unstable and sample-inefficient due to bootstrapping. While recent work has explored…