cs.AI, cs.LG

SVL: Goal-Conditioned Reinforcement Learning as Survival Learning

arXiv:2604.17551v1 Announce Type: new
Abstract: Standard approaches to goal-conditioned reinforcement learning (GCRL) that rely on temporal-difference learning can be unstable and sample-inefficient due to bootstrapping. While recent work has explored…