SpaAct: Spatially-Activated Transition Learning with Curriculum Adaptation for Vision-Language Navigation
arXiv:2604.27620v1 Announce Type: new
Abstract: Vision-and-Language Navigation (VLN) aims to enable an embodied agent to follow natural-language instructions and navigate to a target location in unseen 3D environments. We argue that adapting VLMs to V…