DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation
arXiv:2603.13133v3 Announce Type: replace
Abstract: Vision-and-Language Navigation (VLN) requires agents to follow long-horizon instructions and navigate complex 3D environments. However, existing approaches face two major challenges: constructing an …