Dongsheng Yang, Yinfeng Yu, Liejun Wang

Beyond Textual Knowledge-Leveraging Multimodal Knowledge Bases for Enhancing Vision-and-Language Navigation

Dongsheng Yang, Yinfeng Yu, Liejun Wang / March 31, 2026

arXiv:2603.26859v1 Announce Type: new
Abstract: Vision-and-Language Navigation (VLN) requires an agent to navigate through complex unseen environments based on natural language instructions. However, existing methods often struggle to effectively capt…

Author name: Dongsheng Yang, Yinfeng Yu, Liejun Wang

Beyond Textual Knowledge-Leveraging Multimodal Knowledge Bases for Enhancing Vision-and-Language Navigation