Wanrong Zheng, Yunhao Ge, Laurent Itti

Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation

Wanrong Zheng, Yunhao Ge, Laurent Itti / April 30, 2026

arXiv:2604.26946v1 Announce Type: new
Abstract: Breakthrough progress in vision-based navigation through unknown environments has been achieved by using multimodal large language models (MLLMs). These models can plan a sequence of motions by evaluatin…

Author name: Wanrong Zheng, Yunhao Ge, Laurent Itti

Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation