cs.RO

ImagineNav++: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination

arXiv:2512.17435v3 Announce Type: replace
Abstract: Visual navigation is a fundamental capability for autonomous home-assistance robots, enabling long-horizon tasks such as object search. While recent methods have leveraged Large Language Models (LLMs…