TraversalBench: Challenging Paths to Follow for Vision Language Models
arXiv:2604.10999v1 Announce Type: new
Abstract: Vision-language models (VLMs) perform strongly on many multimodal benchmarks. However, the ability to follow complex visual paths — a task that human observers typically find straightforward — remains …