Author name: Alberto G. Rodriguez Salgado

From Pixels to BFS: High Maze Accuracy Does Not Imply Visual Planning

Alberto G. Rodriguez Salgado / May 14, 2026

arXiv:2603.26839v2 Announce Type: replace-cross
Abstract: How do multimodal models solve visual spatial tasks — through genuine planning, or through brute-force search in token space? We introduce \textsc{MazeBench}, a benchmark of 110 procedurally g…

cs.CV, cs.LG

From Pixels to BFS: High Maze Accuracy Does Not Imply Visual Planning

Alberto G. Rodriguez Salgado / March 31, 2026

arXiv:2603.26839v1 Announce Type: new
Abstract: How do multimodal models solve visual spatial tasks — through genuine planning, or through brute-force search in token space? We introduce \textsc{MazeBench}, a benchmark of 110 procedurally generated m…