cs.CV

SIRI-Bench: Challenging VLMs’ Spatial Intelligence through Complex Reasoning Tasks

arXiv:2506.14512v4 Announce Type: replace
Abstract: Large Language Models (LLMs) have undergone rapid progress, largely attributed to reinforcement learning on complex reasoning tasks. In contrast, while spatial intelligence is fundamental for Vision-…