360{\deg} Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method
arXiv:2603.16179v2 Announce Type: replace
Abstract: Multimodal Large Language Models (MLLMs) have shown impressive abilities in understanding and reasoning over conventional images. However, their perception of 360{\deg} images remains largely underex…