Why MLLMs Struggle to Determine Object Orientations
arXiv:2604.13321v1 Announce Type: new
Abstract: Multimodal Large Language Models (MLLMs) struggle with tasks that require reasoning about 2D object orientation in images, as documented in prior work. Tong et al. and Nichols et al. hypothesize that the…