SPR-128K: A New Benchmark for Spatial Plausibility Reasoning with Multimodal Large Language Models
arXiv:2505.23265v2 Announce Type: replace
Abstract: The performance of image generation has been significantly improved in recent years. However, the study of image screening is rare, and its performance with Multimodal Large Language Models (MLLMs) i…