Structured and Abstractive Reasoning on Multi-modal Relational Knowledge Images
arXiv:2510.21828v2 Announce Type: replace-cross
Abstract: Understanding and reasoning with abstractive information from the visual modality presents significant challenges for current multi-modal large language models (MLLMs). Among the various forms …