cs.CV

AICA-Bench: Holistically Examining the Capabilities of VLMs in Affective Image Content Analysis

arXiv:2604.05900v1 Announce Type: new
Abstract: Vision-Language Models (VLMs) have demonstrated strong capabilities in perception, yet holistic Affective Image Content Analysis (AICA), which integrates perception, reasoning, and generation into a unif…