cs.AI

Diffusion-CAM: Faithful Visual Explanations for dMLLMs

arXiv:2604.11005v1 Announce Type: new
Abstract: While diffusion Multimodal Large Language Models (dMLLMs) have recently made remarkable strides in multimodal generation, the development of interpretability mechanisms has lagged behind their archit…