Learning Invariant Modality Representation for Robust Multimodal Learning from a Causal Inference Perspective
arXiv:2604.18460v1 Announce Type: new
Abstract: Multimodal affective computing aims to predict humans’ sentiment, emotion, intention, and opinion using language, acoustic, and visual modalities. However, current models often learn spurious correlation…