cs.CV

Chain of Modality: From Static Fusion to Dynamic Orchestration in Omni-MLLMs

arXiv:2604.14520v1 Announce Type: new
Abstract: Omni-modal Large Language Models (Omni-MLLMs) promise a unified integration of diverse sensory streams. However, recent evaluations reveal a critical performance paradox: unimodal baselines frequently ou…