Cross-Modal-Domain Generalization Through Semantically Aligned Discrete Representations
arXiv:2605.12145v2 Announce Type: replace
Abstract: Multimodal learning seeks to integrate information across diverse sensory sources, yet current approaches struggle to balance cross-modal generalizability with modality-specific structure. Continuous…