Hierarchical Contrastive Learning for Multimodal Data
arXiv:2604.05462v1 Announce Type: new
Abstract: Multimodal representation learning is commonly built on a shared-private decomposition, treating latent information as either common to all modalities or specific to one. This binary view is often inadeq…