The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
arXiv:2512.19693v5 Announce Type: replace
Abstract: Deep representations across modalities are inherently intertwined. In this paper, we systematically analyze the spectral characteristics of various semantic and pixel encoders. Interestingly, our stu…