SpecPL: Disentangling Spectral Granularity for Prompt Learning
arXiv:2605.04504v1 Announce Type: new
Abstract: Existing prompt learning for VLMs exhibits a modality asymmetry, predominantly optimizing text tokens while still relying on frozen visual encoder as holistic extractor and neglecting the spectral granul…