cs.CV

ORION: ORthonormal Text Encoding for Universal VLM AdaptatION

arXiv:2602.19530v2 Announce Type: replace
Abstract: Vision-language models (VLMs) have demonstrated remarkable generalization across diverse tasks, yet their performance remains constrained by the quality and geometry of the textual prototypes used to…