cs.AI, cs.CV

Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

arXiv:2604.03980v1 Announce Type: new
Abstract: Parameter-efficient prompt learning has become the de facto standard for adapting Vision-Language Models (VLMs) to downstream tasks. Existing approaches predominantly focus on aligning text prompts with …