cs.CV

Structural Graph Probing of Vision-Language Models

arXiv:2603.27070v1 Announce Type: new
Abstract: Vision-language models (VLMs) achieve strong multimodal performance, yet how computation is organized across populations of neurons remains poorly understood. In this work, we study VLMs through the lens…