Arya Shah, Vaibhav Tripathi, Mayank Singh, Chaklam Silpasuwanchai

Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation

Arya Shah, Vaibhav Tripathi, Mayank Singh, Chaklam Silpasuwanchai / April 16, 2026

arXiv:2604.13803v1 Announce Type: cross
Abstract: Vision-language models are increasingly deployed in high-stakes settings, yet their susceptibility to sycophantic manipulation remains poorly understood, particularly in relation to how these models re…

Author name: Arya Shah, Vaibhav Tripathi, Mayank Singh, Chaklam Silpasuwanchai

Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation