Danae S\'anchez Villegas, Samuel Lewis-Lim, Nikolaos Aletras, Desmond Elliott

Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models

Danae S\'anchez Villegas, Samuel Lewis-Lim, Nikolaos Aletras, Desmond Elliott / April 17, 2026

arXiv:2604.14888v1 Announce Type: cross
Abstract: Recent advances in vision language models (VLMs) offer reasoning capabilities, yet how these unfold and integrate visual and textual information remains unclear. We analyze reasoning dynamics in 18 VLM…

Author name: Danae S\'anchez Villegas, Samuel Lewis-Lim, Nikolaos Aletras, Desmond Elliott

Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models