cs.CV, cs.LG

Beyond Perception Errors: Semantic Fixation in Large Vision-Language Models

arXiv:2604.12119v1 Announce Type: new
Abstract: Large vision-language models (VLMs) often rely on familiar semantic priors, but existing evaluations do not cleanly separate perception failures from rule-mapping failures. We study this behavior as sema…