cs.AI, cs.CV

Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning

arXiv:2605.14054v1 Announce Type: new
Abstract: Achieving robust perception-reasoning synergy is a central goal for advanced Vision-Language Models (VLMs). Recent advancements have pursued this goal via architectural designs or agentic workflows. Howe…