Federico Tavella, Amber Drinkwater, Angelo Cangelosi

Fake or Real, Can Robots Tell? Evaluating VLM Robustness to Domain Shift in Single-View Robotic Scene Understanding

Federico Tavella, Amber Drinkwater, Angelo Cangelosi / April 24, 2026

arXiv:2506.19579v3 Announce Type: replace-cross
Abstract: Robotic scene understanding increasingly relies on Vision-Language Models (VLMs) to generate natural language descriptions of the environment. In this work, we systematically evaluate single-vi…

Author name: Federico Tavella, Amber Drinkwater, Angelo Cangelosi

Fake or Real, Can Robots Tell? Evaluating VLM Robustness to Domain Shift in Single-View Robotic Scene Understanding