Are Video Reasoning Models Ready to Go Outside?
arXiv:2603.10652v2 Announce Type: replace-cross
Abstract: In real-world deployment, vision-language models often encounter disturbances such as weather, occlusion, and camera motion. Under such conditions, their understanding and reasoning degrade sub…