cs.AI, cs.CV

Are Video Reasoning Models Ready to Go Outside?

arXiv:2603.10652v2 Announce Type: replace-cross
Abstract: In real-world deployment, vision-language models often encounter disturbances such as weather, occlusion, and camera motion. Under such conditions, their understanding and reasoning degrade sub…