Which Way Does Time Flow? A Psychophysics-Grounded Evaluation for Vision-Language Models
arXiv:2510.26241v4 Announce Type: replace
Abstract: Modern vision-language models (VLMs) excel at many multimodal tasks, yet their grasp of temporal information in video remains weak and has not been adequately evaluated. We probe this gap with a dece…