Fangxu Yu, Ziyao Lu, Liqiang Niu, Fandong Meng, Jie Zhou

ArrowGEV: Grounding Events in Video via Learning the Arrow of Time

Fangxu Yu, Ziyao Lu, Liqiang Niu, Fandong Meng, Jie Zhou / April 17, 2026

arXiv:2601.06559v2 Announce Type: replace
Abstract: Grounding events in videos serves as a fundamental capability in video analysis. While Vision Language Models (VLMs) are increasingly employed for this task, existing approaches predominantly train m…

Author name: Fangxu Yu, Ziyao Lu, Liqiang Niu, Fandong Meng, Jie Zhou

ArrowGEV: Grounding Events in Video via Learning the Arrow of Time