cs.CV

Infinite Gaze Generation for Videos with Autoregressive Diffusion

arXiv:2603.24938v1
Abstract: Predicting human gaze in video is fundamental to advancing scene understanding and multimodal interaction. While traditional saliency maps provide spatial probability distributions and scanpaths offer or…