Recurrent Video Masked Autoencoders
arXiv:2512.13684v2
Abstract: We present Recurrent Video Masked Autoencoders (RVM), a novel approach to video representation learning that leverages recurrent computation to model the temporal structure of video data. RVM couples…