Retrieving Any Relevant Moments: Benchmark and Models for Generalized Moment Retrieval
arXiv:2605.02623v1
Abstract: Video Moment Retrieval (VMR) aims to localize temporal segments in videos that correspond to a natural language query, but typically assumes only a single matching moment for each query. This assumption …