Ali Salamatian, Anthony Fuller, Pritam Sarkar, James R. Green, Leonid Sigal, Evan Shelhamer

LookWhen? Fast Video Recognition by Learning When, Where, and What to Compute

May 11, 2026

arXiv:2605.06809v1
Abstract: Transformers dominate video recognition. They split videos into tokens, and processing them has expensive superlinear computational cost. Yet videos are filled with redundancy, so we can question the nee…
