Efficient Spatial-Temporal Focal Adapter with SSM for Temporal Action Detection
arXiv:2604.09164v1 Announce Type: new
Abstract: Temporal human action detection, a pivotal task in video understanding, aims to identify and localize action segments within untrimmed videos. Despite the progress achieved by prior architectur…