$\text{PKS}^4$: Parallel Kinematic Selective State Space Scanners for Efficient Video Understanding
arXiv:2604.26461v1 Announce Type: new
Abstract: Temporal modeling remains a fundamental challenge in video understanding, particularly as sequence lengths scale. Traditional video models relying on dense spatiotemporal attention suffer from quadratic …