cs.CV

VecAttention: Vector-wise Sparse Attention for Accelerating Long Context Inference

arXiv:2603.29494v1 Announce Type: new
Abstract: Long-context video understanding and generation pose a significant computational challenge for Transformer-based video models due to the quadratic complexity of self-attention. While existing sparse atte…