cs.CV

Representative Attention For Vision Transformers

arXiv:2605.14913v1 Announce Type: new
Abstract: Linear attention has emerged as a promising direction for scaling Vision Transformers beyond the quadratic cost of dense self-attention. A prevalent strategy is to compress spatial tokens into a compact …