cs.CV, cs.LG

ELSA: Exact Linear-Scan Attention for Fast and Memory-Light Vision Transformers

arXiv:2604.23798v1 Announce Type: cross
Abstract: Existing attention accelerators often trade exact softmax semantics, depend on fused Tensor Core kernels, or incur sequential depth that limits FP32 throughput on long sequences. We present \textbf{ELS…