cs.CV

Dynamic Mode Decomposition along Depth in Vision Transformers

arXiv:2605.07556v1 Announce Type: new
Abstract: Recent work has shown that contiguous vision transformer (ViT) blocks (a) can be replaced by a linear map and (b) organize into recurrent phases of computation. We ask whether these observations coincide…