bViT: Investigating Single-Block Recurrence in Vision Transformers for Image Recognition
arXiv:2605.10661v1 Announce Type: cross
Abstract: Vision Transformers (ViTs) are built by stacking independently parameterized blocks, but it remains unclear how much of this depth requires layer specific transformations and how much can be realized t…