ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference
arXiv:2507.10800v3 Announce Type: replace
Abstract: ViTs deliver SOTA performance, yet their fixed computational budget prevents scalable deployment across heterogeneous hardware. Recent Matryoshka-style Transformer architectures mitigate this by embe…