cs.AI, cs.CV

Cognitive Alignment At No Cost: Inducing Human Attention Biases For Interpretable Vision Transformers

arXiv:2604.20027v1 Announce Type: new
Abstract: For state-of-the-art image understanding, Vision Transformers (ViTs) have become the standard architecture but their processing diverges substantially from human attentional characteristics. We investiga…