Hierarchical Vision Transformer Enhanced by Graph Convolutional Network for Image Classification
arXiv:2604.16823v1 Announce Type: new
Abstract: Vision Transformer (ViT) has brought new breakthroughs to the field of image classification by introducing the self-attention mechanism and Graph Convolutional Networks(GCN) have been proposed and succes…