WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition
arXiv:2603.09921v2 Announce Type: replace
Abstract: Open-domain visual entity recognition (VER) seeks to associate images with entities in encyclopedic knowledge bases such as Wikipedia. Recent generative methods tailored for VER demonstrate strong pe…