cs.CL, cs.CV

Is CLIP Cross-Eyed? Revealing and Mitigating Center Bias in the CLIP Family

arXiv:2604.05971v1 Announce Type: new
Abstract: Recent research has shown that contrastive vision-language models such as CLIP often lack fine-grained understanding of visual content. While a growing body of work has sought to address this limitation,…