Is CLIP Cross-Eyed? Revealing and Mitigating Center Bias in the CLIP Family
arXiv:2604.05971v1 Announce Type: new
Abstract: Recent research has shown that contrastive vision-language models such as CLIP often lack fine-grained understanding of visual content. While a growing body of work has sought to address this limitation,…