cs.CV

PCSR: Pseudo-label Consistency-Guided Sample Refinement for Noisy Correspondence Learning

arXiv:2509.15623v2 Announce Type: replace
Abstract: Cross-modal retrieval aims to align different modalities via semantic similarity. However, existing methods often assume that image-text pairs are perfectly aligned, overlooking Noisy Correspondences…