Caption-Matching: A Multimodal Approach for Cross-Domain Image Retrieval
arXiv:2403.15152v3 Announce Type: replace
Abstract: Cross-Domain Image Retrieval (CDIR) is a challenging task in computer vision, aiming to match images across different visual domains such as sketches, paintings, and photographs. Existing CDIR method…