Decoupling Endpoint and Semantic Transition Learning for Zero-Shot Composed Image Retrieval
arXiv:2605.08389v1
Abstract: Zero-shot composed image retrieval (ZS-CIR) retrieves a target image from a reference image and a text modification without human-annotated CIR triplets. Projection-based ZS-CIR methods are attractive be…