cs.CV

TEMA: Anchor the Image, Follow the Text for Multi-Modification Composed Image Retrieval

arXiv:2604.21806v1 Announce Type: new
Abstract: Composed Image Retrieval (CIR) is an image retrieval paradigm in which users retrieve a target image with a multimodal query consisting of a reference image and a modification text. Al…