TEMA: Anchor the Image, Follow the Text for Multi-Modification Composed Image Retrieval
arXiv:2604.21806v1 Announce Type: new
Abstract: Composed Image Retrieval (CIR) is an important image retrieval paradigm in which users retrieve a target image with a multimodal query consisting of a reference image and a modification text. Al…