cs.CV, cs.LG

FIRE-CIR: Fine-grained Reasoning for Composed Fashion Image Retrieval

arXiv:2604.09114v1 Announce Type: cross
Abstract: Composed image retrieval (CIR) aims to retrieve a target image that matches a reference image after a modification described in text has been applied. While recent vision-language models (VLMs) achieve promising CIR perform…