cs.CV

ROSE: Retrieval-Oriented Segmentation Enhancement

arXiv:2604.14147v1 Announce Type: new
Abstract: Existing segmentation models based on multimodal large language models (MLLMs), such as LISA, often struggle with novel or emerging entities due to their inability to incorporate up-to-date knowledge. To…