cs.CV

Early Semantic Grounding in Image Editing Models for Zero-Shot Referring Image Segmentation

arXiv:2605.13122v1 Announce Type: new
Abstract: Instruction-based image editing (IIE) models have recently shown a strong ability to modify specific image regions according to natural language instructions, which implicitly requires identif…