Early Semantic Grounding in Image Editing Models for Zero-Shot Referring Image Segmentation
arXiv:2605.13122v1 Announce Type: new
Abstract: Instruction-based image editing (IIE) models have recently demonstrated strong capability in modifying specific image regions according to natural language instructions, which implicitly requires identif…