cs.CV

InterCoG: Towards Spatially Precise Image Editing with Interleaved Chain-of-Grounding Reasoning

arXiv:2603.01586v3 Announce Type: replace
Abstract: Emerging unified editing models have demonstrated strong capabilities in general object editing tasks. However, it remains a significant challenge to perform fine-grained editing in complex multi-ent…