cs.CL, cs.CV

Open-Source Image Editing Models Are Zero-Shot Vision Learners

arXiv:2605.04566v1 Announce Type: new
Abstract: Recent studies have shown that large generative models can solve vision tasks they were not explicitly trained for. However, existing evidence relies on closed-source models~(Veo~3, Nano Banana Pro) or r…