cs.AI, cs.CV

Image Generators are Generalist Vision Learners

arXiv:2604.20329v2 Announce Type: replace
Abstract: Recent works show that image and video generators exhibit zero-shot visual understanding behaviors, in a way reminiscent of how LLMs develop emergent capabilities of language understanding and reason…

Scroll to Top