Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models
arXiv:2604.25636v1 Announce Type: new
Abstract: Unified multimodal models (UMMs) integrate visual understanding and generation within a single framework. For text-to-image (T2I) tasks, this unified capability allows UMMs to refine outputs after their …