cs.CV

SpatialFusion: Endowing Unified Image Generation with Intrinsic 3D Geometric Awareness

arXiv:2604.26341v1 Announce Type: new
Abstract: Recent unified image generation models have achieved remarkable success by employing multimodal large language models (MLLMs) for semantic understanding and diffusion backbones for image generation. However, these models remain fundamenta…