FOCUS: Optimal Control for Multi-Entity World Modeling in Text-to-Image Generation
arXiv:2510.02315v2 Announce Type: replace
Abstract: Text-to-image (T2I) models excel on single-entity prompts but struggle with multi-entity scenes, often exhibiting attribute leakage, identity entanglement, and subject omissions. We present a princip…