cs.AI, cs.CV

Compositional Image Synthesis with Inference-Time Scaling

arXiv:2510.24133v2 Announce Type: replace
Abstract: Despite their impressive realism, modern text-to-image models still struggle with compositionality, often failing to render accurate object counts, attributes, and spatial relations. To address this …