cs.CV, cs.LG

EPIC: Efficient Predicate-Guided Inference-Time Control for Compositional Text-to-Image Generation

arXiv:2605.11722v1 Announce Type: new
Abstract: Recent text-to-image (T2I) generators can synthesize realistic images, but still struggle with compositional prompts involving multiple objects, counts, attributes, and relations. We introduce EPIC (Effi…