cs.CV

TextGround4M: A Prompt-Aligned Dataset for Layout-Aware Text Rendering

arXiv:2604.24459v1 Announce Type: new
Abstract: Despite recent advances in text-to-image generation, models still struggle to accurately render prompt-specified text with correct spatial layout — especially in multi-span, structured settings. This ch…