cs.CV, cs.SE

DOne: Decoupling Structure and Rendering for High-Fidelity Design-to-Code Generation

arXiv:2604.01226v1 Announce Type: new
Abstract: While Vision Language Models (VLMs) have shown promise in Design-to-Code generation, they suffer from a “holistic bottleneck,” failing to reconcile high-level structural hierarchy with fine-grained visual …