cs.CL, cs.CV

BabelDOC: Better Layout-Preserving PDF Translation via Intermediate Representation

arXiv:2605.10845v1 Announce Type: new
Abstract: As global cross-lingual communication intensifies, language barriers in visually rich documents such as PDFs remain a practical bottleneck. Existing document translation pipelines face a tension between …