From Plausibility to Verifiability: Risk-Controlled Generative OCR with Vision-Language Models
arXiv:2603.19790v3 Announce Type: replace
Abstract: Modern vision-language models (VLMs) can act as generative OCR engines, yet open-ended decoding can expose rare but consequential failures. We identify a core deployment misalignment in generative OC…