cs.CL

PaperVoyager : Building Interactive Web with Visual Language Models

arXiv:2603.22999v2 Announce Type: replace
Abstract: Recent advances in visual language models have enabled autonomous agents for complex reasoning, tool use, and document understanding. However, existing document agents mainly transform papers into st…