PyFi: Toward Pyramid-like Financial Image Understanding for VLMs via Adversarial Agents
arXiv:2512.14735v2 Announce Type: replace-cross
Abstract: This paper proposes PyFi, a novel framework for pyramid-like financial image understanding that enables vision language models (VLMs) to reason through question chains in a progressive, simple-…