cs.AI, cs.CV

GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

arXiv:2604.04172v1 Announce Type: new
Abstract: In many science papers, “Figure 1” serves as the primary visual summary of the core research idea. These figures are visually simple yet conceptually rich, often requiring significant effort and iteratio…