FigAgent: Towards Automatic Method Illustration Figure Generation for AI Scientific Papers

arXiv:2603.29590v1 (announce type: cross)

Abstract: Method illustration figures (MIFs) play a crucial role in conveying the core ideas of scientific papers, yet creating them remains labor-intensive. In this paper, we identify three key characteristics that substantially influence MIF generation quality: compositional complexity, component similarity, and design dynamics. To handle these characteristics, we take inspiration from human authors' drawing practices and propose FigAgent, a novel multi-agent framework for automatically generating high-quality MIFs. Through multi-agent collaboration, FigAgent distills drawing experience across similar MIF components and encapsulates it into reusable tools that can be invoked during MIF generation, while evolving these tools to adapt to dynamic design requirements. In addition, a novel Explore-and-Select drawing strategy mimics human trial-and-error to gradually construct MIFs with complex structures. Extensive experiments show the efficacy of our method. The project page is available at https://zhuolingli.github.io/FigAgent-page-project/.
