cs.CV

Vision-EKIPL: External Knowledge-Infused Policy Learning for Visual Reasoning

arXiv:2506.06856v3 Announce Type: replace
Abstract: Visual reasoning is crucial for understanding complex multimodal data and advancing Artificial General Intelligence. Existing methods enhance the reasoning capability of Multimodal Large Language Mod…