cs.AI

Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic

arXiv:2604.19567v1 Announce Type: new
Abstract: Reinforcement learning (RL) as post-training is crucial for enhancing the reasoning ability of large language models (LLMs) in coding and math. However, their capacity for visual semantic arithmetic, inf…