Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning
arXiv:2603.28618v1 Announce Type: new
Abstract: Reinforcement learning with verifiable rewards (RLVR) has substantially enhanced the reasoning capabilities of multimodal large language models (MLLMs). However, existing RLVR approaches typically rely o…