Boosting Reasoning in Large Multimodal Models via Activation Replay
arXiv:2511.19972v3 Announce Type: replace
Abstract: Recently, Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as an effective approach to incentivizing reasoning capability in Large Multimodal Models (LMMs), while the underlying mech…